National Cyber Warfare Foundation (NCWF)

OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthB


0 user ratings
2025-08-07 19:06:23
milo
Developers

Carl Franzen / VentureBeat:

OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthBench Hard  —  After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …




Carl Franzen / VentureBeat:

OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthBench Hard  —  After literally years of hype and speculation, OpenAI has officially launched a new lineup of large language models (LLMs) …



Source: TechMeme
Source Link: http://www.techmeme.com/250807/p36#a250807p36


Comments
new comment
Nobody has commented yet. Will you be the first?
 
Forum
Developers



Copyright 2012 through 2025 - National Cyber Warfare Foundation - All rights reserved worldwide.