
Maxwell Zeff / TechCrunch:
The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% — The Arc Prize Foundation, a nonprofit co-founded by prominent AI researcher François Chollet, announced in a blog post …

Maxwell Zeff / TechCrunch:
The Arc Prize Foundation says its new ARC-AGI-2 test stumps most AI models; humans get 60% of the questions right but GPT-4.5 and Claude 3.7 Sonnet score ~1% — The Arc Prize Foundation, a nonprofit co-founded by prominent AI researcher François Chollet, announced in a blog post …
Source: TechMeme
Source Link: http://www.techmeme.com/250325/p28#a250325p28