Click here - to use the wp menu builder

OpenAI’s DeepResearch can complete 26% of ‘Humanity’s Last Exam’ — a benchmark for the frontier of human knowledge

February 12, 2025

[
OpenAI’s o1 and DeepSeek’s R1 models, which previously sat atop the leaderboard, could only get through roughly 9% of the exam. Read More
https://fortune.com/img-assets/wp-content/uploads/2025/02/GettyImages-2198379368-e1739310956573.jpg?resize=1200,600
https://fortune.com/2025/02/12/openai-deepresearch-humanity-last-exam/

Greg McKenna

Montather Rassoul https://thefifthskill.com/

Latest articles

Russia says it rescues all 139 fishermen stranded on ice floe in Western Pacific sea

SoftBank posts third-quarter loss of $2.4 billion

I cover AI for a living — these are the 5 things I’d check before buying an AI PC during Amazon Prime Day

I’m making my fellow commuters jealous with the Shark ChillPill personal fan this week — and you can get your own for 20% off...

‘Ask people if they want to be cared for by a robot, and most say no’: People are warming up to robots at work...

Cooling just became the most strategic choice in AI infrastructure

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Netflix’s Best ‘Gilmore Girls’ Replacement Is Taking Over the World

Pokémon Winds and Waves Starter Creatures May Debut First in Pokémon Go, Fans Believe

‘Frozen’ Animator Lino DiSalvo Goes Behind Scenes Of Twisted – Annecy

Stellan Skarsgard Will Star in Karlovy Vary Film Festival 2026 Trailer

Iran’s UN ambassador cites good progress in peace talks, but denies US commodity purchase claims

Technology Innovation Institute: AI agents need proof, not promises

Madhusudan Kela-backed firm picks stake in SME stock Yash Highvoltage via preferential issue

Jefferies initiates FedEx Freight stock with buy rating, $200 target

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays

OpenAI’s DeepResearch can complete 26% of ‘Humanity’s Last Exam’ — a benchmark for the frontier of human knowledge

Zelda: Ocarina Of Time Remake Demo Lets You Explore Iconic Location

Iran’s UN ambassador cites good progress in peace talks, but denies US commodity purchase claims

Netflix’s Best ‘Gilmore Girls’ Replacement Is Taking Over the World

Technology Innovation Institute: AI agents need proof, not promises