OpenAI and Google Surpass Human Mathletes, But Remain Neck and Neck

OpenAI and Google outperform top human mathletes, but neither pulls ahead—highlighting fierce competition in AI mathematical reasoning.

July 22, 2025 10:00 AM

2 MIN TO READ

Eva Robinson

OpenAI and Google Surpass Human Mathletes, But Remain Neck and Neck

Image Credits:

Unsplash

I models from OpenAI and Google DeepMind achieved gold-medal scores in the 2025 International Math Olympiad, also known as IMO, which is known

for the world’s one of the oldest and most challenging high-school level math competitions, the companies independently announced in recent days.

The results from the underscore just how fast AI systems are developing and advancing, and yet how evenly matched Google and OpenAI seem to be in the AI race. AI companies are also competing fiercely for the public perception of being ahead in the AI race, marking an intangible battle of actions that could trigger big implications for securing top AI talent.

A vast majority of those AI researchers come from backgrounds in competitive math, so benchmarks such as IMO mean more than others

Last year, Google got a silver medal at IMO using a “formal” system, meaning it required humans to translate problems into a machine-readable format. For this year, OpenAI and Google entered “informal” systems into the competition, which were able to ingest questions and generate proof-based answers in natural language. The companies are saying that their AI models correctly responded to four out of six questions on IMO’s test, scoring higher compared to most high school students and Google’s AI model from last year, without needing any human-machine translation.

‍TechCrunch conducted an interview with both research teams from OpenAI and Google’s IMO efforts, claiming that those efforts put in winning the gold-medal performances represent breakthroughs around AI reasoning models in non-verifiable domains.

More so, shortly after OpenAI announced its feat on Saturday morning, Google DeepMind’s CEO and researchers took to social media to slam OpenAI for announcing its gold medal prematurely – shortly after IMO announced which high schoolers had won, the competition on Friday night – and for not having theri medal’s test officially evaluated by IMO.

We achieved this year’s impressive result using an advanced version of Gemini Deep Think (an enhanced reasoning mode for complex problems). Our model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions –…
— Demis Hassabis (@demishassabis) July 21, 2025

Luong also said that Google has been working with IMO’s organizers since last year in preparation for the test and wanted to have the IMO president’s blessing and official grinding before announcing its official results, which it did on Monday morning.

“The IMO organizers have their grading guideline,” Luong said. “So any evaluation that’s not based on that guideline could not make any claim about gold-medal level [performance].”

Become a member to unlock this article
and everything we write.

This post is part of our member-only content. It’s just one of the many stories waiting for you inside.

By joining, you’ll get:

Full access to all exclusive, member-only articles

A distraction-free, ad-free reading experience

Support the authors and ideas you care about

Early access to upcoming content and features

We achieved this year’s impressive result using an advanced version of Gemini Deep Think (an enhanced reasoning mode for complex problems). Our model operated end-to-end in natural language, producing rigorous mathematical proofs directly from the official problem descriptions –…
— Demis Hassabis (@demishassabis) July 21, 2025

“The IMO organizers have their grading guideline,” Luong said. “So any evaluation that’s not based on that guideline could not make any claim about gold-medal level [performance].”