Very Difficult Mathematical Proofs

AI scores a ‘C-’ on its hardest math test yet

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...

The Cool Down on MSN

More than 150 math experts tell governments not to trust AI hype after headline-making proof claims

"Current automated techniques can produce plausible but unreliable (or even incorrect) arguments which are difficult to ...

Hosted on MSN

Google’s AlphaProof Can Work on Mathematical Proofs Once Thought Beyond Machines

In 2024, an AI entered the fray of the International Mathematical Olympiad (IMO). Google’s AlphaProof is part of the same Alpha group that also created AlphaFold and AlphaGo. It solved problems that ...

Science News

AI cracked an Erdős math problem. Now experts want guardrails

The result is correct but challenges core norms of mathematics: checking proofs, crediting ideas and keeping research open to ...

Tech Times

AI Math Proof Milestone: DeepMind Cracks 9 Erdős Problems, Magnetar Confirmed

AI math proof verification reached a new frontier as DeepMind’s AlphaProof Nexus solved nine open Erdős research problems with Lean-verified proofs, some unsolved for 56 years. The May 2026 Science Ne ...

Phys.org

ChatGPT can provide original mathematical proofs, researchers show

VUB's Data Analytics Lab has published new results showing that it is possible to develop original mathematical proofs using commercial language models. In a paper posted to the arXiv preprint server, ...

New Declaration Warns AI Could Threaten the Foundations of Mathematics: 130+ Top Mathematicians Fight Back

Mathematicians warn AI could undermine scientific trust with new declaration calling for transparency and human oversight in ...

An OpenAI Model ‘Disproved’ a Famous Math Conjecture. This Mathematician Couldn’t Leave It Alone

Mathematician Will Sawin discusses his experience reviewing and refining a mathematical proof devised by OpenAI's internal ...

Popular Mechanics

Machines Are on the Verge of Tackling Fermat’s Last Theorem—a Proof That Once Defied Them

A mathematician will turn a groundbreaking 100-page proof into computer code. The proof tool, Lean, lets users turn proofs written in prose into rules and logic for testing. Kevin Buzzard already uses ...

Science News

Math long resisted a digital disruption. AI is poised to change that

Mathematician Kevin Buzzard of Imperial College London is training computers how to prove one of the most famous problems in math history: Fermat’s last theorem. Resolving the problem isn’t the point.

Forbes

AI LLMs Astonishingly Bad At Doing Proofs And Disturbingly Using Blarney In Their Answers

Forbes contributors publish independent expert analyses and insights. Dr. Lance B. Eliot is a world-renowned AI scientist and consultant. In today’s column, I examine an insightful AI research study ...

Hosted on MSN

'Proof by intimidation': AI is confidently solving 'impossible' math problems. But can it convince the world's top mathematicians?

At a secret meeting in 2025, some of the world's leading mathematicians gathered to test OpenAI's newest large language model, o4-mini. Experts at the meeting were amazed by how much the model's ...
