An AI system has for the first time solved a problem from FrontierMath, a benchmark consisting of real research problems that mathematicians have failed to solve. Multiple AI models have now demonstrated the ability to solve the problem, including GPT-5.4 Pro, Gemini 3.1 Pro, and Claude Opus 4.6.
Since October, AI tools have helped move about 100 of Paul Erdős' mathematical problems into the "solved" category. Large language models function as powerful research assistants that can find and combine existing mathematical results in new ways.