News

PM EDT Now that the OpenAI livestream has ended, this article has been updated with the latest information about the o3 and ...
Tech giant's newest artificial intelligence models outperform predecessors, slash costs, and confuse everyone with their names ...
A discrepancy between first- and third-party benchmark results for OpenAI's o3 AI model is raising questions about the ...
The FrontierMath benchmark from Epoch AI tests generative models on difficult math problems. Find out how OpenAI’s o3 and ...
A discrepancy between first- and third-party benchmark results for OpenAI’s o3 AI model is raising questions about the company’s transparency and model testing practices. When OpenAI unveiled ...
By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1.
Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...