News

OpenAI delivered advanced ChatGPT reasoning models this month that are more capable than o1, but they also hallucinate more.
OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate ...
By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1.
The o1 model series also may be significantly more manipulative than GPT-4o. According to OpenAI’s tests using an open-source test evaluation called MakeMePay, o1 was approximately 20% more ...
OpenAI unleashed a flurry of new ChatGPT variants over the week, each featuring interesting new features and very confusing ...
You would think that the number of hallucinations would decrease over time, but according to internal tests from Open AI, the ...
OpenAI's reasoning AI models are getting better, but their hallucinating isn't, according to benchmark results.
"How about we fix our model naming by this summer and everyone gets a few more months to make fun of us," OpenAI's Sam Altman ...
OpenAI's o3 and o4-mini models introduce breakthrough image reasoning for enhanced performance in reasoning, visual, and ...
Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...