News

OpenAI delivered advanced ChatGPT reasoning models this month that are more capable than o1, but they also hallucinate more.
OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate ...
OpenAI's reasoning AI models are getting better, but their hallucination problem isn't, according to benchmark results.
The o1 model series may also be significantly more manipulative than GPT-4o. According to OpenAI’s tests using an open-source evaluation called MakeMePay, o1 was approximately 20% more ...
You would think that the number of hallucinations would decrease over time, but according to internal tests from OpenAI, the ...
OpenAI unleashed a flurry of new ChatGPT variants over the week, each offering interesting new features and very confusing ...
Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...
On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities ...
AI models are numerous and confusing to navigate, and the benchmarks used to measure their performance are challenging too.
OpenAI's o3 and o4-mini models introduce breakthrough image reasoning for enhanced performance in reasoning, visual, and ...
“Specifically, o3 tends to make more claims overall, leading to more accurate claims as well as more inaccurate/hallucinated claims,” according to OpenAI, as reported by TechCrunch. [Link: OpenAI's ...
"How about we fix our model naming by this summer and everyone gets a few more months to make fun of us," OpenAI's Sam Altman ...