OpenAI’s GPT-5.1-Codex-Max runs 24-hour workflows, handles multifile refactors, reaches 80% accuracy, and uses 30% fewer ...
OpenAI announced on Friday it’s launching a research preview of Codex, the company’s most capable AI coding agent yet. Codex is powered by codex-1, a version of the company’s o3 AI reasoning model ...
Max, a new coding model designed for detailed and long-running software development tasks. Here is an overview of the model ...
If you’ve been watching the AI world this week, you probably noticed something interesting: OpenAI dropped GPT-5.1 Codex Max ...
The SWE-Bench Verified evaluation tests coding accuracy: it measures how well an AI model solves a set of coding problems. According to OpenAI, GPT-5.1-Codex-Max "reaches the same ...
We’ve been expecting it for a while, and now it’s here: OpenAI has introduced an agentic coding tool called Codex in research preview. The tool is meant to allow experienced developers to delegate ...
While GPT‑5.1-Codex-Max does not meet OpenAI’s “High” capability threshold for cybersecurity under its Preparedness Framework ...
OpenAI today debuted a new artificial intelligence agent, Codex, that can help developers write code and fix bugs. The tool is available through a sidebar in ChatGPT’s interface. One button in the ...