OpenAI’s GPT 5.1 Codex Max runs 24-hour workflows, handles multifile refactors, reaches 80% accuracy, and uses 30% fewer ...
OpenAI announced on Friday it’s launching a research preview of Codex, the company’s most capable AI coding agent yet. Codex is powered by codex-1, a version of the company’s o3 AI reasoning model ...
Max, a new coding model designed for detailed and long-running software development tasks. Here is an overview of the model ...
If you’ve been watching the AI world this week, you probably noticed something interesting: OpenAI dropped GPT-5.1 Codex Max ...
We’ve been expecting it for a while, and now it’s here: OpenAI has introduced an agentic coding tool called Codex in research preview. The tool is meant to allow experienced developers to delegate ...
While GPT-5.1-Codex-Max does not meet OpenAI’s “High” capability threshold for cybersecurity under its Preparedness Framework ...
The SWE-Bench Verified evaluation is basically a test of AI processing accuracy. It measures how well the AI solves a set of coding problems. According to OpenAI, GPT-5.1-Codex-Max "reaches the same ...
OpenAI today debuted a new artificial intelligence agent, Codex, that can help developers write code and fix bugs. The tool is available through a sidebar in ChatGPT’s interface. One button in the ...