News

A new benchmark tests how sycophantic LLMs become, and it found that GPT-4o was the most sycophantic of the models tested.
Operator is one of several agentic tools created by AI firms as they race to build agents capable of reliably performing ...
OpenAI is updating the AI model powering Operator, its AI agent that can autonomously browse the web and use certain software ...
OpenAI upgraded Operator, its AI agent that uses the web to perform tasks, to a model based on o3 after previously using a ...
Support for remote Model Context Protocol servers, integration of image generation and Code Interpreter tools, and upgrades ...
ChatGPT, OpenAI ... that “GPT-4.1 is not a frontier model, so there won’t be a separate system card released for it.”
OpenAI’s o3 AI model scored lower than expected on a benchmark ...