News

When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards ...
Gray Swan believes its human red teaming events are great for pushing AI systems to respond to real-life scenarios, and has just announced a new competition that features OpenAI’s o1.
OpenAI’s o3 model has been tasked with playing Pokemon Red and achieving the ultimate goal of defeating the Elite Four to become Kanto’s champion.
It’s possible that o3 attempts to deceive at an even higher rate than its predecessor; we’ll find out once OpenAI’s red-team partners release their testing results. For what it’s worth ...
OpenAI's text-to-video generator, Sora, was leaked online by digital artists protesting their use as beta testers for what ...
It sounds like OpenAI's latest AI is showing signs of a ... according to a new report published by red teaming organization Apollo Research. "When o1 was led to believe that it would be shut ...
The team labeled it a “low-key research preview ... For its part, Google dubbed the chatbot a “code red” threat. OpenAI would wind up surpassing its most ambitious 1-million-user ...