End Behavior Models - Search News

OpenAI’s evolving Model Spec aims to guide the behavior of AI models

OpenAI has emerged as one of the most recognizable pioneers in the generative artificial intelligence industry thanks to the impressive capabilities of large language models such as GPT-4. Now, it’s ...

ZDNet

AI models know when they're being tested - and change their behavior, research shows

Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...

Microsoft

Detecting backdoored language models at scale

Learn how Microsoft research uncovers backdoor risks in language models and introduces a practical scanner to detect ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

OpenAI’s evolving Model Spec aims to guide the behavior of AI models

AI models know when they're being tested - and change their behavior, research shows

Detecting backdoored language models at scale

Trending now