National Cyber Warfare Foundation (NCWF)

AI models know when they're being tested - and change their behavior, research shows

2025-09-17 17:04:05
milo
Developers, Blue Team (CND)

OpenAI and Apollo Research tried to stop models from lying - and discovered something else altogether.

Source: ZDNET
Source Link: https://www.zdnet.com/article/ai-models-know-when-theyre-being-tested-and-change-their-behavior-research-shows/

Copyright 2012 through 2026 - National Cyber Warfare Foundation - All rights reserved worldwide.