The 'truth serum' for AI: OpenAI’s new method for training models to confess their mistakes
OpenAI researchers have introduced a novel method that acts as a "truth serum" for large language models (LLMs), compelling them to self-report their own misbehavior, hallucinations and policy violati...

