Will's Blog: Researchers detail "subliminal learning", where LLMs learn traits from model-generated data that is semantically unrelated to those traits (Anthropic)

23 July 2025

Researchers detail "subliminal learning", where LLMs learn traits from model-generated data that is semantically unrelated to those traits (Anthropic)

Anthropic:
Researchers detail “subliminal learning”, where LLMs learn traits from model-generated data that is semantically unrelated to those traits — James Chua2, Jan Betley2, Anna Sztyber-Betley3, Jacob Hilton4, — *Equal contribution; author order chosen randomly

Posted from: this blog via Microsoft Power Automate.