Anthropic:
Researchers detail “subliminal learning”, where LLMs learn traits from model-generated data that is semantically unrelated to those traits — James Chua2, Jan Betley2, Anna Sztyber-Betley3, Jacob Hilton4, — *Equal contribution; author order chosen randomly
Posted from: this blog via Microsoft Power Automate.