Radhika Rajkumar / ZDNET:
OpenAI and Apollo Research trained versions of o3 and o4-mini not to engage in “scheming”, or secretly pursuing undesirable goals, reducing “covert actions” by ~30X. ZDNET's key takeaways: Several frontier AI models show signs of scheming.