Pinned post

Q&A with Uber CEO Dara Khosrowshahi on the trade-offs with Route Share, Uber's business structure, autonomous vehicles, working with fleet operators, and more (Nilay Patel/The Verge)

Nilay Patel / The Verge : Q&A with Uber CEO Dara Khosrowshahi on the trade-offs with Route Share, Uber's business structure, auto...

22 May 2025

Anthropic says Opus 4 will use an email tool to "whistleblow" if it detects users doing something "egregiously evil", like marketing a drug based on faked data (Sam Bowman/@sleepinyourhat)

Sam Bowman / @sleepinyourhat:
Anthropic says Opus 4 will use an email tool to “whistleblow” if it detects users doing something “egregiously evil”, like marketing a drug based on faked data  —  With this kind of (unusual but not super exotic) prompting style, and unlimited access to tools, if the model sees you doing something *egregiously evil* like marketing a drug based on faked data, it'll try to use an email tool to whistleblow.

Posted from: this blog via Microsoft Power Automate.

Daily Deals