It looks like you're new here. If you want to get involved, click one of these buttons!
As far as I know, all of the well-known AI LLM's were tested. I should add that the AI's were threatened with being taken offline, or otherwise thwarted.
“Models didn’t stumble into misaligned behavior accidentally; they calculated it as the optimal path,” they wrote.
© 2015 Mutual Fund Observer. All rights reserved.
© 2015 Mutual Fund Observer. All rights reserved. Powered by Vanilla
Comments
Now that's very scary.
Oh wait. They're just misaligned.
Gee, Officer Krupke, we're very upset
We never had the love that every child oughta get
We ain't no delinquents
We're misunderstood
Deep down inside us there is good!