Anthropic @AnthropicAI
We’re also hiring full-time researchers to investigate topics like this in more depth: https://t.co/L5I2x0xrPD
Jack Lindsey @Jack_W_Lindsey
We're launching an "AI psychiatry" team as part of interpretability efforts at Anthropic! We'll be researching phenomena like model personas, motivations, and situational awareness, and how they lead to spooky/unhinged behaviors. We're hiring - join us! https://t.co/cUPsJ8ktsG