Articles related to "Situational Awareness"
These two new AI benchmarks could help make models less biased
MIT Technology Review » Artificial Intelligence 2025-03-11T09:37:35.000000Z
“Alignment Faking” frame is somewhat fake
LessWrong 2024-12-20T09:51:39.000000Z
Catastrophic sabotage as a major threat model for human-level AI systems
LessWrong 2024-10-22T21:08:02.000000Z
Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs
LessWrong 2024-08-24T04:37:13.000000Z
AI Safety at the Frontier: Paper Highlights, July '24
LessWrong 2024-08-05T13:06:44.000000Z
Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs
LessWrong 2024-07-08T22:35:23.000000Z