Trending
Articles related to "OOCR"
Training on Documents About Reward Hacking Induces Reward Hacking
LessWrong 2025-01-21T21:36:15.000000Z
Inductive Out-of-Context Reasoning (OOCR) in Large Language Models (LLMs): Its Capabilities, Challenges, and Implications for Artificial Intelligence (AI) Safety
MarkTechPost@AI 2024-06-24T07:01:46.000000Z