Trending
Articles related to "CoT activation"
Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models
cs.AI updates on arXiv.org 2025-07-17T04:14:35.000000Z