热点
"ColBench" 相关文章
田渊栋和Sergey Levine参与开发新型RL算法,能通过多轮训练让智能体学会协作推理
机器之心 2025-04-09T10:04:04.000000Z
Meta 推出强化学习新框架 SWEET-RL,让 AI 更懂人类意图
IT之家 2025-03-24T02:57:50.000000Z
Meta AI Researchers Introduced SWEET-RL and CollaborativeAgentBench: A Step-Wise Reinforcement Learning Framework to Train Multi-Turn Language Agents for Realistic Human-AI Collaboration Tasks
MarkTechPost@AI 2025-03-23T00:20:13.000000Z