Articles related to "ColBench"
Yuandong Tian and Sergey Levine help develop a new RL algorithm that teaches agents collaborative reasoning through multi-turn training
机器之心 (Synced)
2025-04-09T10:04:04.000000Z
Meta launches SWEET-RL, a new reinforcement learning framework that helps AI better understand human intent
IT之家 (IT Home)
2025-03-24T02:57:50.000000Z
Meta AI Researchers Introduced SWEET-RL and CollaborativeAgentBench: A Step-Wise Reinforcement Learning Framework to Train Multi-Turn Language Agents for Realistic Human-AI Collaboration Tasks
MarkTechPost@AI
2025-03-23T00:20:13.000000Z