热点
"BALROG" 相关文章
Meet ‘BALROG’: A Novel AI Benchmark Evaluating Agentic LLM and VLM Capabilities on Long-Horizon Interactive Tasks Using Reinforcement Learning Environment
MarkTechPost@AI 2024-11-22T12:05:33.000000Z