热点
"HeroBench" 相关文章
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds
cs.AI updates on arXiv.org 2025-08-19T04:01:36.000000Z