热点
"AbGen基准" 相关文章
AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research
cs.AI updates on arXiv.org 2025-07-18T04:13:57.000000Z