Trending
Articles related to "NLP evaluation"
How Far Are LLMs from Symbolic Planners? An NLP-Based Perspective
cs.AI updates on arXiv.org 2025-08-05T11:10:04.000000Z
EleutherAI Presents Language Model Evaluation Harness (lm-eval) for Reproducible and Rigorous NLP Assessments, Enhancing Language Model Evaluation
MarkTechPost@AI 2024-05-26T06:31:00.000000Z