热点
"HumanEval" 相关文章
Benchmarks for AI in Software Engineering
Communications of the ACM - Artificial Intelligence 2025-07-24T16:13:44.000000Z
Key Metrics for Evaluating Large Language Models (LLMs)
MarkTechPost@AI 2024-06-20T03:01:46.000000Z