热点
关于我们
xx
xx
"
代码评估
" 相关文章
ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle
cs.AI updates on arXiv.org
2025-07-18T04:13:54.000000Z
AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts
cs.AI updates on arXiv.org
2025-07-09T04:01:39.000000Z
首次覆盖超 11 类真实编程场景!豆包大模型团队开源代码大模型全新基准
字节跳动技术团队
2024-12-07T10:45:19.000000Z
首次覆盖超 11 类真实编程场景!豆包大模型团队开源代码大模型全新基准
豆包MarsCode
2024-12-06T11:44:28.000000Z
CodeJudge: An Machine Learning Framework that Leverages LLMs to Evaluate Code Generation Without the Need for Test Cases
MarkTechPost@AI
2024-10-17T12:03:46.000000Z
左脚踩右脚上天!OpenAI全新模型让GPT-4训练GPT-4
快科技资讯
2024-06-28T09:05:23.000000Z