热点
"SWE-Bench" 相关文章
Augment Code 深度评测:是下一个 Cursor,还是昙花一现的 SWE-BENCH 冠军?
掘金 人工智能 2025-04-18T10:57:52.000000Z
Augment Code Released Augment SWE-bench Verified Agent: An Open-Source Agent Combining Claude Sonnet 3.7 and OpenAI O1 to Excel in Complex Software Engineering Tasks
MarkTechPost@AI 2025-04-04T20:35:28.000000Z
The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
Latent 2025-01-15T06:34:57.000000Z
2024 in Agents [LS Live! @ NeurIPS 2024]
Latent 2025-01-15T06:34:57.000000Z
Z Potentials|王星尧,99年博士创业AI编程,获Anthropic投资,全球权威榜单第一,解决过半编程问题
Z Potentials 2025-01-06T07:30:37.000000Z
Cognition Reveals Devin the World’s First Fully Autonomous AI Software Engineer
GreatAIPrompts 2024-11-26T06:32:22.000000Z
解决真实GitHub Issue能力登顶,字节豆包MarsCode团队分享背后工程实践,踩过的坑也分享了
智源社区 2024-11-05T07:07:15.000000Z
All Hands AI Open Sources OpenHands CodeAct 2.1: A New Software Development Agent to Solve Over 50% of Real Github Issues in SWE-Bench
MarkTechPost@AI 2024-11-01T16:05:52.000000Z
Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)
Latent 2024-10-22T02:56:29.000000Z
再见,Devin,基于GPT-4o,最强”AI工程师“Genie诞生
36kr 2024-08-13T09:18:18.000000Z