Articles related to "SWE-Bench"
Augment Code In-Depth Review: The Next Cursor, or a Flash-in-the-Pan SWE-BENCH Champion?
掘金 人工智能
2025-04-18T10:57:52.000000Z
Augment Code Released Augment SWE-bench Verified Agent: An Open-Source Agent Combining Claude Sonnet 3.7 and OpenAI O1 to Excel in Complex Software Engineering Tasks
MarkTechPost@AI
2025-04-04T20:35:28.000000Z
The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
Latent
2025-01-15T06:34:57.000000Z
2024 in Agents [LS Live! @ NeurIPS 2024]
Latent
2025-01-15T06:34:57.000000Z
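Z Potentials | Xingyao Wang: PhD Born in 1999 Founds an AI Coding Startup, Backed by Anthropic, Ranked First on a Leading Global Benchmark, Solving Over Half of Programming Problems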
Z Potentials
2025-01-06T07:30:37.000000Z
Cognition Reveals Devin the World’s First Fully Autonomous AI Software Engineer
GreatAIPrompts
2024-11-26T06:32:22.000000Z
Topping the Rankings in Resolving Real GitHub Issues: ByteDance's Doubao MarsCode Team Shares the Engineering Practices Behind It, Pitfalls Included
智源社区
2024-11-05T07:07:15.000000Z
All Hands AI Open Sources OpenHands CodeAct 2.1: A New Software Development Agent to Solve Over 50% of Real Github Issues in SWE-Bench
MarkTechPost@AI
2024-11-01T16:05:52.000000Z
Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)
Latent
2024-10-22T02:56:29.000000Z
Goodbye, Devin: Genie, the Strongest "AI Engineer," Built on GPT-4o, Is Born
36kr
2024-08-13T09:18:18.000000Z