GreatAIPrompts, November 26, 2024
Cognition Reveals Devin, the World’s First Fully Autonomous AI Software Engineer

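US startup Cognition has launched Devin, billed as the world's first fully autonomous AI software engineer. Devin can solve engineering tasks independently: it can use a browser to learn from API documentation, debug code automatically, and build and deploy applications. It performed strongly on the SWE-bench benchmark, outscoring GPT-4 and other models. Cognition is offering early access to businesses that need it.

🎯 Devin is billed as the world's first fully autonomous AI software engineer, able to complete engineering tasks independently.

💻 Devin can use a browser to learn from API documentation, debug code automatically, and solve engineering problems.

🚀 Devin performed strongly on the SWE-bench benchmark, outscoring GPT-4 and other models, and can complete coding tasks autonomously.

🎉 Cognition is offering businesses early access to Devin, pushing AI software engineering forward.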

March 17th, 2024: US-based startup Cognition introduced Devin, an AI-powered tool the company claims is the “world’s first fully autonomous AI software engineer.”

Devin is designed to solve engineering tasks independently using its own shell, code editor, and web browser.

Devin AI fixing GitHub bugs autonomously

According to demonstrations provided by Cognition, Devin can utilize its web browser to access and learn from API documentation, enabling it to plug into various APIs.

https://youtu.be/fjHtjT7GO1c

When the AI agent encounters an error, it automatically adds a debugging print statement to the main code within its code editor interface and reruns the code.

Cognition has showcased Devin’s capabilities in building and deploying apps, identifying and fixing bugs in codebases, and even fine-tuning AI models.

To assess Devin’s accuracy, Cognition tested the AI agent on SWE-bench, a benchmarking platform that challenges agents to resolve real-world issues found in open-source projects on GitHub.

Devin successfully resolved 13.86% of the issues end-to-end, surpassing the performance of GPT-4 (1.74%) and the previous best score held by Anthropic’s Claude 2 (4.80%).

Notably, Devin achieved this without assistance in locating the relevant files within the repository.
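To put the reported SWE-bench figures in perspective, the short Python sketch below compares the three resolve rates quoted above. It is illustrative arithmetic only: the percentages come from the article, while the function and variable names are assumptions introduced for this example and are not part of SWE-bench or any Cognition tooling.

```python
# Resolve rates (percent of SWE-bench issues resolved end-to-end),
# exactly as quoted in the article. Names below are illustrative only.
RESOLVE_RATES = {
    "Devin": 13.86,
    "Claude 2": 4.80,
    "GPT-4": 1.74,
}


def relative_improvement(rates, subject, baseline):
    """Return how many times higher the subject's rate is than the baseline's."""
    return rates[subject] / rates[baseline]


def percentage_point_gap(rates, subject, baseline):
    """Return the absolute difference between two rates, in percentage points."""
    return rates[subject] - rates[baseline]


if __name__ == "__main__":
    for baseline in ("Claude 2", "GPT-4"):
        ratio = relative_improvement(RESOLVE_RATES, "Devin", baseline)
        gap = percentage_point_gap(RESOLVE_RATES, "Devin", baseline)
        print(
            f"Devin vs {baseline}: {ratio:.1f}x the resolve rate, "
            f"+{gap:.2f} percentage points"
        )
```

On these figures, Devin's reported rate is roughly 2.9 times that of Claude 2 and about 8 times that of GPT-4, although in absolute terms it still leaves most benchmark issues unresolved.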

While Microsoft offers AI-powered developer tools such as GitHub Copilot, which provides code completion and other assistive features for programmers, Copilot cannot complete coding tasks end to end without human intervention.

In contrast, Devin is capable of autonomously completing coding tasks.

Cognition is currently offering early access to Devin for businesses that want to use the AI agent for engineering work. Interested customers can request early access through the company’s website.

With its impressive performance on the SWE-bench platform and its ability to operate independently, Devin represents a significant step forward in the development of AI-powered software engineering solutions.

