TechCrunch News 03月31日 21:05
Amazon unveils Nova Act, an AI agent that can control a web browser
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

亚马逊推出通用AI代理Nova Act,可控制浏览器执行简单操作。它还附带SDK,开发者可用其构建原型。Nova Act将为亚马逊Alexa+升级提供动力,虽目前是研究预览版,但被认为能让AI聊天机器人更有用,且在内部测试中表现出色。

🎈亚马逊推出通用AI代理Nova Act,可控制浏览器操作

🛠️发布Nova Act SDK,供开发者构建代理原型

💪Nova Act将助力亚马逊Alexa+升级,增强语音助手功能

📈在内部测试中,Nova Act表现优于OpenAI和Anthropic的部分产品

Amazon on Monday unveiled Nova Act, a general-purpose AI agent that can take control of a web browser and independently perform some simple actions. Alongside the new agentic AI model, Amazon is releasing the Nova Act SDK, a toolkit that allows developers to build agent prototypes with Nova Act.

Nova Act, developed by Amazon’s recently opened San Francisco-based AGI lab, will also power key features of the company’s upcoming Alexa+ upgrade, a generative AI-enhanced version of Amazon’s popular voice assistant. The version of Nova Act available starting today is a little less polished, however. Amazon is calling it a research preview.

Developers can access the Nova Act toolkit on a new website, nova.amazon.com, which also serves as a showcase for Amazon’s various Nova foundation models.

Nova Act is Amazon’s attempt to take on OpenAI’s Operator and Anthropic’s Computer Use with general-purpose AI agent technology of its own. Several leading tech companies believe AI agents that can navigate the web for users will make today’s AI chatbots significantly more useful.

Amazon may not be the first to develop this sort of agentic technology, but via Alexa+, it may have the widest reach.

Amazon says developers building with the Nova Act SDK should be able to automate basic actions on behalf of users, such as ordering salads from Sweetgreen or making dinner reservations. With the Nova Act toolkit, developers can pull together tools that allow an AI agent to navigate web pages, fill out forms, or pick dates on a calendar.

Amazon claims that Nova Act outperforms agents from OpenAI and Anthropic on several of the company’s internal tests. For example, on ScreenSpot Web Text, which measures how an AI agent interacts with text on a screen, Nova Act scored 94%, outperforming OpenAI’s CUA (which scored 88%) and Anthropic’s Claude 3.7 Sonnet (90%).

However, Amazon didn’t benchmark Nova Act using more common agent evaluations, such as WebVoyager.

Nova Act is the first public product to emerge from Amazon’s aforementioned AGI lab, an initiative co-led by former OpenAI researchers David Luan and Pieter Abbeel. Both previously founded startups of their own — Luan started Adept, while Abbeel cofounded Covariant — before Amazon hired them away last year to spearhead its AI agent efforts.

While it may seem strange for an AGI lab to be building AI agents that can order SweetGreen, Luan told TechCrunch that he sees agents as a key step toward creating superintelligent AI systems. Luan defines AGI as “an AI system that can help you do anything a human does on a computer.”

Luan says his team designed the Nova Act SDK to reliably automate short, simple tasks, and give developers tools to precisely define when they want a human to intervene in an agentic workflow. He hopes it will allow developers to create more reliable agentic applications, albeit not necessarily fully autonomous ones.

Amazon is releasing its first generalist AI agent in a crowded space, but it’s a crucial technology that the company has a lot riding on. Early tests of Nova Act could provide a glimpse into some of the capabilities of the long-delayed Alexa+, a make-or-break moment for Amazon’s AI efforts.

A major problem with early AI agents from OpenAI, Google, and Anthropic is their reliability across different domains. In TechCrunch’s tests, the systems are slow, struggle to operate independently for very long, and are prone to mistakes a human wouldn’t make. It won’t be long until we see whether Amazon has cracked the code — or whether its agents suffer from the same flaws plaguing competitors.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

亚马逊 Nova Act AI代理 SDK
相关文章