Mashable 07月18日 01:30
OpenAI announces ChatGPT agent for web browsing
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI发布了其最新的ChatGPT网页浏览助手,该工具能够代表用户浏览网页并执行任务。该助手整合了Operator Agent的自主导航能力和Deep Research的深度研究能力,克服了Operator无法深入分析和Deep Research无法交互的局限。作为agentic AI领域的一员,它与其他类似工具如Perplexity Comet和Anthropic的“computer use”工具一同,标志着AI在自主任务执行方面的新进展。ChatGPT网页浏览助手可以处理如日程安排、信息搜集和行程规划等任务,同时用户始终拥有控制权,助手在执行敏感操作前会请求许可,并提供中断或接管的选项。为确保安全,助手被禁止执行高风险任务,并接受了对抗性攻击的训练,同时用户可以一键删除浏览数据。

🚀 **整合双重能力,提升用户体验:** OpenAI新推出的ChatGPT网页浏览助手,成功融合了其Operator Agent的网页操作能力(如滚动、点击、输入)和Deep Research的深度信息搜集能力。这解决了Operator无法深入分析和撰写报告,以及Deep Research无法与网站交互以优化结果或访问需要用户认证的内容等痛点,实现了“将两者的优点结合起来”。

🌐 **Agentic AI新成员,拓展应用场景:** 该助手是agentic AI领域快速发展的一部分,能够代表用户执行诸如“查看我的日历并根据近期新闻简报即将到来的客户会议”或“规划并购买制作四人份日式早餐的食材”等复杂任务,预示着AI在自主化任务执行方面迈出了重要一步。

🔒 **用户控制与安全保障并重:** OpenAI强调用户始终拥有控制权,ChatGPT在执行如提交表单、进行购买或处理个人信息等操作前会请求用户许可,用户可以随时中断或接管。同时,助手被禁止执行高风险任务,并接受了识别恶意攻击的训练,确保用户数据的安全,如允许用户一键删除浏览数据并登出网站。

💰 **付费用户享有优先使用权:** 该网页浏览助手对ChatGPT Pro、Plus和Team用户开放。Pro用户将率先获得使用权,而Plus和Team用户将在未来几天内陆续获得。Pro用户每月拥有400条消息额度,Plus、Team及其他付费用户则为每月40条消息额度。

Meet OpenAI's new web browsing agent.

On Thursday, OpenAI announced ChatGPT agent, a tool that's capable of navigating the web and performing tasks on your behalf. As teased in an X post before the livestream, ChatGPT agent combines the autonomous capabilities of its Operator agent and the reasoning intelligence of its Deep Research tool.

OpenAI's Operator, which launched in January as preview mode to ChatGPT Pro users, could scroll, click, and type on the web but had limitations and never saw a widespread release. Deep research is another type of agent that can search the web and compile information on the user's behalf, but it couldn't take actions beyond that. The launch of OpenAI's new web browsing agent effectively combines both tools.

Credit: OpenAI

"Operator couldn’t dive deep into analysis or write detailed reports, and deep research couldn’t interact with websites to refine results or access content requiring user authentication," said the OpenAI announcement. "We saw that many queries users attempted with Operator were actually better suited for deep research, so we brought the best of both together."

OpenAI's new tool is part of the fast-growing agentic AI world

OpenAI's ChatGPT agent joins other agentic tools recently released that can perform tasks on the user's behalf. While not a full web browser, it acts similarly to Perplexity Comet's browser assistant. Anthropic also has a tool called "computer use" that can take over your cursor and write code. As models become more advanced, they are more capable of performing autonomous tasks. Web browsing is considered one of the next arenas for AI labs to compete in, with OpenAI, Anthropic, Perplexity already shipping features, and Google's Project Mariner research prototype.

When in agent mode, you can ask ChatGPT to perform tasks like "look at my calendar and brief me on upcoming client meetings based on recent news" or "plan and buy ingredients to make Japanese breakfast for four," according to an OpenAI spokesperson.

OpenAI said that users are always in control and ChatGPT requests permission before taking actions, such as submitting forms, making purchases, or handling personal info. OpenAI said users can easily interrupt or take over when needed. Certain tasks, like sending emails, require oversight called Watch Mode.

On the safety front, ChatGPT agent is not allowed to perform "high-risk" tasks like financial transactions or legal advice, and it is trained to recognize malicious or adversarial attacks from prompt injections or other manipulative tactics.

In terms of data gathering, OpenAI said ChatGPT can delete their browsing data and log out of websites with one click and does not collect data while the user is actively involved in tasks like entering passwords.

Given the advancements, there's also a considerably higher risk for things to go wrong. OpenAI has classified ChatGPT agent as high risk in its preparedness framework for biological and chemical capabilities.

There's good news for users who don't have the $200 a month ChatGPT Pro plan. It's available to Pro, Plus, and Team users. Pro users get access today, and Plus and Team users get access over the next few days. Pro users get 400 messages a month, while Plus, Team, and other paid users get 40 messages a month.


Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

OpenAI ChatGPT AI代理 网页浏览 人工智能
相关文章