Mashable 01月24日
OpenAI announces Operator AI agent that can browse the web for you
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

OpenAI推出Operator,可自主执行多步任务。它能浏览网页、购物、发邮件等,还能在电脑上进行多种操作。Operator的模型结合了GPT-4o的视觉模式和图形用户界面。OpenAI采取多种方式考虑其安全性,目前该预览模式仅在美国对ChatGPT Pro用户开放,每月收费200美元。

💻Operator可自主执行多步任务,如计算退款、查找客户等

📱它能进行多种电脑操作,还可浏览网页、购买物品等

🔒OpenAI采取多种措施确保Operator的安全性

💰Operator预览模式目前仅对美国ChatGPT Pro用户开放,收费较高

Operator, OpenAI's agent that can perform multi-step tasks autonomously, has arrived.

The ChatGPT maker introduced a preview mode of Operator on Thursday, detailing how it works and what it's capable of. Operator can browse the web, performing tasks like calculating refunds from a canceled order and finding customers with specific criteria in an internal sales database. It can also buy groceries and send emails.

On a computer, Operator can perform various tasks, like downloading files, combining PDFs, analyzing spreadsheets, and exporting images.

OpenAI is delivering on its promise of making 2025 the year of agentic AI. Last week, the company launched Tasks for ChatGPT, which lets users automate future prompts like sending a daily brief of tech news or scheduling reminders. While many of these tasks are already possible through basic tools like Google Alerts and calendars, it's an early example of AI bots doing the legwork for the user. Combined with the release of Operator and its ability to autonomously take on more complex tasks, you can start to see OpenAI's vision for making ChatGPT an indispensable tool leveraging its core product.

The model underpinning Operator is a Computer-Using Agent (CUA) that combines GPT-4o's vision mode to "see" what's on the user's screen through screenshots with graphical user interfaces (GUIs) that enable Operator to interact with the screen (clicking buttons, typing, scrolling, etc.).

Operator in action, browsing for a campsite in Yosemite with picnic tables. Credit: OpenAI

OpenAI's safety approach with Operator

Obviously, safety is a big concern for a semi-autonomous AI agent like Operator. OpenAI says it has taken risks into account in a few different ways. Operator mitigates misuse by blocking harmful or illegal tasks, and can't access blacklisted sites like gambling and adult entertainment sites and drug or gun retailers.

And OpenAI is looking over your shoulder as you use Operator. The announcement says that "user interactions are reviewed in real-time by automated safety checkers that are designed to ensure compliance with Usage Policies and have the ability to issue warnings or blocks for prohibited activities," and that the company has developed "automated detection and human review pipelines to identify prohibited usage in priority policy areas, including child safety and deceptive activities."

Since Operator can make costly mistakes without human supervision, the model will ask for confirmation "before submitting an order, sending an email, etc., so that the user can double-check the model’s work before it becomes permanent." Operator is also currently limited from "higher-risk tasks like banking transactions."

Operator availability

Now is where we start to see OpenAI's new premium tier subscription, ChatGPT Pro. Operator in preview mode is only available in the U.S. to those who pay $200 a month as Pro users. But over time, OpenAI expects to expand availability to Plus, Team, and Enterprise users.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

OpenAI Operator 多任务 安全性 收费模式
相关文章