TechCrunch News, 2 days ago, 01:51
OpenAI upgrades the AI model powering its Operator agent

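OpenAI is upgrading Operator, its AI agent that can autonomously browse the web and use software within a cloud-hosted virtual machine to fulfill user requests. The new Operator will be based on o3, one of OpenAI’s latest “reasoning” models, which performs better on math and reasoning tasks. OpenAI says the new model, o3 Operator, was fine-tuned with additional safety data for computer use, and it has released a technical report showing the model’s performance on specific safety evaluations. Compared to GPT-4o Operator, o3 Operator is less likely to refuse to perform illicit activities or search for sensitive personal data, and is less susceptible to prompt injection attacks.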

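🧐 OpenAI is updating Operator, an AI agent that can autonomously browse the web and use software.

💡 The new Operator will use a model based on o3, one of the latest in OpenAI’s series of “reasoning” models, which performs better on math and reasoning.

🛡️ o3 Operator was fine-tuned with additional safety data, specifically for computer use, to improve safety. According to OpenAI’s technical report, the new model improves on GPT-4o Operator in refusing illicit activities and resisting prompt injection attacks.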

OpenAI is updating the AI model powering Operator, its AI agent that can autonomously browse the web and use certain software within a cloud-hosted virtual machine to fulfill users’ requests.

Soon, Operator will use a model based on o3, one of the latest in OpenAI’s o series of “reasoning” models. Previously, Operator relied on a custom version of GPT-4o.

By many benchmarks, o3 is a far more advanced model, particularly on tasks involving math and reasoning.

“We are replacing the existing GPT‑4o-based model for Operator with a version based on OpenAI o3,” OpenAI wrote in a blog post. “The API version [of Operator] will remain based on 4o.”

Operator is one of many agentic tools released by AI companies in recent months. Companies are racing to build highly sophisticated agents that can reliably carry out chores with little or no supervision.

Google offers a “computer use” agent through its Gemini API that can similarly browse the web and take actions on behalf of users, as well as a more consumer-focused offering called Mariner. Anthropic’s models are also able to perform computer tasks, including opening files and navigating webpages.

According to OpenAI, the new Operator model, called o3 Operator, was “fine-tuned with additional safety data for computer use,” including data sets designed to “teach the model [OpenAI’s] decision boundaries on confirmations and refusals.”

OpenAI has released a technical report showing o3 Operator’s performance on specific safety evaluations. Compared to the GPT-4o Operator model, o3 Operator is less likely to refuse to perform “illicit” activities and search for sensitive personal data, and less susceptible to a form of AI attack known as prompt injection, per the technical report.

“o3 Operator uses the same multi-layered approach to safety that we used for the 4o version of Operator,” OpenAI wrote in its blog post. “Although o3 Operator inherits o3’s coding capabilities, it does not have native access to a coding environment or terminal.”
