Microsoft Azure Blog Announcements 03月12日 18:03
Announcing the Responses API and Computer-Using Agent in Azure AI Foundry
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

微软Azure AI Foundry推出Responses API和Computer-Using Agent (CUA),旨在通过自动化工作流程、提高生产力并支持智能决策,革新各行各业。Responses API简化了AI代理的开发,通过单一API调用实现数据检索、信息处理和操作执行。CUA则是一种突破性的AI模型,它能够通过自然语言指令与软件界面交互、执行任务并自动化工作流程。这些创新共同赋能企业,将AI不仅视为助手,更视为积极的数字劳动力,从而推动大规模的自动化、效率和智能化。

🔑Responses API是Azure AI Foundry中解锁Agentic AI的关键,它将Azure OpenAI Service强大的内置工具与Chat Completions API的简洁性相结合,支持工具调用、计算机使用、文件搜索、代码解释器和函数调用等功能,简化了AI代理的开发流程。

💻Computer-Using Agent (CUA)是一种专门的AI模型,它允许AI通过自然语言指令与图形用户界面(GUI)交互,导航应用程序并自动化多步骤任务。CUA能够自主导航UI、动态适应界面变化,并在不同的应用程序中执行任务,无需依赖API集成。

🛡️为了确保AI系统的安全性、可靠性以及与人类意图的一致性,微软和OpenAI实施了多层安全方法,包括模型层面的有害任务拒绝和未授权行为拒绝,系统层面的内容过滤和执行监控,以及人为监督,以防止滥用、意外行为和对抗性风险。

AI agents are transforming industries by automating workflows, enhancing productivity, and enabling intelligent decision-making. Businesses are leveraging AI agents to process insurance claims, manage IT service desks, optimize supply chain logistics, and even assist healthcare professionals in analyzing medical records. The potential is vast, and we’re excited to introduce two powerful innovations in Azure AI Foundry:

Together, these capabilities empower businesses to reimagine AI not just as an assistant—but as an active digital workforce. Enterprise customers will soon gain access to these innovations driving automation, efficiency, and intelligence at scale.

Enhancing AI Agents with the Responses API 

The Responses API is the key to unlocking agentic AI in Azure AI Foundry, transforming how enterprises harness AI for real-world impact. It is the new foundation for leveraging Azure OpenAI Service’s powerful built-in tools, combining the simplicity of the Chat Completions API with the advanced capabilities available through Assistants API and Azure AI Agent Service. The Responses API enables seamless interaction with tools like CUA, code interpreter, function calling, and file search—all in a single API call. This API enables AI systems to retrieve data, process information, and take actions—seamlessly connecting agentic AI with enterprise workflows. 

How the Responses API Works 

The Responses API provides a structured response format that allows AI to interact with multiple tools while maintaining context across interactions. It supports: 

By consolidating retrieval, reasoning, and action execution into a single API, the Responses API simplifies AI agent development, reducing the complexity of orchestrating multiple AI tools within an automation pipeline.

This scalability makes it well-suited for enterprise use cases across industries such as customer service, IT operations, finance, and supply chain management, where AI-powered automation can streamline workflows and improve efficiency. For even greater flexibility and control, organizations can explore Azure AI Agent Service, which offers additional tools and models for developing and scaling AI agents. Azure AI Agent Service integrates with Semantic Kernel and AutoGen, enabling seamless multi-agent orchestration for more complex scenarios requiring multiple agents to collaborate on tasks.

Empowering AI Agents with the Computer-Using Agent

The Computer-Using Agent (CUA) is a specialized AI model in Azure OpenAI Service that allows AI to interact with graphical user interfaces (GUIs), navigate applications, and automate multi-step tasks—all through natural language instructions. Unlike traditional automation tools that rely on predefined scripts or API-based integrations, CUA can interpret visual elements, adapt dynamically, and take action based on on-screen content.

What makes the Computer-Using Agent unique?

With today’s announcement, developers can start building additional agentic capabilities right away with CUA. As enterprises look to deploy this technology at scale, we are evaluating integration with Windows 365 and Azure Virtual Desktop to enable CUA automation to run seamlessly in a managed host environment on Cloud PCs or virtual machines (VMs), ensuring consistent performance while maintaining enterprise compliance and security standards.

Ensuring secure and trustworthy AI automation

As AI systems become more autonomous, ensuring security, reliability, and alignment with human intent is critical. The CUA model is one of the first agentic AI models capable of directly interacting with software environments, bringing new challenges in misuse prevention, unintended actions, and adversarial risks. To address these, Microsoft and OpenAI have implemented a multi-layered safety approach spanning the model, system, and deployment levels.

The CUA model is developed with safeguards to refuse harmful tasks, reject unauthorized actions, and prevent misuse. At the system level, Microsoft implements enterprise-grade content filtering and execution monitoring to help detect and prevent policy violations. To minimize unintended actions, CUA is designed to request user confirmations before executing irreversible tasks and to restrict high-risk actions such as financial transactions. 

Microsoft’s Trustworthy AI framework further ensures real-time observability, logging, and compliance auditing for enterprise deployments. Automated and human-in-the-loop detection systems monitor execution patterns, identifying anomalous behaviors and enforcing governance policies. These safeguards are continuously refined based on internal red-teaming, external audits, and real-world testing to strengthen protection against prompt injections, adversarial manipulations, and unauthorized access. Given the current reliability level of the CUA model—particularly in non-browser environments—human oversight remains strongly recommended for sensitive operations.

As AI agents evolve, Microsoft is committed to transparency, security, and ongoing risk mitigation. By combining CUA’s built-in safeguards with Azure’s enterprise compliance and governance tools, organizations can deploy AI-powered automation with confidence, ensuring safe and responsible AI adoption at scale.

Getting started with CUA and Responses API

Azure AI Foundry continues to push the boundaries of AI-powered automation. Enterprise customers will gain access to the Responses API and CUA in Azure OpenAI Service in the coming weeks.

We’re excited to see how developers and businesses innovate with these new capabilities.  

The post Announcing the Responses API and Computer-Using Agent in Azure AI Foundry appeared first on Microsoft Azure Blog.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Azure AI Foundry Responses API Computer-Using Agent (CUA)
相关文章