MarkTechPost@AI 2024年07月12日
Internet of Agents (IoA): A Novel Artificial Intelligence AI Framework for Agent Communication and Collaboration Inspired by the Internet
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

为了克服现有框架的局限性,研究人员提出了一种名为“互联网代理 (IoA)”的框架,它利用类似即时通讯的架构来实现动态组队和灵活的通信,并使用有限状态机来控制对话流程。该框架能够整合来自不同设备的各种第三方代理,并通过实验表明,IoA 在通用任务、具身人工智能和检索增强生成基准测试中都优于最先进的基线,展示了其在复杂分布式多代理系统中的潜力。

😊 **IoA 架构** IoA 是一种类似于即时通讯应用程序的平台,它允许自主代理之间进行通信和协作。它解决了分布式协作、动态通信和异构代理集成问题。IoA 的服务器管理注册、发现和消息路由,而客户端提供代理通信接口。关键机制包括代理注册和发现、自主团队组建、结构化对话流程以及任务分配和执行。该系统使用全面的消息协议来实现高效的交互。例如,代理可以协作撰写研究论文,组建团队,分配任务并整合贡献以实现最终目标。

😎 **实验与评估** 研究人员进行了实验来展示 IoA 在跨各种任务整合异构代理方面的有效性,包括工具可变性、架构多样性、观察/动作空间和不同的知识库。IoA 在 GAIA 基准测试中表现出色,优于最先进的系统。它在开放式指令任务和具身人工智能挑战中展示了卓越的协作能力,即使代理具有不同的观察/动作空间。在检索增强生成任务中,IoA 的性能与 GPT-4 相当或超过 GPT-4。分析表明,即使在次优的通信模式下,IoA 也能够实现精确的团队组建和经济高效的任务执行。总的来说,IoA 是一个强大的平台,用于编排各种多代理系统。

🤩 **IoA 的优势** 该研究将 IoA 作为一种创新的框架来增强基于大型语言模型的多代理协作,其灵感来自互联网的概念。IoA 通过提供可扩展性、整合各种第三方代理的灵活性以及用于组队和对话控制的动态机制来克服现有框架的局限性。通过严格的基准测试实验,IoA 证明了其在促进异构代理之间协作方面的卓越效率,始终超越现有的基准测试。随着基于大型语言模型的代理领域的发展,IoA 有望成为未来多代理协作研究和开发的基石。通过实现独立开发的具有专业技能的代理的无缝集成,IoA 为先进且影响深远的多代理系统铺平了道路。

The rapid advancement of LLMs has enabled the creation of highly capable autonomous agents. However, multi-agent frameworks need help integrating diverse third-party agents due to ecosystem constraints and limited by single-device setups and rigid communication pipelines. Inspired by the Internet’s success in fostering human collaboration through projects like Wikipedia and Linux, a key question arises: can we create a similar platform for autonomous agents? With LLM-based agents achieving near-human performance and continually improving, exploring the efficient orchestration of diverse third-party agents to enhance their collaborative potential is crucial.

Researchers from Tsinghua University, Peking University, Beijing University of Posts and Telecommunications, and Tencent propose the Internet of Agents (IoA) framework to enhance LLM-based multi-agent collaboration. IoA overcomes existing limitations by integrating diverse third-party agents across multiple devices, using an instant messaging-like architecture for dynamic teaming and flexible communication. Inspired by Speech Act Theory, IoA employs a finite-state machine for conversation flow control. Experiments show IoA outperforms state-of-the-art baselines in general tasks, embodied AI, and retrieval-augmented generation benchmarks, achieving superior performance and highlighting its potential for sophisticated, distributed multi-agent systems.

Recent advancements in LLMs, including GPT, Claude, and Gemini, have led to AI agents capable of natural language interactions and diverse task performance. Researchers have enhanced these agents by integrating external tools and knowledge sources, enabling them to access information beyond their pre-trained data. Examples include OS-Copilot for web and code terminal interactions, OpenDevin for software development, XAgent and Voyager for complex tasks, and Minecraft gameplay, respectively. Building on these successes, multi-agent systems like AgentVerse and AutoGen enable collaboration among LLM-based agents. Despite progress, challenges remain, such as integrating third-party agents and supporting distributed systems. IoA aims to overcome these challenges, offering a flexible, scalable platform for advanced multi-agent collaboration.

The IoA is a platform resembling an instant messaging app, enabling communication and collaboration among autonomous agents. It tackles distributed collaboration, dynamic communication, and heterogeneous agent integration. IoA’s server manages registration, discovery, and message routing, while the client provides agent communication interfaces. Key mechanisms include agent registration and discovery, autonomous team formation, structured conversation flow, and task assignment and execution. The system uses a comprehensive message protocol for efficient interaction. For instance, agents collaborate to write research papers, form teams, assign tasks, and integrate contributions to achieve the final goal.

The researchers conducted experiments to showcase IoA’s effectiveness in integrating heterogeneous agents across diverse tasks: tool variability, architectural diversity, observation/action spaces, and varied knowledge bases. IoA excelled in the GAIA benchmark, outperforming SoTA systems. It demonstrated superior collaboration in open-ended instruction tasks and embodied AI challenges, even when agents had different observation/action spaces. In retrieval-augmented generation tasks, IoA matched or exceeded GPT-4 performance. Analysis revealed precise team formation and cost-effective task execution despite suboptimal communication patterns. Overall, IoA is a robust platform for orchestrating diverse, multi-agent systems.

The study introduced IoA as an innovative framework for enhancing LLM-based multi-agent collaboration, drawing inspiration from Internet concepts. IoA overcomes the limitations of current frameworks by offering scalability, flexibility in integrating diverse third-party agents, and dynamic mechanisms for teaming and conversation control. Through rigorous benchmarking experiments, IoA demonstrated superior efficiency in fostering collaboration among heterogeneous agents, consistently surpassing existing benchmarks. As the field of LLM-based agents evolves, IoA is poised to become a cornerstone for future research and development in multi-agent collaboration. By enabling seamless integration of independently developed agents with specialized skills, IoA paves the way for advanced and impactful multi-agent systems.


Check out the Paper and GitHub. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter

Join our Telegram Channel and LinkedIn Group.

If you like our work, you will love our newsletter..

Don’t Forget to join our 46k+ ML SubReddit

The post Internet of Agents (IoA): A Novel Artificial Intelligence AI Framework for Agent Communication and Collaboration Inspired by the Internet appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

人工智能 大型语言模型 多代理系统 互联网代理 协作
相关文章