MarkTechPost@AI 2024年08月04日
AgentGen: Automating Environment and Task Generation to Enhance Planning Abilities in LLM-Based Agents with 592 Environments and 7,246 Trajectories
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

AgentGen 是一种利用大型语言模型 (LLM) 自动生成环境和规划任务的新框架,旨在解决当前 LLM 智能体训练数据有限的问题。该框架分为两个主要阶段:环境生成和任务生成。环境生成阶段通过文本片段的启发式语料库,生成详细且多样化的环境规范,包括环境概述、状态和动作空间描述以及转移函数定义。任务生成阶段则采用双向演化方法 (BI-EVOL),从简单到复杂地生成各种规划任务,以支持 LLM 的渐进式学习。研究人员使用 AgentBoard 平台评估了 AgentGen 的有效性,结果表明,使用 AgentGen 训练的 Llama-3 8B 模型在规划能力方面显著优于 GPT-3.5,甚至在某些任务上超过了 GPT-4。

🤖 AgentGen 利用大型语言模型 (LLM) 自动生成环境和规划任务,旨在解决当前 LLM 智能体训练数据有限的问题。

🗺️ 环境生成阶段通过文本片段的启发式语料库,生成详细且多样化的环境规范,包括环境概述、状态和动作空间描述以及转移函数定义。例如,一个文本片段可能会生成一个环境,其中智能体是一个营养学家,负责开发一本包含花生酱粉的新食谱。

🎯 任务生成阶段采用双向演化方法 (BI-EVOL),从简单到复杂地生成各种规划任务,以支持 LLM 的渐进式学习。这种方法通过简化目标条件来创建更简单的任务,并增加复杂性来创建更具挑战性的任务。

📈 研究人员使用 AgentBoard 平台评估了 AgentGen 的有效性,结果表明,使用 AgentGen 训练的 Llama-3 8B 模型在规划能力方面显著优于 GPT-3.5,甚至在某些任务上超过了 GPT-4。

🚀 AgentGen 的成功表明,自动生成环境和规划任务可以有效地提高 LLM 智能体的规划能力,为开发能够执行复杂规划任务的智能系统铺平道路。

Large Language Models (LLMs) have transformed artificial intelligence, particularly in developing agent-based systems. These systems require interacting with various environments and executing actions to achieve specific goals. Enhancing the planning capabilities of LLM-based agents has become a critical area of research due to the intricate nature and essential need for precise task completion in numerous applications.

One significant challenge in this research domain is the intensive manual labor required to create diverse and extensive planning environments and tasks. Current methodologies predominantly depend on manually designed scenarios, limiting the diversity and quantity of training data available. This limitation hampers the potential of LLMs to generalize and perform well across a wide range of situations. Addressing this issue, researchers have introduced automated techniques to generate a broad spectrum of environments and planning tasks, thus enriching the training datasets for LLM-based agents.

The research team from the University of Hong Kong and Microsoft Corporation has proposed a novel framework named AGENTGEN, which utilizes LLMs to automate the generation of environments and their corresponding planning tasks. This innovative approach involves two primary stages: environment generation and task generation. Initially, the framework uses an inspiration corpus comprising diverse text segments to create detailed and varied environment specifications. Following this, AGENTGEN generates related planning tasks that range from simple to complex, ensuring a smooth progression of difficulty and facilitating effective learning for the LLMs.

AGENTGEN distinguishes itself by employing a sophisticated environment generation process. The researchers designed an inspiration corpus to serve as the context for synthesizing environment specifications, which include a comprehensive overview of the environment, descriptions of the state and action spaces, and definitions of transition functions. For instance, one sample text segment might prompt the creation of an environment where the agent is a nutritionist tasked with developing a new recipe book featuring peanut butter powder. This method ensures a high level of diversity in the generated environments, creating numerous unique and challenging scenarios for agent training.

The task generation process within AGENTGEN further enhances the training data by applying a bidirectional evolution method known as BI-EVOL. This method evolves tasks in two directions: simplifying goal conditions to create easier tasks and increasing complexity to develop more challenging ones. This bidirectional approach results in a comprehensive set of planning tasks that support a gradual and effective learning curve for the LLMs—by implementing BI-EVOL, the research team generated 592 unique environments, each with 20 tasks, resulting in 7,246 high-quality trajectories for training.

The efficacy of AGENTGEN was rigorously evaluated using the AgentBoard platform. The results were impressive, demonstrating significant improvements in the planning abilities of LLM-based agents. The AGENTGEN-tuned Llama-3 8B model surpassed GPT-3.5 in overall performance and, in certain tasks, even outperformed GPT-4. Specifically, AGENTGEN achieved over five times the improvement compared to the raw Llama-3 8B on in-domain tasks, with success rates increasing from 1.67 to 11.67. Additionally, AGENTGEN showed a substantial performance enhancement in out-of-domain tasks, achieving a success rate of 29.1 on Alfworld, compared to 17.2 for GPT-3.5.

AGENTGEN demonstrated robust generalization capabilities across various models and tasks. The framework’s success was evident in its ability to improve the planning performance of multiple LLMs, including the smaller 7-8B models. For example, Llama-3 8B, after training with AGENTGEN, exhibited a success rate increase of 10.0 and a progress rate increase of 9.95. These results underscore the effectiveness of AGENTGEN in enhancing the capabilities of LLM-based agents, regardless of the specific model used.

In conclusion, AGENTGEN, by automating the generation of diverse environments and planning tasks, addresses the limitations of manual design and offers a scalable, efficient approach to improving agent performance. The framework’s ability to generate high-quality trajectory data and its demonstrated success in and out of domain tasks highlight its potential to revolutionize the training and application of LLM-based agents. AGENTGEN’s contributions to agent training methodologies are poised to enhance the development of intelligent systems capable of performing complex planning tasks with greater accuracy and efficiency.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 47k+ ML SubReddit

Find Upcoming AI Webinars here


The post AgentGen: Automating Environment and Task Generation to Enhance Planning Abilities in LLM-Based Agents with 592 Environments and 7,246 Trajectories appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AgentGen LLM 规划 环境生成 任务生成
相关文章