MarkTechPost@AI 03月01日
Google AI Introduces PlanGEN: A Multi-Agent AI Framework Designed to Enhance Planning and Reasoning in LLMs through Constraint-Guided Iterative Verification and Adaptive Algorithm Selection
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Google AI推出PlanGEN,这是一个多代理框架,旨在改进大语言模型的规划和推理,通过约束引导的迭代验证和自适应算法选择来解决复杂任务中的问题。

PlanGEN是多代理框架,包含三个协同工作的代理:约束代理、验证代理和选择代理。

该框架强调模块化和细化,约束代理提取问题参数,验证代理评估计划质量并给予反馈。

选择代理采用改进的UCB政策,动态选择推理算法,PlanGEN在多个基准测试中表现出色。

Large language models have made remarkable strides in natural language processing, yet they still encounter difficulties when addressing complex planning and reasoning tasks. Traditional methods often rely on static templates or single-agent systems that fall short in capturing the subtleties of real-world problems. This shortfall is evident when models must verify generated plans, adapt to varying levels of complexity, or refine outputs iteratively. Whether it is scheduling meetings or solving scientific problems, the limitations of conventional approaches prompt the need for more nuanced and adaptable strategies.

Google AI introduces PlanGEN—a multi-agent framework designed to improve planning and reasoning in large language models by incorporating constraint-guided iterative verification and adaptive algorithm selection. PlanGEN comprises three agents that work in concert: the constraint agent extracts problem-specific details, the verification agent evaluates the quality of the proposed plan, and the selection agent chooses the most appropriate inference algorithm based on the problem’s complexity. Rather than relying on a single, rigid approach, this framework facilitates a process in which initial plans are refined iteratively, ensuring that the final output is both accurate and contextually appropriate.

Technical Underpinnings and Advantages

At the core of PlanGEN is its emphasis on modularity and refinement. The process begins with the constraint agent, which carefully extracts essential parameters from the problem description—such as individual schedules in calendar planning or key concepts in scientific reasoning tasks. This extracted information forms a set of criteria against which potential plans are measured. The verification agent then steps in, assessing each candidate plan against these constraints and assigning a reward score on a scale that ranges from –100 to 100. This feedback, expressed in natural language, not only quantifies plan quality but also highlights areas for improvement.

The selection agent adds another layer of sophistication by employing a modified Upper Confidence Bound (UCB) policy. This adaptive mechanism weighs factors like historical performance, the need to explore less-tested methods, and recovery from previous errors. By dynamically selecting among different inference algorithms—such as Best of N, Tree-of-Thought (ToT), or REBASE—PlanGEN is able to tailor its approach to the complexity of each specific task. The framework’s design allows it to transition smoothly between different strategies, balancing exploration and exploitation without overcommitting to any one method.

Empirical Insights and Experimental Results

PlanGEN has been evaluated across several benchmarks, demonstrating consistent improvements in planning and reasoning tasks. In the NATURAL PLAN benchmark, which covers tasks such as calendar scheduling, meeting planning, and trip planning, PlanGEN has shown notable improvements in exact match scores. For example, one variant of the framework achieved better performance in calendar scheduling by effectively refining the planning steps through iterative verification.

Similarly, in mathematical and scientific reasoning benchmarks like OlympiadBench, the framework’s adaptive approach has led to higher accuracy in both mathematics and physics categories. On the DocFinQA dataset, which focuses on financial document understanding, PlanGEN has been able to enhance both accuracy and F1 scores. These improvements are attributed to the framework’s ability to harness detailed feedback and adjust its inference strategy accordingly. By integrating both verification and selection mechanisms, PlanGEN demonstrates a balanced and methodical approach to problem solving that adapts to the demands of each task.

Conclusion

PlanGEN represents a thoughtful advance in addressing the challenges inherent in complex planning and reasoning for large language models. By combining the strengths of multiple specialized agents, the framework supports a more deliberate and iterative approach to generating high-quality plans. Its modular design—centered on the extraction of constraints, iterative verification, and adaptive selection of inference algorithms—ensures that each solution is carefully refined to meet the specific demands of the task at hand.

The results from various benchmarks illustrate that a collaborative, multi-agent system can indeed outperform more conventional single-agent methods, without relying on overly aggressive claims. Instead, the improvements observed are the result of measured, incremental advancements achieved by systematically incorporating feedback and adapting to instance-level complexity. As the field continues to develop, PlanGEN’s balanced methodology offers a promising foundation for future work in enhancing the natural language planning capabilities of large language models. This approach, grounded in careful analysis and iterative improvement, provides a practical pathway toward more robust and reliable AI systems for complex reasoning tasks.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 80k+ ML SubReddit.

Recommended Read- LG AI Research Releases NEXUS: An Advanced System Integrating Agent AI System and Data Compliance Standards to Address Legal Concerns in AI Datasets

The post Google AI Introduces PlanGEN: A Multi-Agent AI Framework Designed to Enhance Planning and Reasoning in LLMs through Constraint-Guided Iterative Verification and Adaptive Algorithm Selection appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Google AI PlanGEN 规划推理 多代理框架
相关文章