MarkTechPost@AI 08月01日 12:26
Google AI Introduces the Test-Time Diffusion Deep Researcher (TTD-DR): A Human-Inspired Diffusion Framework for Advanced Deep Research Agents
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

谷歌近期推出的Test-Time Diffusion Deep Researcher (TTD-DR) 是一个创新的人工智能研究框架,旨在弥合当前AI研究助手与人类研究者在思维和写作过程上的差距。与现有助手不同,TTD-DR借鉴了人类研究中搜索、思考、反馈和迭代优化的循环过程。它将研究报告生成视为一个扩散过程,以一个不断更新的草稿作为研究方向的指导,并通过检索机制在每一步动态地融入外部信息进行“去噪”式优化。该框架包含研究计划生成、迭代搜索与合成、以及最终报告生成三个主要阶段,并辅以自进化算法提升各环节性能。相较于现有模型,TTD-DR在多项基准测试中展现出优越的性能,尤其在需要深度搜索和多步推理的任务上,其长篇研究报告的生成质量得到了显著提升。

💡 TTD-DR借鉴了人类研究的迭代思维模式,将研究报告生成过程比作扩散过程,以一个动态更新的草稿为核心,通过检索和“去噪”优化来逐步完善研究内容,旨在解决现有AI研究助手缺乏结构化、人性化研究流程的问题。

🚀 该框架包含三个核心阶段:研究计划生成、迭代搜索与合成、以及最终报告生成。每个阶段都由单元LLM代理、工作流和代理状态组成,并引入了自进化算法,旨在提升每个阶段的性能并有效保存高质量的上下文信息。

📊 在与OpenAI Deep Research的对比评测中,TTD-DR在长篇研究报告生成任务上取得了显著的优势,在多个研究数据集上表现出更高的胜率和更优的各项指标,特别是在需要密集搜索和多步推理的场景下,其性能得到了充分验证。

🧠 TTD-DR通过整合“扩散”与“检索”机制,有效克服了现有AI研究助手在处理复杂研究任务时的局限性,其以草稿为中心的設計使得报告撰写更及时、连贯,并能减少信息在迭代搜索过程中的损失。

Deep Research (DR) agents have rapidly gained popularity in both research and industry, thanks to recent progress in LLMs. However, most popular public DR agents are not designed with human thinking and writing processes in mind. They often lack structured steps that support human researchers, such as drafting, searching, and using feedback. Current DR agents compile test-time algorithms and various tools without cohesive frameworks, highlighting the critical need for purpose-built frameworks that can match or excel human research capabilities. The absence of human-inspired cognitive processes in current methods creates a gap between how humans do research and how AI agents handle complex research tasks.

Existing works, such as test-time scaling, utilize iterative refinement algorithms, debate mechanisms, tournaments for hypothesis ranking, and self-critique systems to generate research proposals. Multi-agent systems utilize planners, coordinators, researchers, and reporters to produce detailed responses, while some frameworks enable human co-pilot modes for feedback integration. Agent tuning approaches focus on training through multitask learning objectives, component-wise supervised fine-tuning, and reinforcement learning to improve search and browsing capabilities. LLM diffusion models attempt to break autoregressive sampling assumptions by generating complete noisy drafts and iteratively denoising tokens for high-quality outputs.

Researchers at Google introduced Test-Time Diffusion Deep Researcher (TTD-DR), inspired by the iterative nature of human research through repeated cycles of searching, thinking, and refining. It conceptualizes research report generation as a diffusion process, starting with a draft that serves as an updated outline and evolving foundation to guide research direction. The draft undergoes iterative refinement through a “denoising” process, dynamically informed by a retrieval mechanism that incorporates external information at each step. This draft-centric design makes report writing more timely and coherent while reducing information loss during iterative search processes. TTD-DR achieves state-of-the-art results on benchmarks that require intensive search and multi-hop reasoning.

The TTD-DR framework addresses limitations of existing DR agents that employ linear or parallelized processes. The proposed backbone DR agent contains three major stages: Research Plan Generation, Iterative Search and Synthesis, and Final Report Generation, each containing unit LLM agents, workflows, and agent states. The agent utilizes self-evolving algorithms to enhance the performance of each stage, helping it to find and preserve high-quality context. The proposed algorithm, inspired by recent self-evolution work, is implemented in a parallel workflow along with sequential and loop workflows. This algorithm can be applied to all three stages of agents to improve overall output quality.

In side-by-side comparisons with OpenAI Deep Research, TTD-DR achieves 69.1% and 74.5% win rates for long-form research report generation tasks, while outperforming by 4.8%, 7.7%, and 1.7% on three research datasets with short-form ground-truth answers. It shows strong performance in Helpfulness and Comprehensiveness auto-rater scores, especially on LongForm Research datasets. Moreover, the self-evolution algorithm achieves 60.9% and 59.8% win rates against OpenAI Deep Research on LongForm Research and DeepConsult. The correctness score shows an enhancement of 1.5% and 2.8% on HLE datasets, though the performance on GAIA remains 4.4% below OpenAI DR. The incorporation of Diffusion with Retrieval leads to substantial gains over OpenAI Deep Research across all benchmarks.

In conclusion, Google presents TTD-DR, a method that addresses fundamental limitations through human-inspired cognitive design. The framework’s approach conceptualizes research report generation as a diffusion process, utilizing an updatable draft skeleton that guides research direction. TTD-DR, enhanced by self-evolutionary algorithms applied to each workflow component, ensures high-quality context generation throughout the research process. Moreover, evaluations demonstrate that TTD-DR’s state-of-the-art performance across various benchmarks that require intensive search and multi-hop reasoning, with superior results in both comprehensive long-form research reports and concise multi-hop reasoning tasks.


Check out the Paper here. Feel free to check our Tutorials page on AI Agent and Agentic AI for various applications. Also, feel free to follow us on Twitter and don’t forget to join our 100k+ ML SubReddit and Subscribe to our Newsletter.

The post Google AI Introduces the Test-Time Diffusion Deep Researcher (TTD-DR): A Human-Inspired Diffusion Framework for Advanced Deep Research Agents appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

TTD-DR AI研究助手 深度研究 扩散模型 人工智能
相关文章