MarkTechPost@AI 05月31日 02:25
Stanford Researchers Introduced Biomni: A Biomedical AI Agent for Automation Across Diverse Tasks and Data Types
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

斯坦福大学的研究人员推出了Biomni,一款通用的生物医学AI智能体,旨在自动化各种生物医学任务和数据类型。Biomni结合了基础的生物医学环境和先进的任务执行架构,能够动态选择工具、制定计划并通过生成和运行代码来执行任务。它在多个基准测试中表现出色,甚至超越了人类专家,展现了在加速科学发现、处理复杂数据集和生成可读报告方面的巨大潜力。Biomni的出现,有望显著减轻研究人员的负担,推动生物医学领域的创新。

💡 Biomni由斯坦福大学等机构的研究人员开发,是一个通用的生物医学AI智能体,旨在自动化生物医学研究的各个环节。

⚙️ Biomni的核心在于其结合了基础生物医学环境和先进的任务执行架构,能够动态选择工具、制定计划并执行任务。

🔬 Biomni在多个基准测试中表现出色,例如在LAB-Bench基准测试中,其DbQA准确率达到74.4%,SeqQA准确率达到81.9%,均超越了人类专家。

📊 Biomni能够处理复杂的多组学数据集,例如处理超过336,000个单核RNA-seq和ATAC-seq分析,并生成可解释的输出,如基因调控网络和基序富集分析。

🚀 Biomni具备处理大型数据集、构建复杂流程和生成人类可读报告的能力,有望加速生物医学发现,减少研究人员的工作量,并带来新的见解。

Biomedical research is a rapidly evolving field that seeks to advance human health by uncovering the mechanisms behind diseases, identifying new therapeutic targets, and developing effective treatments. This field encompasses diverse areas, including genetics, molecular biology, pharmacology, and clinical studies, which require specialized tools and in-depth expertise. The increasing complexity of biomedical data, experiments, and literature has created both opportunities and challenges. Researchers must integrate findings from genomics, proteomics, and other data sources to generate hypotheses, design experiments, and interpret results. The ability to efficiently manage this complexity is crucial for accelerating scientific discovery and translating findings into clinical applications.

The core challenges in biomedical research are the sheer volume of data, methods, and tools that must be managed to produce meaningful results. Researchers often face fragmented workflows, relying on numerous specialized tools that don’t integrate well with each other. This creates bottlenecks when attempting to design experiments, process large datasets, or interpret multimodal biomedical information. The problem is further compounded by the fact that expert human researchers are limited in availability, making it difficult to keep pace with the growing body of scientific knowledge. As a result, significant portions of biomedical data remain underutilized, and connections between findings across different subfields are often missed. Addressing these concerns requires a new approach that can scale expertise, handle data complexity, and support integrated workflows across various biomedical domains.

Existing tools for biomedical research often focus on narrow tasks such as specific gene analysis, protein structure prediction, or drug-target interaction studies. These tools require careful setup, domain-specific knowledge, and manual integration into broader workflows. While large language models (LLMs) have shown promise in tasks like biomedical question answering, they cannot typically interact with specialized tools or databases directly. Past efforts to create AI agents for biomedical tasks have relied on predefined workflows or templates, limiting their flexibility. Consequently, researchers have struggled to find AI systems that can adapt to diverse biomedical tasks, dynamically compose new workflows, or execute complex analyses end-to-end.

Researchers from Stanford University, Genentech, the Arc Institute, the University of Washington, Princeton University, and the University of California, San Francisco, introduced Biomni, a general-purpose biomedical AI agent. Biomni combines a foundational biomedical environment, Biomni-E1, with an advanced task-executing architecture, Biomni-A1. Biomni-E1 was constructed by mining tens of thousands of biomedical publications across 25 subfields, extracting 150 specialized tools, 105 software packages, and 59 databases, forming a unified biomedical action space. Biomni-A1 dynamically selects tools, formulates plans, and executes tasks by generating and running code, enabling the system to adapt to diverse biomedical problems. This integration of reasoning, code-based execution, and resource selection allows Biomni to perform a wide range of tasks autonomously, including bioinformatics analyses, hypothesis generation, and protocol design. Unlike static function-calling models, Biomni’s architecture allows it to flexibly interleave code execution, data querying, and tool invocation, creating a seamless pipeline for complex biomedical workflows.

Biomni-A1 uses an LLM-based tool selection mechanism to identify relevant resources based on user goals. It applies code as a universal interface to compose complex workflows with procedural logic, including loops, parallelization, and conditional steps. An adaptive planning strategy enables Biomni to iteratively refine plans as it executes tasks, ensuring context-aware and responsive behavior. Biomni’s performance has been rigorously evaluated through multiple benchmarks. On the LAB-Bench benchmark, Biomni achieved 74.4% accuracy in DbQA and 81.9% in SeqQA, outperforming human experts (74.7% and 78.8%, respectively). On the HLE benchmark covering 14 subfields, Biomni scored 17.3%, outperforming base LLMs by 402.3%, coding agents by 43.0%, and its own ablated variant by 20.4%. Real-world case studies demonstrated Biomni’s ability to autonomously generate 10-step pipelines analyzing 458 wearable sensor files autonomously, identifying a postprandial temperature increase of 2.19°C across individuals. It also analyzed 227 nights of sleep data, uncovering insights such as mid-week peaks in sleep efficiency and the importance of circadian regularity over total sleep duration.

Biomni’s ability to handle real-world research questions extends to complex multi-omics analyses, where it processed over 336,000 single-nucleus RNA-seq and ATAC-seq profiles from human embryonic skeletal data. Biomni constructed a 10-stage analysis pipeline to predict transcription factor-target gene links, filter results using chromatin accessibility data, and summarize findings in a structured report. The agent handled all aspects of the analysis, including code generation, error debugging, and results interpretation, producing outputs such as trajectory plots, heatmaps, and PCA biplots. These capabilities demonstrate Biomni’s capacity to manage large-scale, multi-modal datasets, identify biological patterns, and accelerate the path from raw data to testable hypotheses. By executing between 6 and 24 distinct steps per task, integrating up to 4 specialized tools, eight software packages, and three unique data lake items, Biomni mirrors the workflows of human scientists while drastically reducing manual effort.

Several Key Takeaways from the Research on Biomni include:

In conclusion, Biomni represents a major step forward in biomedical AI, combining reasoning, code execution, and dynamic resource integration into a single system. The researchers have demonstrated that it can generalize across tasks, execute complex workflows without manual templates, and produce results that rival or exceed human expertise in several areas. The system’s ability to handle large datasets, compose complex pipelines, and generate human-readable reports suggests it has the potential to significantly accelerate biomedical discovery, reduce the burden on researchers, and enable new insights.


Check out the Paper, Code and Try it here. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 95k+ ML SubReddit and Subscribe to our Newsletter.

The post Stanford Researchers Introduced Biomni: A Biomedical AI Agent for Automation Across Diverse Tasks and Data Types appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Biomni 生物医学 人工智能 AI研究
相关文章