Unite.AI 02月05日
AI Just Simulated 500 Million Years of Evolution – And Created a New Protein!
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

人工智能领域迎来重大突破!ESM3模型通过模拟5亿年的蛋白质进化,成功创造出一种全新的荧光蛋白esmGFP。这项技术不仅加速了蛋白质工程的进程,还深化了我们对生物进化本质的理解。ESM3通过分析蛋白质的序列、结构和功能,设计出自然界中前所未见的蛋白质,为药物发现、生物工程等领域开辟了新的可能性。该研究强调了负责任地发展人工智能的重要性,以及对AI设计蛋白质进行全面测试的必要性,以确保其安全性和可靠性。

🧬ESM3是一种多模态语言模型,通过分析蛋白质的序列、结构和功能来理解和生成蛋白质。与只能预测现有蛋白质结构的AlphaFold不同,ESM3本质上是一种蛋白质工程模型,允许研究人员指定功能和结构要求来设计全新的蛋白质。

🧪ESM3通过链式思考推理方法优化序列,创造出一种新型绿色荧光蛋白(esmGFP)。与天然蛋白质相比,esmGFP具有显著差异,其序列与最接近的天然同源物仅有58%的同一性,相当于超过5亿年的进化差异。

🔬AI驱动的蛋白质设计过程包括:提示AI提供序列和结构线索以引导ESM3朝着与荧光相关的特征发展;生成新型蛋白质,ESM3探索了大量的潜在序列以产生数千种候选蛋白质;过滤和改进,对最有希望的设计进行过滤和合成以进行实验室测试;在活细胞中进行验证,在细菌中表达选定的AI设计的蛋白质以确认其荧光和功能。

💡ESM3的突破为药物发现、生物工程等领域带来了新的机遇。AI设计的蛋白质可以加速药物发现过程,并为医疗保健、环境保护甚至新材料的开发创造定制蛋白质。此外,ESM3将AI定位为进化模拟器,能够探索自然界之外的进化可能性。

Evolution has been fine-tuning life at the molecular level for billions of years. Proteins, the fundamental building blocks of life, have evolved through this process to perform various biological functions, from fighting infections to digesting food. These complex molecules comprise long chains of amino acids arranged in precise sequences that dictate their structure and function. While nature has produced an extraordinary diversity of proteins, understanding their structure and designing entirely new proteins has long been a complex challenge for scientists.

Recent advancements in artificial intelligence are transforming our ability to tackle some of biology’s most significant challenges. Previously, AI was used to predict how a given protein sequence would fold and behave – a complex challenge due to the vast number of configurations. Recently, AI has advanced to generate entirely new proteins at an unprecedented scale. This milestone has been achieved with ESM3, a multimodal generative language model designed by EvolutionaryScale. Unlike conventional AI systems designed for text processing, ESM3 has been trained to understand protein sequences, structures, and functions. What makes it truly remarkable is its ability to simulate 500 million years of evolution—a feat that has led to the creation of a completely new fluorescent protein, something never before seen in nature.

This breakthrough is a significant step toward making biology more programmable, opening new possibilities for designing custom proteins with applications in medicine, materials science, and beyond. In this article, we explore how ESM3 works, what it has achieved, and why this advancement is reshaping our understanding of biology and evolution.

Meet ESM3: The AI That Simulates Evolution

ESM3 is a multimodal language model trained to understand and generate proteins by analyzing their sequences, structures, and functions. Unlike AlphaFold, which can predict the structure of existing proteins, ESM3 is essentially a protein engineering model, allowing researchers to specify functional and structural requirements to design entirely new proteins.

The model holds deep knowledge of protein sequences, structures, and functions along with the ability to generate proteins through an interaction with users. This capability empowers the model to generate proteins that may not exist in nature yet remain biologically viable. Creating a novel green fluorescent protein (esmGFP) is a striking demonstration of this capability. Fluorescent proteins, initially discovered in jellyfish and corals, are widely used in medical research and biotechnology. To develop esmGFP, researchers provided ESM3 with key structural and functional characteristics of known fluorescent proteins. The model then iteratively refined the design, applying a chain-of-thought reasoning approach to optimize the sequence. While natural evolution could take millions of years to produce similar protein, ESM3 accelerates this process to achieve it in days or weeks.

The AI-Driven Protein Design Process

Here is how researchers have used ESM3 to develop esmGFP:

    Prompting the AI – Initially, they input sequence and structural cues to guide ESM3 toward fluorescence-related features.Generating Novel Proteins – ESM3 explored a vast space of potential sequences to produce thousands of candidate proteins.Filtering and Refinement – The most promising designs were filtered and synthesized for laboratory testing.Validation in Living Cells – Selected AI-designed proteins were expressed in bacteria to confirm their fluorescence and functionality.

This process has resulted to a fluorescent protein (esmGFP) unlike anything in nature.

How esmGFP Compares to Natural Proteins

What makes esmGFP extraordinary is how distant it is from known fluorescent proteins. While most newly discovered GFPs have slight variations from existing ones, esmGFP has a sequence identity of only 58% to its closest natural relative. Evolutionarily, such a difference corresponds to a diverging time of over 500 million years.

To put this into perspective, the last time proteins with similar evolutionary distances emerged, dinosaurs had not yet appeared, and multicellular life was still in its early stages. This means AI has not just accelerated evolution – it has simulated an entirely new evolutionary pathway, producing proteins that nature might never have created.

Why This Discovery Matters

This development is a significant step forward in protein engineering and deepens our understanding of evolution. By simulating millions of years of evolution in just days, AI is opening doors to exciting new possibilities:

Ethical Considerations and Responsible AI Development

While the potential benefits of AI-driven protein engineering are immense, this technology also raises ethical and safety questions. What happens when AI starts designing proteins beyond human understanding? How do we ensure these proteins are safe for medical or environmental use?

We need to focus on responsible AI development and thorough testing to tackle these concerns. AI-generated proteins, like esmGFP, should undergo extensive laboratory testing before being considered for real-world applications. Additionally, ethical frameworks for AI-driven biology are being developed to ensure transparency, safety, and public trust.

The Bottom Line

The launch of ESM3 is a vital development in the field of biotechnology. ESM3 demonstrates that evolution shouldn’t be a slow, trial-and-error process. Compressing 500 million years of protein evolution into just days opens a future where scientists can design brand-new proteins with incredible speed and accuracy. The development of ESM3 means that we can not just use AI to understand biology but also to reshape it.  This breakthrough helps us to advance our ability to program biology the way we program software, unlocking possibilities we’re only beginning to imagine.

The post AI Just Simulated 500 Million Years of Evolution – And Created a New Protein! appeared first on Unite.AI.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

ESM3 蛋白质工程 人工智能 进化模拟 esmGFP
相关文章