MarkTechPost@AI 2024年11月08日
AI2BMD: A Quantum-Accurate Machine Learning Approach for Large-Scale Biomolecular Dynamics
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

AI2BMD是一个基于人工智能的系统,用于模拟大型生物分子,并达到从头算的精度。它结合了蛋白质片段化技术和机器学习力场,能够高效准确地计算包含超过10,000个原子的蛋白质的能量和力。AI2BMD在模拟速度上远超传统的DFT方法,能够进行数百纳秒的模拟,捕捉蛋白质折叠、展开和构象动力学等过程。此外,该系统在热力学预测方面与实验数据高度吻合,为补充实验研究和推动生物医学研究提供了宝贵的工具。

🤔**蛋白质片段化技术:**AI2BMD将蛋白质分解成重叠的二肽片段,每个片段包含主链原子和相邻氨基酸的部分原子,这种方法确保了对各种蛋白质的全面覆盖,并通过优化原子位置来准确计算蛋白质能量和原子力。

🧬**机器学习力场:**AI2BMD利用名为ViSNet的深度学习模型,基于原子序数和坐标预测能量和原子力。该模型通过大量二肽构象样本进行训练,以捕捉蛋白质结构的多样性,并采用超参数优化和提前停止技术来提高准确性。

📈**热力学预测的准确性:**AI2BMD在蛋白质热力学分析中表现出色,能够准确预测蛋白质的熔点(Tm)和自由能(ΔG)等参数,其预测结果与实验数据高度一致,优于传统的分子力学(MM)方法。

🧪**应用于生物化学研究:**AI2BMD可用于进行化学自由能计算,例如pKa预测,并且由于其从头算方法能够对整个蛋白质进行建模,因此在复杂蛋白质和动态状态的模拟中具有优势,为药物发现、蛋白质设计和酶工程等领域提供了强大的工具。

🚀**高效模拟:**AI2BMD的模拟速度虽然仍低于经典MD,但其效率远超DFT方法,并且可以通过未来的优化和应用于其他生物分子系统来进一步提高,使其成为生物分子模拟领域的一项有前景的技术。

Biomolecular dynamics simulations are crucial for life sciences, offering insights into molecular interactions. While classical molecular dynamics (MD) simulations are efficient, they lack chemical precision. Methods like density functional theory (DFT) achieve high accuracy but are too computationally intense for large biomolecules. MD simulations allow observation of molecular behavior, with classical MD using interatomic potentials and ab initio MD (AIMD) deriving forces from electronic structures. AIMD’s scalability issues limit its use in biomolecular studies. Machine learning force fields (MLFFs), trained on DFT-level data, promise accuracy at lower costs, though generalization across varied molecular conformations remains challenging.

Researchers from Microsoft Research in Beijing introduced AI2BMD, an AI-based system for simulating large biomolecules with ab initio accuracy. AI2BMD utilizes a protein fragmentation technique and a machine learning force field, allowing it to accurately compute energy and forces for proteins with over 10,000 atoms. This system is vastly more efficient than traditional DFT, reducing simulation times by orders of magnitude. AI2BMD can conduct hundreds of nanoseconds of simulations, capturing protein folding, unfolding, and conformational dynamics. Its thermodynamic predictions align closely with experimental data, making it a valuable tool for complementing wet lab experiments and advancing biomedical research.

The protein fragmentation approach builds on the foundational structure of amino acids in proteins, where each amino acid contains a main chain of atoms (Cα, C, O, N, and H) and a distinct side chain. To create a model that applies broadly to various proteins, each amino acid is treated as a dipeptide, capped with Ace and Nme groups at its ends. This approach, based on overlapping fragments of dipeptides, helps ensure comprehensive protein coverage. Using a sliding window, protein chains are divided into these dipeptides, where each fragment includes main chain atoms and partial atoms from adjacent amino acids. This method accurately calculates protein energies and atomic forces by adding hydrogens as required for Cα bonds and optimizing positions using a quasi-Newton algorithm. This generalizable method allows the systematic application to all proteins, reducing complexities while maximizing model accuracy.

The training dataset for the AI2BMD potential involves sampling millions of dipeptide conformations to capture the variety in protein structures. A deep learning model called ViSNet was trained using this extensive dataset to predict the energy and atomic forces based on atomic numbers and coordinates. The model used specific hyperparameters to optimize accuracy and was trained with early-stopping techniques. Simulations based on the AI2BMD potential are processed using a cloud-compatible AI-driven simulation program, enabling flexible deployment across computing environments. This system supports parallelized simulation processes and automatically preserves progress on cloud storage, ensuring robust and efficient handling of protein dynamics modeling.

AI2BMD showcases significant potential in protein property estimation, especially for thermodynamic analysis of fast-folding proteins. AI2BMD could categorize structures into folded and unfolded states by simulating various protein types and accurately predicting potential energy values. Its melting temperature (Tm) estimations for proteins like the WW domain and NTL9 closely matched experimental data, frequently outperforming traditional molecular mechanics (MM) methods. Additionally, AI2BMD’s calculations for free energy (ΔG), enthalpy, and heat capacity were highly consistent with experimental findings, reinforcing its accuracy. This robustness in thermodynamic estimation highlights AI2BMD’s value as an advanced tool for protein analysis.

In addition to thermodynamics, AI2BMD proved effective in alchemical free-energy calculations, such as pKa prediction, and is valuable in biochemical research. Unlike traditional QM-MM methods that restrict calculations to preset regions, AI2BMD’s ab initio approach allows full-protein modeling without boundary inconsistencies, making it versatile for complex proteins and dynamic states. Although AI2BMD’s speed is still slower than classical MD, future optimizations and applications to other biomolecular systems could enhance its efficiency. AI2BMD’s adaptability makes it a promising tool for drug discovery, protein design, and enzyme engineering, offering highly accurate simulations for various biomolecular applications.


Check out the Paper and Details. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

[Sponsorship Opportunity with us] Promote Your Research/Product/Webinar with 1Million+ Monthly Readers and 500k+ Community Members

The post AI2BMD: A Quantum-Accurate Machine Learning Approach for Large-Scale Biomolecular Dynamics appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI2BMD 生物分子动力学 机器学习 蛋白质模拟 从头算
相关文章