MarkTechPost@AI 2024年10月22日
JAMUN: A Walk-Jump Sampling Model for Generating Ensembles of Molecular Conformations
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

JAMUN是一种新型机器学习模型,旨在解决现有生成蛋白质构象集合方法的不足。它能高效采样蛋白质构象集合,速度显著高于传统方法,且对新系统有良好的转移性,对蛋白质结构预测和药物发现有重要意义。

🎯JAMUN是为克服现有生成蛋白质构象集合方法的挑战而设计的新型机器学习模型,可对蛋白质原子坐标的3D点云进行采样,能以比传统方法更高的速度采样玻尔兹曼分布的任意蛋白质。

💪JAMUN的方法基于Walk-Jump采样概念,利用Langevin动力学进行‘walk’阶段,‘jump’步骤则投射回原始数据分布,使数据分布平滑以解决采样困难,同时保留MD数据中固有的物理先验。

✨JAMUN在两个氨基酸肽的分子动力学模拟数据集上进行训练,能快速采样小肽的构象集合,比标准MD模拟快很多,且与TBG模型相比有显著的加速和相当的准确性。

The dynamics of protein structures are crucial for understanding their functions and developing targeted drug treatments, particularly for cryptic binding sites. However, existing methods for generating conformational ensembles are plagued by inefficiencies or lack of generalizability to work beyond the systems they were trained on. Molecular dynamics (MD) simulations, the current standard for exploring protein movements, are computationally expensive and limited by short time-step requirements, making it difficult to capture the broader scope of protein conformational changes that occur over longer timescales.

Researchers from Prescient Design and Genentech have introduced JAMUN (walk-Jump Accelerated Molecular ensembles with Universal Noise), a novel machine-learning model designed to overcome these challenges by enabling efficient sampling of protein conformational ensembles. JAMUN extends Walk-Jump Sampling (WJS) to 3D point clouds, which represent protein atomic coordinates. By utilizing a SE(3)-equivariant denoising network, JAMUN can sample the Boltzmann distribution of arbitrary proteins at a speed significantly higher than traditional MD methods or current ML-based approaches. JAMUN also demonstrated a significant ability to transfer to new systems, meaning it can generate reliable conformational ensembles even for protein structures that were not part of its training dataset.

The proposed methodology is rooted in the concept of Walk-Jump Sampling, where noise is added to clean data, followed by training a neural network to denoise it, thereby allowing a smooth sampling process. JAMUN utilizes Langevin dynamics for the ‘walk’ phase, which is already a standard approach in Molecular dynamics MD simulations. The ‘jump’ step then projects back to the original data distribution, decoupling the process from starting over each time as is typically done with diffusion models. By decoupling the walk and jump steps, JAMUN smooths out the data distribution just enough to resolve sampling difficulties while retaining the physical priors inherent in MD data.

JAMUN was trained on a dataset of molecular dynamics simulations of two amino acid peptides and successfully generalized to unseen peptides. Results show that JAMUN can sample conformational ensembles of small peptides significantly faster than standard MD simulations. For instance, JAMUN generated conformational states of challenging capped peptides within an hour of computation, while traditional MD approaches required much longer to cover similar distributions. JAMUN was also compared against the Transferable Boltzmann Generators (TBG) model, showcasing a remarkable speedup and comparable accuracy, although it was limited to Boltzmann emulation rather than exact sampling.

JAMUN provides a powerful new approach to generating conformational ensembles of proteins, balancing efficiency with physical accuracy. Its ability to generate ensembles much faster than MD while maintaining reliable sampling makes it a promising tool for applications in protein structure prediction and drug discovery. Future work will focus on extending JAMUN to larger proteins and refining the denoising network for even faster sampling. By leveraging Walk-Jump Sampling, JAMUN offers a significant step towards a generalizable, transferable solution for protein conformational ensemble generation, crucial for both biological understanding and pharmaceutical innovation.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 50k+ ML SubReddit.

[Upcoming Live Webinar- Oct 29, 2024] The Best Platform for Serving Fine-Tuned Models: Predibase Inference Engine (Promoted)

The post JAMUN: A Walk-Jump Sampling Model for Generating Ensembles of Molecular Conformations appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

JAMUN 蛋白质构象 机器学习 药物发现
相关文章