MarkTechPost@AI 2024年11月19日
VirtuDockDL: A Deep Learning-Powered Platform for Accelerated Drug Discovery through Advanced Compound Screening and Binding Prediction
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

VirtuDockDL是一个基于Python的深度学习平台,旨在加速药物发现过程。它利用图神经网络(GNN)预测化合物的有效性,并在HER2数据集上实现了99%的准确率,优于DeepChem和AutoDock Vina等工具。该平台集成了分子图构建、虚拟筛选和化合物聚类等功能,提供了一个自动化框架,可以高效地识别潜在药物,推动人工智能驱动的药物研发。VirtuDockDL还具有用户友好的图形界面,支持分子上传、任务启动和结果下载,方便用户使用。该平台在多个目标上都展现出了优异的预测准确性和实用性,例如HER2(癌症)、TEM-1 β-内酰胺酶(细菌感染)和CYP51(念珠菌病)的抑制剂。

🤔 **基于图神经网络的化合物筛选:**VirtuDockDL的核心是利用图神经网络(GNN)来预测化合物的生物活性,通过将分子数据转换为图表示,GNN能够学习分子内部复杂的结构关系,并预测诸如分子活性或结合亲和力等特性。该平台在HER2数据集上实现了99%的准确率,优于DeepChem和AutoDock Vina等现有工具。

🧬 **自动化虚拟筛选和聚类:**VirtuDockDL集成了虚拟筛选和聚类工具,允许用户评估大型化合物库针对特定蛋白质靶标的活性。基于预测的活性,利用高斯混合模型(GMM)对筛选出的分子进行聚类,并通过轮廓系数和戴维斯-博尔丁指数评估聚类质量。

🧪 **蛋白质结构优化和配体对接:**该平台支持使用OpenMM进行蛋白质结构优化,并使用AutoDock Vina进行配体对接,从而预测分子结合亲和力。例如,在马尔堡病毒研究中,VirtuDockDL利用VP35蛋白作为案例研究,准确地分类了化合物,并提供了包括AUC、准确率和F1分数等关键指标的虚拟筛选和对接结果,为潜在的VP35抑制剂的药物发现提供了可行的见解。

🖥️ **用户友好的图形界面:**VirtuDockDL基于Flask框架,提供了一个用户友好的图形界面,支持分子上传、任务启动和结果下载,将功能组织成选项卡,方便用户使用。该平台的自动化设计和易用性使其成为推进药物研发和应对紧迫健康挑战的有效工具。

Drug discovery is a costly, lengthy process with high failure rates, as only one viable drug typically emerges from a million screened compounds. Advanced high-throughput (HTS) and ultra-high-throughput screening (uHTS) technologies allow rapid testing of large compound libraries, enabling Pharma and Biotech companies to explore more chemical compounds and novel biological targets. Despite these technologies, challenges still need to be addressed, including limited breakthroughs in identifying new drug targets and data quality issues. ML and DL now offer promising solutions, enhancing drug discovery through data-driven insights, feature extraction, and predictive capabilities to identify effective drug candidates more efficiently.

VirtuDockDL, developed by researchers from the Institute of Molecular Biology and Biotechnology at The University of Lahore, the Integrative Omics and Molecular Modeling Laboratory at Government College University Faisalabad (GCUF), Shenzhen University and Taif University, is a Python-based platform leveraging deep learning to streamline drug discovery. Utilizing a Graph Neural Network (GNN) for predicting compound effectiveness, VirtuDockDL achieved 99% accuracy on the HER2 dataset, surpassing tools like DeepChem and AutoDock Vina. This platform’s automated framework integrates molecular graph construction, virtual screening, and compound clustering, enabling efficient identification of potential drugs and advancing AI-driven pharmaceutical research.

VirtuDockDL is a comprehensive pipeline designed to streamline the prediction and screening of biologically active compounds using a GNN. Initially encoded as SMILES strings, molecular data is transformed into graph representations through RDKit and processed by PyTorch Geometric’s GNN architecture. This transformation allows the GNN to learn complex structural relationships within molecules and predict properties like molecular activity or binding affinity. The architecture incorporates several layers of graph convolution to capture molecular features at different hierarchical levels, along with batch normalization, dropout, and residual connections, which stabilize training and enhance predictive accuracy. This process merges graph-based representations with cheminformatics descriptors and fingerprints, providing a robust feature set for accurate activity prediction.

The application also features virtual screening and clustering tools, enabling users to evaluate large compound libraries against specific protein targets. Based on their predicted activity, the clustering of screened molecules is accomplished using Gaussian Mixture Models (GMM), with clustering quality assessed via Silhouette and Davies-Bouldin scores. The pipeline supports protein structure refinement through OpenMM and ligand docking with AutoDock Vina, allowing molecular binding affinity predictions. VirtuDockDL was applied to Marburg virus research, using the VP35 protein as a case study. Positive and decoy datasets were generated, and the GNN model accurately classified compounds with cross-entropy loss and RMSprop optimization. Virtual screening and docking results, including key metrics like AUC, accuracy, and F1-score, are automatically visualized, providing actionable insights into potential VP35 inhibitors for drug discovery.

VirtuDockDL’s user-friendly GUI, based on the Flask framework, supports molecule uploads, task initiation, and result downloads, organizing features into tabs for ease of use. A GNN model was trained using active/inactive VP35 protein molecules, achieving high accuracy (97.79%) with strong metrics (AUC 0.9972). Non-covalent inhibitors from ZINC and PubChem databases were re-screened, identifying 146 potential candidates. Further tests on HER2, beta-lactamase, and CYP51 datasets demonstrated VirtuDockDL’s superior performance in binding affinity predictions compared to PyRMD, RosettaVS, MzDOCK, AutoDock Vina, and Glide. VirtuDockDL’s integration of ligand- and structure-based screening provides efficient and accurate virtual screening.

In conclusion, VirtuDockDL is a new Python-based web platform designed to streamline drug discovery using deep learning. By employing a Graph Neural Network for compound screening, it has shown outstanding predictive accuracy and practical utility across multiple targets, including inhibitors for HER2 (cancer), TEM-1 beta-lactamase (bacterial infections), and CYP51 (Candidiasis). It achieved superior results in benchmarking, surpassing tools like DeepChem and AutoDock Vina with a 99% accuracy and an F1 score of 0.992 on the HER2 dataset. This platform combines full automation and user-friendly design, making it an efficient, cost-effective tool for advancing pharmaceutical research and addressing urgent health challenges.


Check out the Paper. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 55k+ ML SubReddit.

[FREE AI WEBINAR] Implementing Intelligent Document Processing with GenAI in Financial Services and Real Estate TransactionsFrom Framework to Production

The post VirtuDockDL: A Deep Learning-Powered Platform for Accelerated Drug Discovery through Advanced Compound Screening and Binding Prediction appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

药物发现 深度学习 图神经网络 虚拟筛选 人工智能
相关文章