MarkTechPost@AI 2024年08月03日
SPRITE (Spatial Propagation and Reinforcement of Imputed Transcript Expression): Enhancing Spatial Gene Expression Predictions and Downstream Analyses Through Meta-Algorithmic Integration
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

SPRITE是一种元算法,旨在通过在基因相关网络和空间邻域图上传播信息来增强空间基因表达预测。SPRITE通过两个步骤来改进现有方法的预测,从而提高空间基因表达预测的准确性,并改进细胞聚类、可视化和分类等下游分析的性能。

🧬 **SPRITE 算法:**SPRITE 通过两个关键步骤来增强空间基因表达预测,即“Reinforce”步骤和“Smooth”步骤。 “Reinforce”步骤通过基因相关网络传播预测误差,以使用迭代平滑过程校正目标基因预测。该网络基于预测基因表达之间的斯皮尔曼等级相关性构建。 “Smooth”步骤通过基于细胞质心之间的欧氏距离并根据细胞类型相似性进行调整的空间邻域图传播预测来进一步优化预测。

📊 **性能评估:**研究人员使用包含来自四种物种(人类、小鼠、果蝇和蝾螈)的四个空间转录组学和 RNA-seq 数据的 11 个基准数据集对 SPRITE 算法进行了测试。SPRITE 与三种空间基因表达预测方法(SpaGE、Tangram 和 Harmony-kNN)一起评估,这些方法都利用不同的方法来对齐和预测基因表达。这些预测的准确性(在应用 SPRITE 之前和之后)使用 PCC 和平均绝对误差 (MAE) 来衡量。

📈 **下游分析的改进:**SPRITE 预测通过这些步骤增强,并对其对下游分析(如细胞聚类、可视化和分类)的影响进行了评估,证明了预测准确性和生物推断质量的提高。

💡 **SPRITE 的优势:**SPRITE 是一种用途广泛的元算法,旨在增强空间基因表达预测,通过结合“Reinforce”和“Smooth”步骤来提高各种预测方法的准确性。它改进了基因表达预测,并增强了下游分析,如细胞聚类、可视化和分类。令人惊讶的是,SPRITE 有时甚至胜过地面真实数据,这表明它可能消除基因表达的噪声。SPRITE 可扩展,其复杂性可以通过使用的交叉验证折叠数量进行调整。

🚀 **未来研究:**未来的研究可以探索将空间和基因相关信息直接整合到预测方法中,以及将 SPRITE 扩展到其他数据类型,如空间蛋白质组学。

Spatially resolved single-cell transcriptomics offers insights into gene expression within tissues, but current technologies are limited by their ability to measure only a small number of genes. To address this, algorithms have been developed to predict or impute the expression of additional genes. These methods often use paired single-cell RNA sequencing data, embedding spatial and RNA-seq data together to make predictions. However, most approaches still need to fully utilize the relational information between genes (like co-expression) or cells (like spatial proximity). Incorporating this relational data could improve the accuracy of gene expression predictions and enhance subsequent biological analyses.

Stanford and Harvard University researchers have developed SPRITE (Spatial Propagation and Reinforcement of Imputed Transcript Expression), a meta-algorithm designed to enhance spatial gene expression predictions. SPRITE refines predictions from existing methods by propagating information across gene correlation networks and spatial neighborhood graphs. This two-step process improves the accuracy of spatial gene expression predictions, leading to better performance in downstream analyses like cell clustering, visualization, and classification. SPRITE can be integrated into spatial transcriptomics data analysis to enhance the quality of inferences based on predicted gene expression.

The SPRITE algorithm was tested using eleven benchmark datasets that combined spatial transcriptomics with RNA-seq data from four species: human, mouse, fruit fly, and axolotl. These datasets, which utilized various technologies and tissue types, were chosen to maintain consistency within species and tissue categories. Before being used, the RNA-seq data underwent normalization and log transformation. SPRITE was evaluated with three spatial gene expression prediction methods—SpaGE, Tangram, and Harmony-kNN—each utilizing distinct approaches for aligning and predicting gene expression. The accuracy of these predictions, both before and following the application of SPRITE, was measured using the PCC and mean absolute error (MAE).

SPRITE operates in two key steps: the “Reinforce” step and the “Smooth” step. The Reinforce step propagates prediction errors across a gene correlation network to correct target gene predictions using an iterative smoothing process. This network is constructed based on Spearman rank correlations between predicted gene expressions. The Smooth step further refines the predictions by propagating them across a spatial neighborhood graph based on the Euclidean distances between cell centroids and adjusted for cell-type similarity. The SPRITE predictions, enhanced through these steps, were evaluated for their impact on downstream analyses such as cell clustering, visualization, and classification, demonstrating improvements in prediction accuracy and the quality of biological inferences.

SPRITE is a meta-algorithm that enhances spatial gene expression predictions by correcting errors through a gene correlation network (“Reinforce”) and smoothing predictions across a spatial neighborhood graph (“Smooth”). Applied to predictions from methods like SpaGE, Tangram, and Harmony-kNN across various datasets, SPRITE generally improved prediction accuracy, reducing mean absolute error and often increasing correlation with ground truth data. Both components of SPRITE are essential, as their combination yields better results. Moreover, SPRITE enhances downstream tasks such as cell clustering, data visualization, and cell type classification, often outperforming models trained on the original measured data.

SPRITE is a versatile meta-algorithm designed to enhance spatial gene expression predictions, improving the accuracy of various prediction methods by combining “Reinforce” and “Smooth” steps. It improves gene expression predictions and enhances downstream analyses like cell clustering, visualization, and classification. Surprisingly, SPRITE sometimes outperforms even ground truth data, suggesting it may de-noise gene expression. SPRITE is scalable, with its complexity adjustable by the number of cross-validation folds used. Future research could explore integrating spatial and gene correlation information directly into prediction methods and extending SPRITE to other data types like spatial proteomics.


Check out the Paper and GitHub. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 47k+ ML SubReddit

Find Upcoming AI Webinars here


The post SPRITE (Spatial Propagation and Reinforcement of Imputed Transcript Expression): Enhancing Spatial Gene Expression Predictions and Downstream Analyses Through Meta-Algorithmic Integration appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

SPRITE 空间基因表达 元算法 生物信息学 基因相关网络 空间邻域图
相关文章