少点错误 7小时前
Multispecies Metagenomic Calibration
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文探讨了如何利用宏基因组测序数据,对污水样本中的病原体进行定量分析。由于宏基因组数据具有相对性,需要进行校准才能进行定量观察。文章介绍了校准过程,包括考虑样本中人类贡献、降雨量、饮食、温度和样本浓度等因素的影响。此外,文章还讨论了使用PMMoV等指标进行校准的方法,以及利用深度宏基因组测序数据,结合多种物种进行更精确校准的可能性,以减少噪声,提高分析的准确性。

🦠 宏基因组测序数据本质上是相对的,每个观察结果是样本中所有观察结果的一部分,需要进行校准才能进行定量分析。

💧 影响污水样本的因素众多,包括人类贡献、降雨量、饮食、温度和样本浓度等,这些因素会导致样本间的差异。

🌶️ 可以通过标准化人类排泄的其他物种的数量来进行校准,例如使用PMMoV。PMMoV存在于辣椒中,人们食用辣椒后会排泄PMMoV,因此PMMoV的含量与人类贡献成正比。

🔬 深度宏基因组测序能够同时检测到成千上万的物种,通过结合PMMoV、crAssphage、其他人肠道噬菌体、人类肠道细菌等多种指标,可以更精确地校准样本,减少噪声,提高分析的准确性。

Published on June 25, 2025 2:50 AM GMT

Cross-posted from my NAONotebook.

This is something I wrote internally in late-2022. Sharing it nowwith light edits, additional context, and updated links after the ideacame up at the Microbiologyof the Built Environment conference I'm attending this week.

Metagenomic sequencing data is fundamentally relative: eachobservation is a fraction of all the observations in a sample. If youwant to make quantitative observations, however, like understandingwhether there's been an increase in the number of people with someinfection, you need to calibrate these observations. For example,there could be variation between samples due to variation in:

If you're trying to understand growth patterns all of this is noise;can we reverse this variation? I'm using "calibration" to refer tothis process of going from raw per-sample pathogen read counts toestimates of how much of each pathogen was originally shed into sewage.

The simplest option is not to do any calibration, and just considerraw relative abundance: counts relative to the total number of readsin the sample. For example, this is what MarcJohnson and Dave O'Connor are doing.

It seems like you ought to be able to do better if you normalize bythe number of reads matching some other species humans excrete. It'scommon to use PMMoVfor this: peppers are commonly infected with PMMoV, people eatpeppers, people excrete PMMoV. All else being equal, the amount ofPMMoV in a sample should be proportional to the human contribution tothe sample. This is especially common in PCR work, where you take aPCR measurement of your target, and then present it relative to a PCRmeasurement of PMMoV. For example, this is what WastewaterSCAN does.

Because the NAO is doing verydeep metagenomic sequencing, around 1B read pairs (300Gbp) persample, we ought to be able to calibrate against many species at once.PMMoV is commonly excreted, but so are other tobamoviruses,crAssphage, other human gut bacteriophages, human gut bacteria, etc.We pick up thousands of other species, and should be able to combinethose measurements to get a much less noisy measurement of the humancontribution to a sample.

This isn't something the NAO has been able to look into yet, but Istill think it's quite promising.

Comment via: substack



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

宏基因组测序 污水监测 病原体 PMMoV 校准
相关文章