• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

测序读长差异导致微生物群落间的人工功能差异。

Artificial functional difference between microbial communities caused by length difference of sequencing reads.

作者信息

Zhang Quan, Doak Thomas G, Ye Yuzhen

机构信息

School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA.

出版信息

Pac Symp Biocomput. 2012:259-70.

PMID:22174281
Abstract

Homology-based approaches are often used for the annotation of microbial communities, providing functional profiles that are used to characterize and compare the content and the functionality of microbial communities. Metagenomic reads are the starting data for these studies, however considerable differences are observed between the functional profiles-built from sequencing reads produced by different sequencing techniques-for even the same microbial community. Using simulation experiments, we show that such functional differences are likely to be caused by the actual difference in read lengths, and are not the results of a sampling bias of the sequencing techniques. Furthermore, the functional differences derived from different sequencing techniques cannot be fully explained by the read-count bias, i.e. 1) the higher fraction of unannotated shorter reads (i.e., "read length matters"), and 2) the different lengths of proteins in different functional categories. Instead, we show here that specific functional categories are under-annotated, because similarity-search-based functional annotation tools tend to miss more reads from functional categories that contain less conserved genes/proteins. In addition, the accuracy of functional annotation of short reads for different functions varies, further skewing the functional profiles. To address these issues, we present a simple yet efficient method to improve the frequency estimates of different functional categories in the functional profiles of metagenomes, based on the functional annotation of simulated reads from complete microbial genomes.

摘要

基于同源性的方法常用于微生物群落的注释,提供用于表征和比较微生物群落的内容和功能的功能概况。宏基因组读数是这些研究的起始数据,然而,即使对于相同的微生物群落,在由不同测序技术产生的测序读数构建的功能概况之间也观察到相当大的差异。通过模拟实验,我们表明这种功能差异可能是由读数长度的实际差异引起的,而不是测序技术的采样偏差的结果。此外,来自不同测序技术的功能差异不能完全由读数计数偏差来解释,即1)未注释的较短读数的比例较高(即“读数长度很重要”),以及2)不同功能类别中蛋白质的不同长度。相反,我们在此表明特定的功能类别注释不足,因为基于相似性搜索的功能注释工具往往会遗漏来自包含较少保守基因/蛋白质的功能类别的更多读数。此外,不同功能的短读数的功能注释准确性各不相同,进一步扭曲了功能概况。为了解决这些问题,我们提出了一种简单而有效的方法,基于来自完整微生物基因组的模拟读数的功能注释,来改善宏基因组功能概况中不同功能类别的频率估计。

相似文献

1
Artificial functional difference between microbial communities caused by length difference of sequencing reads.测序读长差异导致微生物群落间的人工功能差异。
Pac Symp Biocomput. 2012:259-70.
2
Metagenomics: read length matters.宏基因组学:读长很重要。
Appl Environ Microbiol. 2008 Mar;74(5):1453-63. doi: 10.1128/AEM.02181-07. Epub 2008 Jan 11.
3
MetaDomain: a profile HMM-based protein domain classification tool for short sequences.MetaDomain:一种基于隐马尔可夫模型轮廓的短序列蛋白质结构域分类工具。
Pac Symp Biocomput. 2012:271-82.
4
Comparative analysis of functional metagenomic annotation and the mappability of short reads.功能宏基因组注释与短读长可映射性的比较分析。
PLoS One. 2014 Aug 22;9(8):e105776. doi: 10.1371/journal.pone.0105776. eCollection 2014.
5
MetaGeneHunt for protein domain annotation in short-read metagenomes.元基因搜索在短读长宏基因组中进行蛋白质结构域注释。
Sci Rep. 2020 May 7;10(1):7712. doi: 10.1038/s41598-020-63775-1.
6
A user's guide to quantitative and comparative analysis of metagenomic datasets.宏基因组数据集定量与比较分析用户指南
Methods Enzymol. 2013;531:525-47. doi: 10.1016/B978-0-12-407863-5.00023-X.
7
Evaluating the Quantitative Capabilities of Metagenomic Analysis Software.评估宏基因组分析软件的定量能力。
Curr Microbiol. 2016 May;72(5):612-6. doi: 10.1007/s00284-016-0991-2. Epub 2016 Jan 30.
8
Bioinformatic progress and applications in metaproteogenomics for bridging the gap between genomic sequences and metabolic functions in microbial communities.生物信息学在宏蛋白质组学中的进展和应用,有助于弥合微生物群落中基因组序列和代谢功能之间的差距。
Proteomics. 2013 Oct;13(18-19):2786-804. doi: 10.1002/pmic.201200566. Epub 2013 Aug 7.
9
Intrinsic correlation of oligonucleotides: a novel genomic signature for metagenome analysis.寡核苷酸的内在相关性:一种用于宏基因组分析的新型基因组特征。
J Theor Biol. 2014 Jul 21;353:9-18. doi: 10.1016/j.jtbi.2014.02.039. Epub 2014 Mar 11.
10
Pre- and post-sequencing recommendations for functional annotation of human fecal metagenomes.人类粪便宏基因组功能注释的测序前和测序后建议。
BMC Bioinformatics. 2020 Feb 24;21(1):74. doi: 10.1186/s12859-020-3416-y.

引用本文的文献

1
Statistical correction for functional metagenomic profiling of a microbial community with short NGS reads.利用短读长NGS数据对微生物群落进行功能宏基因组分析的统计校正
J Appl Stat. 2018;45(14):2521-2535. doi: 10.1080/02664763.2018.1426741. Epub 2018 Jan 27.
2
A Statistical Approach to Correcting Cross-Annotations in a Metagenomic Functional Profile Generated by Short Reads.一种校正短读长生成的宏基因组功能谱中交叉注释的统计方法。
J Biom Biostat. 2014;5(4). doi: 10.4172/2155-6180.1000208. Epub 2014 Nov 10.
3
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.
环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
4
Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes.分子长读长测序有助于复杂土壤宏基因组的组装和基因组分箱。
mSystems. 2016 Jun 28;1(3). doi: 10.1128/mSystems.00045-16. eCollection 2016 May-Jun.
5
Comparative analysis of functional metagenomic annotation and the mappability of short reads.功能宏基因组注释与短读长可映射性的比较分析。
PLoS One. 2014 Aug 22;9(8):e105776. doi: 10.1371/journal.pone.0105776. eCollection 2014.
6
An artificial functional family filter in homolog searching in next-generation sequencing metagenomics.在下一代测序宏基因组学同源搜索中使用人工功能家族滤波器。
PLoS One. 2013;8(3):e58669. doi: 10.1371/journal.pone.0058669. Epub 2013 Mar 14.
7
Third-generation sequencing techniques and applications to drug discovery.第三代测序技术及其在药物发现中的应用。
Expert Opin Drug Discov. 2012 Mar;7(3):231-43. doi: 10.1517/17460441.2012.660145. Epub 2012 Feb 2.