K-shuff：一种用于表征基因文库中结构和组成多样性的新算法。

K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

作者信息

Jangid Kamlesh, Kao Ming-Hung, Lahamge Aishwarya, Williams Mark A, Rathbun Stephen L, Whitman William B

机构信息

Department of Microbiology, University of Georgia, Athens, Georgia, United States of America.

Microbial Culture Collection, National Centre for Cell Science, Savitribai Phule Pune University, Pune, Maharashtra, India.

出版信息

PLoS One. 2016 Dec 2;11(12):e0167634. doi: 10.1371/journal.pone.0167634. eCollection 2016.

DOI:10.1371/journal.pone.0167634

PMID:27911946

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5135132/

Abstract

K-shuff is a new algorithm for comparing the similarity of gene sequence libraries, providing measures of the structural and compositional diversity as well as the significance of the differences between these measures. Inspired by Ripley's K-function for spatial point pattern analysis, the Intra K-function or IKF measures the structural diversity, including both the richness and overall similarity of the sequences, within a library. The Cross K-function or CKF measures the compositional diversity between gene libraries, reflecting both the number of OTUs shared as well as the overall similarity in OTUs. A Monte Carlo testing procedure then enables statistical evaluation of both the structural and compositional diversity between gene libraries. For 16S rRNA gene libraries from complex bacterial communities such as those found in seawater, salt marsh sediments, and soils, K-shuff yields reproducible estimates of structural and compositional diversity with libraries greater than 50 sequences. Similarly, for pyrosequencing libraries generated from a glacial retreat chronosequence and Illumina® libraries generated from US homes, K-shuff required >300 and 100 sequences per sample, respectively. Power analyses demonstrated that K-shuff is sensitive to small differences in Sanger or Illumina® libraries. This extra sensitivity of K-shuff enabled examination of compositional differences at much deeper taxonomic levels, such as within abundant OTUs. This is especially useful when comparing communities that are compositionally very similar but functionally different. K-shuff will therefore prove beneficial for conventional microbiome analysis as well as specific hypothesis testing.

摘要

K-shuff是一种用于比较基因序列文库相似性的新算法，可提供结构和组成多样性的度量以及这些度量之间差异的显著性。受用于空间点模式分析的Ripley's K函数启发，库内K函数（IKF）测量文库内的结构多样性，包括序列的丰富度和整体相似性。交叉K函数（CKF）测量基因文库之间的组成多样性，反映共享的操作分类单元（OTU）数量以及OTU的整体相似性。然后，蒙特卡罗测试程序能够对基因文库之间的结构和组成多样性进行统计评估。对于来自复杂细菌群落（如海水、盐沼沉积物和土壤中的群落）的16S rRNA基因文库，K-shuff对于序列数大于50的文库能够产生可重复的结构和组成多样性估计值。同样，对于由冰川消退时间序列生成的焦磷酸测序文库和由美国家庭生成的Illumina®文库，K-shuff分别需要每个样本>300和100个序列。功效分析表明，K-shuff对Sanger或Illumina®文库中的微小差异敏感。K-shuff的这种额外敏感性使得能够在更深的分类水平上检查组成差异，例如在丰富的OTU内。在比较组成非常相似但功能不同的群落时，这特别有用。因此，K-shuff将被证明对传统的微生物组分析以及特定的假设检验有益。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5efc/5135132/a854b4d0ce0b/pone.0167634.g001.jpg

相似文献

K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

PLoS One. 2016 Dec 2;11(12):e0167634. doi: 10.1371/journal.pone.0167634. eCollection 2016.

Bacterial diversity in aquatic and other environments: what 16S rDNA libraries can tell us.

FEMS Microbiol Ecol. 2004 Feb 1;47(2):161-77. doi: 10.1016/S0168-6496(03)00257-5.

Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness.

Appl Environ Microbiol. 2005 Mar;71(3):1501-6. doi: 10.1128/AEM.71.3.1501-1506.2005.

[Prokaryotic microbial diversity of the ancient salt deposits in the Kunming Salt Mine, P.R. China].

Wei Sheng Wu Xue Bao. 2007 Apr;47(2):295-300.

Bacterial diversity in the bacterioneuston (sea surface microlayer): the bacterioneuston through the looking glass.

Environ Microbiol. 2005 May;7(5):723-36. doi: 10.1111/j.1462-2920.2004.00736.x.

Bacterial community structure in cooling water and biofilm in an industrial recirculating cooling water system.

Water Sci Technol. 2013;68(4):940-7. doi: 10.2166/wst.2013.334.

Characterization of microbial community structure in Gulf of Mexico gas hydrates: comparative analysis of DNA- and RNA-derived clone libraries.

Appl Environ Microbiol. 2005 Jun;71(6):3235-47. doi: 10.1128/AEM.71.6.3235-3247.2005.

Bacterial diversity of metagenomic and PCR libraries from the Delaware River.

Environ Microbiol. 2005 Dec;7(12):1883-95. doi: 10.1111/j.1462-2920.2005.00762.x.

Bacterial diversity of water and sediment in the Changjiang estuary and coastal area of the East China Sea.

FEMS Microbiol Ecol. 2009 Nov;70(2):80-92. doi: 10.1111/j.1574-6941.2009.00772.x. Epub 2009 Aug 28.

Microbial community of salt crystals processed from Mediterranean seawater based on 16S rRNA analysis.

Can J Microbiol. 2010 Jan;56(1):44-51. doi: 10.1139/w09-102.

引用本文的文献

Unprecedented bacterial community richness in soybean nodules vary with cultivar and water status.

Microbiome. 2019 Apr 16;7(1):63. doi: 10.1186/s40168-019-0676-8.

Structure and Diversity of Soil Bacterial Communities in Offshore Islands.

Sci Rep. 2019 Mar 20;9(1):4689. doi: 10.1038/s41598-019-41170-9.

Effects of Reforestation on the Structure and Diversity of Bacterial Communities in Subtropical Low Mountain Forest Soils.

Front Microbiol. 2018 Aug 21;9:1968. doi: 10.3389/fmicb.2018.01968. eCollection 2018.

本文引用的文献

Massively parallel multiplex DNA sequencing for specimen identification using an Illumina MiSeq platform.

Sci Rep. 2015 Apr 17;5:9687. doi: 10.1038/srep09687.

Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences.

Nat Rev Microbiol. 2014 Sep;12(9):635-45. doi: 10.1038/nrmicro3330.

Soil bacterial community succession during long-term ecosystem development.

Mol Ecol. 2013 Jun;22(12):3415-24. doi: 10.1111/mec.12325.

SASI-Seq: sample assurance Spike-Ins, and highly differentiating 384 barcoding for Illumina sequencing.

BMC Genomics. 2014 Feb 7;15(1):110. doi: 10.1186/1471-2164-15-110.

Home life: factors structuring the bacterial diversity found within and between homes.

PLoS One. 2013 May 22;8(5):e64133. doi: 10.1371/journal.pone.0064133. Print 2013.

Edge principal components and squash clustering: using the special structure of phylogenetic placement data for sample comparison.

PLoS One. 2013;8(3):e56859. doi: 10.1371/journal.pone.0056859. Epub 2013 Mar 11.

Not all sequence tags are created equal: designing and validating sequence identification tags robust to indels.

PLoS One. 2012;7(8):e42543. doi: 10.1371/journal.pone.0042543. Epub 2012 Aug 10.

Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities.

Appl Environ Microbiol. 2009 Dec;75(23):7537-41. doi: 10.1128/AEM.01541-09. Epub 2009 Oct 2.

Statistical methods for detecting differentially abundant features in clinical metagenomic samples.

PLoS Comput Biol. 2009 Apr;5(4):e1000352. doi: 10.1371/journal.pcbi.1000352. Epub 2009 Apr 10.

The diverse bacterial community in intertidal, anaerobic sediments at Sapelo Island, Georgia.

Microb Ecol. 2009 Aug;58(2):244-61. doi: 10.1007/s00248-008-9481-9. Epub 2009 Feb 11.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr
超能文献

K-shuff：一种用于表征基因文库中结构和组成多样性的新算法。

K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr超能文献

K-shuff：一种用于表征基因文库中结构和组成多样性的新算法。

K-shuff: A Novel Algorithm for Characterizing Structural and Compositional Diversity in Gene Libraries.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

Suppr
超能文献