• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过预测未知蛋白质家族来发现人类宏基因组数据中的病毒基因组。

Discovering viral genomes in human metagenomic data by predicting unknown protein families.

机构信息

Department of Cell and Molecular Biology, Science for Life Laboratory, Karolinska Institutet, PO Box 285, SE-171 77, Stockholm, Sweden.

Stockholm Bioinformatics Center, Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, SE-171 21, Solna, Sweden.

出版信息

Sci Rep. 2018 Jan 8;8(1):28. doi: 10.1038/s41598-017-18341-7.

DOI:10.1038/s41598-017-18341-7
PMID:29311716
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5758519/
Abstract

Massive amounts of metagenomics data are currently being produced, and in all such projects a sizeable fraction of the resulting data shows no or little homology to known sequences. It is likely that this fraction contains novel viruses, but identification is challenging since they frequently lack homology to known viruses. To overcome this problem, we developed a strategy to detect ORFan protein families in shotgun metagenomics data, using similarity-based clustering and a set of filters to extract bona fide protein families. We applied this method to 17 virus-enriched libraries originating from human nasopharyngeal aspirates, serum, feces, and cerebrospinal fluid samples. This resulted in 32 predicted putative novel gene families. Some families showed detectable homology to sequences in metagenomics datasets and protein databases after reannotation. Notably, one predicted family matches an ORF from the highly variable Torque Teno virus (TTV). Furthermore, follow-up from a predicted ORFan resulted in the complete reconstruction of a novel circular genome. Its organisation suggests that it most likely corresponds to a novel bacteriophage in the microviridae family, hence it was named bacteriophage HFM.

摘要

目前正在产生大量的宏基因组学数据,在所有这些项目中,相当一部分的结果数据与已知序列没有或几乎没有同源性。这部分数据很可能包含新的病毒,但由于它们通常与已知病毒没有同源性,因此鉴定具有挑战性。为了克服这个问题,我们开发了一种在鸟枪法宏基因组学数据中检测 ORFan 蛋白家族的策略,使用基于相似性的聚类和一组过滤器来提取真正的蛋白家族。我们将这种方法应用于 17 个源自人鼻咽抽吸物、血清、粪便和脑脊液样本的病毒富集文库。这导致了 32 个预测的假定新基因家族。一些家族在重新注释后显示出与宏基因组数据集和蛋白质数据库中的序列有可检测的同源性。值得注意的是,一个预测的家族与高度可变的 Torque Teno 病毒 (TTV) 的一个 ORF 相匹配。此外,对一个预测的 ORFan 的后续研究导致了一个新的圆形基因组的完整重建。它的组织表明,它很可能对应于微病毒科中的一种新型噬菌体,因此它被命名为噬菌体 HFM。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/34ccb9ff6dde/41598_2017_18341_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/414127331b14/41598_2017_18341_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/181a7442fac3/41598_2017_18341_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/47420eeb0644/41598_2017_18341_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/34ccb9ff6dde/41598_2017_18341_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/414127331b14/41598_2017_18341_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/181a7442fac3/41598_2017_18341_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/47420eeb0644/41598_2017_18341_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3e55/5758519/34ccb9ff6dde/41598_2017_18341_Fig4_HTML.jpg

相似文献

1
Discovering viral genomes in human metagenomic data by predicting unknown protein families.通过预测未知蛋白质家族来发现人类宏基因组数据中的病毒基因组。
Sci Rep. 2018 Jan 8;8(1):28. doi: 10.1038/s41598-017-18341-7.
2
Two novel circo-like viruses detected in human feces: complete genome sequencing and electron microscopy analysis.在人类粪便中检测到两种新型环状病毒:全基因组测序和电子显微镜分析。
Virus Res. 2013 Dec 26;178(2):364-73. doi: 10.1016/j.virusres.2013.09.018. Epub 2013 Sep 18.
3
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data.VirFinder:一种新型的基于 k-mer 的工具,用于从组装的宏基因组数据中识别病毒序列。
Microbiome. 2017 Jul 6;5(1):69. doi: 10.1186/s40168-017-0283-5.
4
Deciphering the Human Virome with Single-Virus Genomics and Metagenomics.利用单病毒基因组学和宏基因组学破译人类病毒组。
Viruses. 2018 Mar 6;10(3):113. doi: 10.3390/v10030113.
5
VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences.VIBRANT:从基因组序列中自动恢复、注释和培养微生物病毒,并评估病毒群落功能。
Microbiome. 2020 Jun 10;8(1):90. doi: 10.1186/s40168-020-00867-0.
6
Functional metagenomics reveals novel β-galactosidases not predictable from gene sequences.功能宏基因组学揭示了无法从基因序列预测的新型β-半乳糖苷酶。
PLoS One. 2017 Mar 8;12(3):e0172545. doi: 10.1371/journal.pone.0172545. eCollection 2017.
7
Remote homology and the functions of metagenomic dark matter.远程同源性与宏基因组暗物质的功能。
Front Genet. 2015 Jul 21;6:234. doi: 10.3389/fgene.2015.00234. eCollection 2015.
8
Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis.用于宏基因组差异分析的k-mer谱适用性评估。
BMC Bioinformatics. 2016 Jan 16;17:38. doi: 10.1186/s12859-015-0875-7.
9
GHOSTX: A Fast Sequence Homology Search Tool for Functional Annotation of Metagenomic Data.GHOSTX:一种用于宏基因组数据功能注释的快速序列同源性搜索工具。
Methods Mol Biol. 2017;1611:15-25. doi: 10.1007/978-1-4939-7015-5_2.
10
Bioinformatic Approaches for Comparative Analysis of Viruses.病毒比较分析的生物信息学方法
Methods Mol Biol. 2018;1704:401-417. doi: 10.1007/978-1-4939-7463-4_15.

引用本文的文献

1
Charting Peptide Shared Sequences Between 'Diabetes-Viruses' and Human Pancreatic Proteins, Their Structural and Autoimmune Implications.绘制“糖尿病病毒”与人类胰腺蛋白之间的肽共享序列,及其结构和自身免疫意义
Bioinform Biol Insights. 2024 Nov 5;18:11779322241289936. doi: 10.1177/11779322241289936. eCollection 2024.
2
Insights into water insecurity in Indigenous communities in Canada: assessing microbial risks and innovative solutions, a multifaceted review.加拿大原住民社区的水不安全问题洞察:评估微生物风险和创新解决方案,一个多方面的综述。
PeerJ. 2024 Oct 18;12:e18277. doi: 10.7717/peerj.18277. eCollection 2024.
3

本文引用的文献

1
Uncovering Earth's virome.揭示地球的病毒组。
Nature. 2016 Aug 25;536(7617):425-30. doi: 10.1038/nature19094. Epub 2016 Aug 17.
2
Bacteriophage infections of microbiota can lead to leaky gut in an experimental rodent model.在实验性啮齿动物模型中,微生物群的噬菌体感染可导致肠道渗漏。
Gut Pathog. 2016 Jun 23;8:33. doi: 10.1186/s13099-016-0109-1. eCollection 2016.
3
Fast and sensitive taxonomic classification for metagenomics with Kaiju.使用Kaiju对宏基因组学进行快速且灵敏的分类学分类。
Post-genomic platform for development of oligonucleotide vaccines against RNA viruses: diamond cuts diamond.
针对 RNA 病毒的寡核苷酸疫苗开发的后基因组平台:钻石切割钻石。
Inflamm Res. 2022 Aug;71(7-8):729-739. doi: 10.1007/s00011-022-01582-2. Epub 2022 May 6.
4
Metagenomic profiling of placental tissue suggests DNA virus infection of the placenta is rare.胎盘组织的宏基因组分析表明,胎盘内 DNA 病毒感染较为罕见。
J Gen Virol. 2021 Nov;102(11). doi: 10.1099/jgv.0.001677.
5
Short k-mer abundance profiles yield robust machine learning features and accurate classifiers for RNA viruses.短 k- -mer 丰度谱为 RNA 病毒提供了强大的机器学习特征和准确的分类器。
PLoS One. 2020 Sep 18;15(9):e0239381. doi: 10.1371/journal.pone.0239381. eCollection 2020.
6
Beyond Just Bacteria: Functional Biomes in the Gut Ecosystem Including Virome, Mycobiome, Archaeome and Helminths.超越细菌:肠道生态系统中的功能微生物群落,包括病毒群落、真菌群落、古菌群落和蠕虫
Microorganisms. 2020 Mar 28;8(4):483. doi: 10.3390/microorganisms8040483.
7
The Cancer Microbiome: Distinguishing Direct and Indirect Effects Requires a Systemic View.癌症微生物组:需要系统观点来区分直接和间接影响。
Trends Cancer. 2020 Mar;6(3):192-204. doi: 10.1016/j.trecan.2020.01.004. Epub 2020 Feb 7.
8
Gut virome of mammals and birds reveals high genetic diversity of the family .哺乳动物和鸟类的肠道病毒组揭示了该家族的高度遗传多样性。
Virus Evol. 2019 Jun 8;5(1):vez013. doi: 10.1093/ve/vez013. eCollection 2019 Jan.
9
Genetic and functional diversity of double-stranded DNA viruses in a tropical monsoonal estuary, India.印度热带季风河口地区双链 DNA 病毒的遗传和功能多样性。
Sci Rep. 2018 Oct 30;8(1):16036. doi: 10.1038/s41598-018-34332-8.
Nat Commun. 2016 Apr 13;7:11257. doi: 10.1038/ncomms11257.
4
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.人类皮肤双链DNA病毒群落:地形和时间多样性、基因富集以及与宿主微生物群的动态关联
mBio. 2015 Oct 20;6(5):e01578-15. doi: 10.1128/mBio.01578-15.
5
Metagenomic ventures into outer sequence space.宏基因组学对外源序列空间的探索。
Bacteriophage. 2014 Dec 15;4(4):e979664. doi: 10.4161/21597081.2014.979664. eCollection 2014.
6
MetaPhlAn2 for enhanced metagenomic taxonomic profiling.用于增强宏基因组分类分析的MetaPhlAn2
Nat Methods. 2015 Oct;12(10):902-3. doi: 10.1038/nmeth.3589.
7
UpSet: Visualization of Intersecting Sets.UpSet:相交集的可视化
IEEE Trans Vis Comput Graph. 2014 Dec;20(12):1983-92. doi: 10.1109/TVCG.2014.2346248.
8
Viral dark matter and virus-host interactions resolved from publicly available microbial genomes.从公开的微生物基因组中解析出的病毒暗物质与病毒-宿主相互作用。
Elife. 2015 Jul 22;4:e08490. doi: 10.7554/eLife.08490.
9
IVA: accurate de novo assembly of RNA virus genomes.IVA:RNA病毒基因组的精确从头组装
Bioinformatics. 2015 Jul 15;31(14):2374-6. doi: 10.1093/bioinformatics/btv120. Epub 2015 Feb 28.
10
Profile hidden Markov models for the detection of viruses within metagenomic sequence data.用于在宏基因组序列数据中检测病毒的轮廓隐马尔可夫模型。
PLoS One. 2014 Aug 20;9(8):e105067. doi: 10.1371/journal.pone.0105067. eCollection 2014.