Suppr超能文献

公共数据库中基因组组装的错误鉴定:以大连瑙莫酵母为例及提出纠正错误鉴定的方案

Misidentification of genome assemblies in public databases: The case of Naumovozyma dairenensis and proposal of a protocol to correct misidentifications.

作者信息

Stavrou Aimilia A, Mixão Verónica, Boekhout Teun, Gabaldón Toni

机构信息

Westerdijk Fungal Biodiversity Institute, 3584, Utrecht, The Netherlands.

Institute for Biodiversity and ecosystem Dynamics, University of Amsterdam, 1012, WX, Amsterdam, The Netherlands.

出版信息

Yeast. 2018 Jun;35(6):425-429. doi: 10.1002/yea.3303. Epub 2018 Feb 22.

Abstract

Online sequence databases such as NCBI GenBank serve as a tremendously useful platform for researchers to share and reuse published data. However, submission systems lack control for errors such as organism misidentification, which once entered in the database can be propagated and mislead downstream analyses. Here we present an illustrating case of misidentification of Candida albicans from a clinical sample as Naumovozyma dairenensis based on whole-genome shotgun data. Analyses of phylogenetic markers, read mapping and single nucleotide polymorphisms served to correct the identification. We propose that the routine use of such analyses could help to detect misidentifications arising from unsupervised analyses and correct them before they enter the databases. Finally, we discuss broader implications of such misidentifications and the difficulty of correcting them once they are in the records.

摘要

诸如NCBI基因库之类的在线序列数据库,为研究人员共享和重用已发表的数据提供了极为有用的平台。然而,提交系统缺乏对诸如生物误识别等错误的控制,一旦这些错误录入数据库,就可能传播并误导下游分析。在此,我们展示一个基于全基因组鸟枪法数据将临床样本中的白色念珠菌误识别为大连瑙莫酵母的案例。系统发育标记分析、读段比对和单核苷酸多态性分析有助于纠正这一误识别。我们建议,常规使用此类分析有助于检测无监督分析产生的误识别,并在其进入数据库之前予以纠正。最后,我们讨论了此类误识别的更广泛影响以及一旦记录在案就难以纠正的问题。

相似文献

2
The diploid genome sequence of Candida albicans.白色念珠菌的二倍体基因组序列。
Proc Natl Acad Sci U S A. 2004 May 11;101(19):7329-34. doi: 10.1073/pnas.0401648101. Epub 2004 May 3.
3
Sequence resources at the Candida Genome Database.念珠菌基因组数据库中的序列资源。
Nucleic Acids Res. 2007 Jan;35(Database issue):D452-6. doi: 10.1093/nar/gkl899. Epub 2006 Nov 7.
8
The long hard road to a completed Candida albicans genome.通往完整白色念珠菌基因组的漫长而艰难之路。
Fungal Genet Biol. 2006 May;43(5):311-5. doi: 10.1016/j.fgb.2006.01.002. Epub 2006 Mar 6.
10
Molecular epidemiology, phylogeny and evolution of Candida albicans.白色念珠菌的分子流行病学、系统发育与进化
Infect Genet Evol. 2014 Jan;21:166-78. doi: 10.1016/j.meegid.2013.11.008. Epub 2013 Nov 19.

引用本文的文献

10
and aspergillosis: From basics to clinics.以及曲霉病:从基础到临床
Stud Mycol. 2021 May 10;100:100115. doi: 10.1016/j.simyco.2021.100115. eCollection 2021 Sep.

本文引用的文献

6
Genomics and the making of yeast biodiversity.基因组学与酵母生物多样性的形成
Curr Opin Genet Dev. 2015 Dec;35:100-9. doi: 10.1016/j.gde.2015.10.008. Epub 2015 Nov 30.
9
Genetic and phenotypic intra-species variation in Candida albicans.白色念珠菌的种内遗传和表型变异。
Genome Res. 2015 Mar;25(3):413-25. doi: 10.1101/gr.174623.114. Epub 2014 Dec 11.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验