• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ImproveAssembly - 一种用于识别新基因产物和改进基因组组装的工具。

ImproveAssembly - Tool for identifying new gene products and improving genome assembly.

机构信息

Faculty of Computer Engineering, Federal University of Pará campus Tucuruí (CAMTUC-UFPA), Pará, Brazil.

Federal Rural University of Amazonia campus Tomé-Açu (UFRA), Pará, Brazil.

出版信息

PLoS One. 2018 Oct 26;13(10):e0206000. doi: 10.1371/journal.pone.0206000. eCollection 2018.

DOI:10.1371/journal.pone.0206000
PMID:30365512
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6203371/
Abstract

The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid the dissemination of incorrect information; for example, several automated tools are available for annotation processes. However, manual curation ensures and enriches biological information. Additionally, the genomic finishing process is complex, resulting in increased deposition of drafts genomes. This introduces bias in other omics analyses because incomplete genomic content is used. This is also observed for complete genomes. For example, genomes generated by reference assembly may not include new products in the new sequence or errors or bias can occur during the assembly process. Thus, we developed ImproveAssembly, a tool capable of identifying new products missing from genomic sequences, which can be used for complete and draft genomes. The identified products can improve the annotation of complete genomes and drafts while significantly reducing the bias when the information is used in other omics analyses.

摘要

公共数据库中生物信息的可用性呈指数级增长。为了确保这些信息的准确性,研究人员已经采用了多种方法和改进措施来避免错误信息的传播;例如,有几个自动化工具可用于注释过程。然而,人工校对可以确保和丰富生物信息。此外,基因组完成过程非常复杂,导致草稿基因组的提交量增加。这会导致其他组学分析出现偏差,因为使用的是不完整的基因组内容。这在完整基因组中也会出现。例如,通过参考组装生成的基因组可能不包括新序列中的新产品,或者在组装过程中可能会出现错误或偏差。因此,我们开发了 ImproveAssembly,这是一种能够识别基因组序列中缺失新产品的工具,可用于完整和草稿基因组。所识别的产物可改进完整基因组和草稿基因组的注释,同时在将这些信息用于其他组学分析时显著减少偏差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/091dc3c8b62f/pone.0206000.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/da6cd3240c4f/pone.0206000.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/2397b760645d/pone.0206000.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/091dc3c8b62f/pone.0206000.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/da6cd3240c4f/pone.0206000.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/2397b760645d/pone.0206000.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/75a9/6203371/091dc3c8b62f/pone.0206000.g003.jpg

相似文献

1
ImproveAssembly - Tool for identifying new gene products and improving genome assembly.ImproveAssembly - 一种用于识别新基因产物和改进基因组组装的工具。
PLoS One. 2018 Oct 26;13(10):e0206000. doi: 10.1371/journal.pone.0206000. eCollection 2018.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
ReNoteWeb - Web platform for the improvement of assembly result and annotation of prokaryotic genomes.ReNoteWeb - 用于提高原核基因组组装结果和注释质量的网络平台。
Gene. 2022 Nov 30;844:146819. doi: 10.1016/j.gene.2022.146819. Epub 2022 Aug 25.
4
GI-POP: a combinational annotation and genomic island prediction pipeline for ongoing microbial genome projects.GI-POP:一种组合注释和基因组岛预测管道,用于正在进行的微生物基因组项目。
Gene. 2013 Apr 10;518(1):114-23. doi: 10.1016/j.gene.2012.11.063. Epub 2013 Jan 12.
5
CGAT: a comparative genome analysis tool for visualizing alignments in the analysis of complex evolutionary changes between closely related genomes.CGAT:一种用于在分析密切相关基因组之间复杂进化变化时可视化比对结果的比较基因组分析工具。
BMC Bioinformatics. 2006 Oct 24;7:472. doi: 10.1186/1471-2105-7-472.
6
ISQuest: finding insertion sequences in prokaryotic sequence fragment data.ISQuest:在原核生物序列片段数据中寻找插入序列
Bioinformatics. 2015 Nov 1;31(21):3406-12. doi: 10.1093/bioinformatics/btv388. Epub 2015 Jun 27.
7
PRAP: an ab initio software package for automated genome-wide analysis of DNA repeats for prokaryotes.PRAP:一个用于原核生物 DNA 重复序列全基因组自动化分析的从头软件包。
Bioinformatics. 2013 Nov 1;29(21):2683-9. doi: 10.1093/bioinformatics/btt482. Epub 2013 Aug 19.
8
Genix: a new online automated pipeline for bacterial genome annotation.Genix:一种用于细菌基因组注释的新型在线自动化流程。
FEMS Microbiol Lett. 2016 Dec;363(23). doi: 10.1093/femsle/fnw263. Epub 2016 Nov 16.
9
GFinisher: a new strategy to refine and finish bacterial genome assemblies.GFinisher:一种用于优化和完成细菌基因组组装的新策略。
Sci Rep. 2016 Oct 10;6:34963. doi: 10.1038/srep34963.
10
Assessing the impact of exact reads on reducing the error rate of read mapping.评估精确读取对降低读取映射错误率的影响。
BMC Bioinformatics. 2018 Nov 6;19(1):406. doi: 10.1186/s12859-018-2432-7.

本文引用的文献

1
Improving draft genome contiguity with reference-derived in silico mate-pair libraries.利用参考序列衍生的虚拟同型配对文库提高基因组草图连续性。
Gigascience. 2018 May 1;7(5). doi: 10.1093/gigascience/giy029.
2
Approaches for in silico finishing of microbial genome sequences.微生物基因组序列的计算机辅助完成方法。
Genet Mol Biol. 2017;40(3):553-576. doi: 10.1590/1678-4685-GMB-2016-0230.
3
Genome annotation for clinical genomic diagnostics: strengths and weaknesses.临床基因组诊断中的基因组注释:优势与不足
Genome Med. 2017 May 30;9(1):49. doi: 10.1186/s13073-017-0441-1.
4
PanWeb: A web interface for pan-genomic analysis.PanWeb:用于泛基因组分析的网络界面。
PLoS One. 2017 May 24;12(5):e0178154. doi: 10.1371/journal.pone.0178154. eCollection 2017.
5
OrthoFiller: utilising data from multiple species to improve the completeness of genome annotations.OrthoFiller:利用多个物种的数据提高基因组注释的完整性。
BMC Genomics. 2017 May 18;18(1):390. doi: 10.1186/s12864-017-3771-x.
6
Insights from 20 years of bacterial genome sequencing.20 年细菌基因组测序的启示。
Funct Integr Genomics. 2015 Mar;15(2):141-61. doi: 10.1007/s10142-015-0433-4. Epub 2015 Feb 27.
7
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.SOAPdenovo2:一种经验丰富的、内存效率高的短读长从头组装器。
Gigascience. 2012 Dec 27;1(1):18. doi: 10.1186/2047-217X-1-18.
8
SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.SPAdes:一种新的基因组组装算法及其在单细胞测序中的应用
J Comput Biol. 2012 May;19(5):455-77. doi: 10.1089/cmb.2012.0021. Epub 2012 Apr 16.
9
Gene fragmentation in bacterial draft genomes: extent, consequences and mitigation.细菌草图基因组中的基因碎片化:程度、后果和缓解措施。
BMC Genomics. 2012 Jan 10;13:14. doi: 10.1186/1471-2164-13-14.
10
Limitations of next-generation genome sequence assembly.下一代基因组序列组装的局限性。
Nat Methods. 2011 Jan;8(1):61-5. doi: 10.1038/nmeth.1527. Epub 2010 Nov 21.