• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GAAP:一个基于图形用户界面的基因组组装与注释软件包。

GAAP: A GUI-based Genome Assembly and Annotation Package.

作者信息

Singla Deepak, Yadav Inderjit Singh

机构信息

School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, India.

出版信息

Curr Genomics. 2022 Jun 10;23(2):77-82. doi: 10.2174/1389202923666220128155537.

DOI:10.2174/1389202923666220128155537
PMID:36778979
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9878834/
Abstract

Next-generation sequencing (NGS) technologies are being continuously used for high-throughput sequencing data generation that requires easy-to-use GUI-based data analysis software. These kinds of software could be used in-parallel with sequencing for the automatic data analysis. At present, very few software are available for use and most of them are commercial, thus creating a gap between data generation and data analysis. GAAP is developed on the NodeJS platform that uses HTML, JavaScript as the front-end for communication with users. We have implemented FastQC and trimmomatic tool for quality checking and control. Velvet and Prodigal are integrated for genome assembly and gene prediction. The annotation will be done with the help of remote NCBI Blast and IPR-Scan. In the back- end, we have used PERL and JavaScript for the processing of data. To evaluate the performance of GAAP, we have assembled a viral (SRR11621811), bacterial (SRR17153353) and human genome (SRR16845439). We have used GAAP software to assemble, and annotate a COVID-19 genome on a desktop computer that resulted in a single contig of 27994bp with 99.57% reference genome coverage. This assembly predicted 11 genes, of which 10 were annotated using annotation module of GAAP. We have also assembled a bacterial and human genome 138 and 194281 contigs with N50 value 100399 and 610, respectively. In this study, we have developed freely available, platform-independent genome assembly and annotation (GAAP) software (www.deepaklab.com/gaap). The software itself acts as a complete data analysis package with quality check, quality control, genome assembly, gene prediction and annotation (Blast, PFAM, GO-Term, pathway and enzyme mapping) modules.

摘要

新一代测序(NGS)技术正不断用于生成高通量测序数据,这需要基于图形用户界面(GUI)的易于使用的数据分析软件。这类软件可与测序并行使用,以进行自动数据分析。目前,可用的软件非常少,而且大多数都是商业软件, 因此在数据生成和数据分析之间形成了差距。GAAP是在NodeJS平台上开发的,它使用HTML、JavaScript作为与用户通信的前端。我们已经实现了用于质量检查和控制的FastQC和trimmomatic工具。整合了Velvet和Prodigal用于基因组组装和基因预测。注释将借助远程NCBI Blast和IPR-Scan完成。在后端,我们使用PERL和JavaScript进行数据处理。为了评估GAAP的性能,我们组装了一个病毒基因组(SRR11621811)、一个细菌基因组(SRR17153353)和一个人类基因组(SRR16845439)。我们使用GAAP软件在台式计算机上组装并注释了一个新冠病毒基因组,得到了一个长度为27994bp的单重叠群,参考基因组覆盖率为99.57%。该组装预测了11个基因,其中10个使用GAAP的注释模块进行了注释。我们还分别组装了一个细菌基因组和一个人类基因组,分别得到了138和194281个重叠群,N50值分别为100399和610。在本研究中,我们开发了免费的、与平台无关的基因组组装和注释(GAAP)软件(www.deepaklab.com/gaap)。该软件本身作为一个完整的数据分析包,包含质量检查、质量控制、基因组组装、基因预测和注释(Blast、PFAM、GO术语、通路和酶映射)模块。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f75/9878834/5f4b99dff1e6/CG-23-77_F2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f75/9878834/f1f89f028dde/CG-23-77_F1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f75/9878834/5f4b99dff1e6/CG-23-77_F2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f75/9878834/f1f89f028dde/CG-23-77_F1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f75/9878834/5f4b99dff1e6/CG-23-77_F2.jpg

相似文献

1
GAAP: A GUI-based Genome Assembly and Annotation Package.GAAP:一个基于图形用户界面的基因组组装与注释软件包。
Curr Genomics. 2022 Jun 10;23(2):77-82. doi: 10.2174/1389202923666220128155537.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
4
A Comparison of Resources for the Annotation of a De Novo Assembled Transcriptome in the Molting Gland (Y-Organ) of the Blackback Land Crab, Gecarcinus lateralis.黑背陆蟹(Gecarcinus lateralis)蜕皮腺(Y器官)中从头组装转录组注释资源的比较
Integr Comp Biol. 2016 Dec;56(6):1103-1112. doi: 10.1093/icb/icw107. Epub 2016 Aug 22.
5
Genome sequencing of bacteria: sequencing, de novo assembly and rapid analysis using open source tools.细菌基因组测序:使用开源工具进行测序、从头组装和快速分析。
BMC Genomics. 2013 Apr 1;14:211. doi: 10.1186/1471-2164-14-211.
6
GALAXY Workflow for Bacterial Next-Generation Sequencing De Novo Assembly and Annotation.用于细菌新一代测序从头组装和注释的银河工作流程。
Curr Protoc. 2021 Sep;1(9):e242. doi: 10.1002/cpz1.242.
7
Assembly and annotation of a non-model gastropod (Nerita melanotragus) transcriptome: a comparison of de novo assemblers.一种非模式腹足动物(黑唇蜒螺)转录组的组装与注释:从头组装器的比较
BMC Res Notes. 2014 Aug 1;7:488. doi: 10.1186/1756-0500-7-488.
8
A new rhesus macaque assembly and annotation for next-generation sequencing analyses.用于下一代测序分析的恒河猴新基因组组装与注释。
Biol Direct. 2014 Oct 14;9(1):20. doi: 10.1186/1745-6150-9-20.
9
A pilot study for channel catfish whole genome sequencing and de novo assembly.斑点叉尾鮰全基因组测序和从头组装的初步研究。
BMC Genomics. 2011 Dec 22;12:629. doi: 10.1186/1471-2164-12-629.
10
AGORA: organellar genome annotation from the amino acid and nucleotide references.AGORA:基于氨基酸和核苷酸参考的细胞器基因组注释。
Bioinformatics. 2018 Aug 1;34(15):2661-2663. doi: 10.1093/bioinformatics/bty196.

引用本文的文献

1
Chromosome-level genome assembly of the cave leech Sinospelaeobdella cavatuses (Hirudinea: Haemadipsidae).洞穴蛭 Sinospelaeobdella cavatuses(蛭纲:Hirudinea:Haemadipsidae)的染色体水平基因组组装。
Sci Data. 2024 Nov 13;11(1):1223. doi: 10.1038/s41597-024-04007-3.
2
Chromosomal-Level Reference Genome for the Chinese Endemic Pygmy Grasshopper, , Sheds Light on Tetrigidae Evolution and Advancing Conservation Efforts.中国特有小蚱蜢的染色体水平参考基因组为蚱蜢科的进化及推进保护工作提供了线索。
Insects. 2024 Mar 25;15(4):223. doi: 10.3390/insects15040223.
3
Integration of Non-Coding RNA and mRNA Profiles Reveals the Mechanisms of Rumen Development Induced by Different Types of Diet in Calves.

本文引用的文献

1
Plant pan-genomes are the new reference.植物泛基因组成为新的参考。
Nat Plants. 2020 Aug;6(8):914-920. doi: 10.1038/s41477-020-0733-0. Epub 2020 Jul 20.
2
Long walk to genomics: History and current approaches to genome sequencing and assembly.通往基因组学的漫长之路:基因组测序与组装的历史及当前方法
Comput Struct Biotechnol J. 2019 Nov 17;18:9-19. doi: 10.1016/j.csbj.2019.11.002. eCollection 2020.
3
Exploring and Exploiting Pan-genomics for Crop Improvement.探索和利用泛基因组进行作物改良。
非编码 RNA 和 mRNA 谱的整合揭示了不同类型日粮诱导犊牛瘤胃发育的机制。
Genes (Basel). 2023 May 16;14(5):1093. doi: 10.3390/genes14051093.
Mol Plant. 2019 Feb 4;12(2):156-169. doi: 10.1016/j.molp.2018.12.016. Epub 2018 Dec 28.
4
The Gene Ontology Resource: 20 years and still GOing strong.《基因本体论资源:20 年,持续强大》
Nucleic Acids Res. 2019 Jan 8;47(D1):D330-D338. doi: 10.1093/nar/gky1055.
5
A Review of Bioinformatics Tools for Bio-Prospecting from Metagenomic Sequence Data.用于从宏基因组序列数据中进行生物勘探的生物信息学工具综述。
Front Genet. 2017 Mar 6;8:23. doi: 10.3389/fgene.2017.00023. eCollection 2017.
6
InterPro in 2017-beyond protein family and domain annotations.2017年的InterPro——超越蛋白质家族和结构域注释
Nucleic Acids Res. 2017 Jan 4;45(D1):D190-D199. doi: 10.1093/nar/gkw1107. Epub 2016 Nov 29.
7
KEGG as a reference resource for gene and protein annotation.KEGG作为基因和蛋白质注释的参考资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D457-62. doi: 10.1093/nar/gkv1070. Epub 2015 Oct 17.
8
The bacterial pangenome as a new tool for analysing pathogenic bacteria.细菌泛基因组作为分析病原菌的新工具。
New Microbes New Infect. 2015 Jun 26;7:72-85. doi: 10.1016/j.nmni.2015.06.005. eCollection 2015 Sep.
9
Trimmomatic: a flexible trimmer for Illumina sequence data.Trimmomatic:一款适用于 Illumina 测序数据的灵活修剪工具。
Bioinformatics. 2014 Aug 1;30(15):2114-20. doi: 10.1093/bioinformatics/btu170. Epub 2014 Apr 1.
10
InterProScan 5: genome-scale protein function classification.InterProScan 5:基因组规模的蛋白质功能分类。
Bioinformatics. 2014 May 1;30(9):1236-40. doi: 10.1093/bioinformatics/btu031. Epub 2014 Jan 21.