• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于注释NCBI生物项目转录组数据的工作流程和网络应用程序。

Workflow and web application for annotating NCBI BioProject transcriptome data.

作者信息

Vera Alvarez Roberto, Medeiros Vidal Newton, Garzón-Martínez Gina A, Barrero Luz S, Landsman David, Mariño-Ramírez Leonardo

机构信息

Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike. Bethesda, MD 20894, USA.

Colombian Corporation for Agricultural Research (CORPOICA), Km 14 vía Mosquera, Bogota, Colombia.

出版信息

Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax008.

DOI:10.1093/database/bax008
PMID:28605765
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5467576/
Abstract

ABSTRACT

The volume of transcriptome data is growing exponentially due to rapid improvement of experimental technologies. In response, large central resources such as those of the National Center for Biotechnology Information (NCBI) are continually adapting their computational infrastructure to accommodate this large influx of data. New and specialized databases, such as Transcriptome Shotgun Assembly Sequence Database (TSA) and Sequence Read Archive (SRA), have been created to aid the development and expansion of centralized repositories. Although the central resource databases are under continual development, they do not include automatic pipelines to increase annotation of newly deposited data. Therefore, third-party applications are required to achieve that aim. Here, we present an automatic workflow and web application for the annotation of transcriptome data. The workflow creates secondary data such as sequencing reads and BLAST alignments, which are available through the web application. They are based on freely available bioinformatics tools and scripts developed in-house. The interactive web application provides a search engine and several browser utilities. Graphical views of transcript alignments are available through SeqViewer, an embedded tool developed by NCBI for viewing biological sequence data. The web application is tightly integrated with other NCBI web applications and tools to extend the functionality of data processing and interconnectivity. We present a case study for the species Physalis peruviana with data generated from BioProject ID 67621.

DATABASE

URL: http://www.ncbi.nlm.nih.gov/projects/physalis/.

摘要

摘要

由于实验技术的迅速发展,转录组数据量呈指数级增长。作为回应,诸如美国国立生物技术信息中心(NCBI)这样的大型中央资源机构不断调整其计算基础设施,以适应大量涌入的数据。已经创建了新的专门数据库,如转录组鸟枪法测序序列数据库(TSA)和序列读数档案库(SRA),以协助集中式存储库的开发和扩展。尽管中央资源数据库在不断发展,但它们不包括用于增加新存入数据注释的自动管道。因此,需要第三方应用程序来实现这一目标。在这里,我们展示了一种用于转录组数据注释的自动工作流程和网络应用程序。该工作流程创建诸如测序读数和BLAST比对等二级数据,可通过网络应用程序获取。它们基于内部开发的免费生物信息学工具和脚本。交互式网络应用程序提供了一个搜索引擎和几个浏览器实用程序。转录本比对的图形视图可通过SeqViewer获得,SeqViewer是NCBI开发的用于查看生物序列数据的嵌入式工具。该网络应用程序与其他NCBI网络应用程序和工具紧密集成,以扩展数据处理和互连的功能。我们展示了一个针对酸浆的数据案例研究,数据来自生物项目ID 67621。

数据库

网址:http://www.ncbi.nlm.nih.gov/projects/physalis/

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/f1d579431f01/bax008f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/13bcf92eca4a/bax008f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/ee309d933a73/bax008f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/2683c8a84937/bax008f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/f4771487e78a/bax008f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/3ea7da5db03a/bax008f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/f1d579431f01/bax008f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/13bcf92eca4a/bax008f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/ee309d933a73/bax008f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/2683c8a84937/bax008f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/f4771487e78a/bax008f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/3ea7da5db03a/bax008f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/192c/5467576/f1d579431f01/bax008f6.jpg

相似文献

1
Workflow and web application for annotating NCBI BioProject transcriptome data.用于注释NCBI生物项目转录组数据的工作流程和网络应用程序。
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax008.
2
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2014 Jan;42(Database issue):D7-17. doi: 10.1093/nar/gkt1146. Epub 2013 Nov 19.
3
BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata.NCBI 的 BioProject 和 BioSample 数据库:促进元数据的捕获和组织。
Nucleic Acids Res. 2012 Jan;40(Database issue):D57-63. doi: 10.1093/nar/gkr1163. Epub 2011 Dec 1.
4
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2013 Jan;41(Database issue):D8-D20. doi: 10.1093/nar/gks1189. Epub 2012 Nov 27.
5
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2012 Jan;40(Database issue):D13-25. doi: 10.1093/nar/gkr1184. Epub 2011 Dec 2.
6
Dataset of de novo assembly and functional annotation of the transcriptomes of three native oleaginous microalgae from the Peruvian Amazon.来自秘鲁亚马逊地区的三种本地产油微藻转录组的从头组装和功能注释数据集。
Data Brief. 2020 Jun 21;31:105917. doi: 10.1016/j.dib.2020.105917. eCollection 2020 Aug.
7
The CAIRR Pipeline for Submitting Standards-Compliant B and T Cell Receptor Repertoire Sequencing Studies to the National Center for Biotechnology Information Repositories.CAIRR 管道用于向国家生物技术信息中心存储库提交符合标准的 B 和 T 细胞受体文库测序研究。
Front Immunol. 2018 Aug 16;9:1877. doi: 10.3389/fimmu.2018.01877. eCollection 2018.
8
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2019 Jan 8;47(D1):D23-D28. doi: 10.1093/nar/gky1069.
9
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2010 Jan;38(Database issue):D5-16. doi: 10.1093/nar/gkp967. Epub 2009 Nov 12.
10
search GenBank: interactive orchestration and ad-hoc choreography of Web services in the exploration of the biomedical resources of the National Center For Biotechnology Information.搜索 GenBank:在探索国家生物技术信息中心的生物医学资源时,对 Web 服务进行交互式编排和临时编排。
BMC Bioinformatics. 2013 Mar 1;14:73. doi: 10.1186/1471-2105-14-73.

引用本文的文献

1
Combining transcriptome analysis and GWAS for identification and validation of marker genes in the - pathosystem.结合转录组分析和全基因组关联研究以鉴定和验证 - 病理系统中的标记基因。
PeerJ. 2021 Mar 22;9:e11135. doi: 10.7717/peerj.11135. eCollection 2021.
2
Transcriptome annotation in the cloud: complexity, best practices, and cost.转录组注释在云端:复杂性、最佳实践和成本。
Gigascience. 2021 Jan 29;10(2). doi: 10.1093/gigascience/giaa163.
3
Obtaining extremely large and accurate protein multiple sequence alignments from curated hierarchical alignments.

本文引用的文献

1
Association analysis for disease resistance to Fusarium oxysporum in cape gooseberry (Physalis peruviana L).灯笼果(酸浆属秘鲁酸浆)对尖孢镰刀菌抗病性的关联分析
BMC Genomics. 2016 Mar 18;17:248. doi: 10.1186/s12864-016-2568-7.
2
EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms.棘皮动物数据库(EchinoDB),一款用于深度采样的棘皮动物进化枝比较转录组学的应用程序。
BMC Bioinformatics. 2016 Jan 22;17:48. doi: 10.1186/s12859-016-0883-2.
3
Whole Transcriptome Analysis Provides Insights into Molecular Mechanisms for Molting in Litopenaeus vannamei.
从已编辑的层级比对获取超大量且精确的蛋白质多重序列比对。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa042.
全转录组分析为凡纳滨对虾蜕皮的分子机制提供了见解。
PLoS One. 2015 Dec 9;10(12):e0144350. doi: 10.1371/journal.pone.0144350. eCollection 2015.
4
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2016 Jan 4;44(D1):D7-19. doi: 10.1093/nar/gkv1290. Epub 2015 Nov 28.
5
Transcriptator: An Automated Computational Pipeline to Annotate Assembled Reads and Identify Non Coding RNA.转录器:一种用于注释组装读段和识别非编码RNA的自动化计算流程。
PLoS One. 2015 Nov 18;10(11):e0140268. doi: 10.1371/journal.pone.0140268. eCollection 2015.
6
Genetic diversity and population structure in and related taxa based on InDels and SNPs derived from COSII and IRG markers.基于来自COSII和IRG标记的插入缺失(InDels)和单核苷酸多态性(SNPs),研究[具体物种]及其相关类群的遗传多样性和种群结构。
Plant Gene. 2015 Dec 1;4:29-37. doi: 10.1016/j.plgene.2015.09.003.
7
Enhancing Structural Annotation of Yeast Genomes with RNA-Seq Data.利用RNA测序数据增强酵母基因组的结构注释
Methods Mol Biol. 2016;1361:41-56. doi: 10.1007/978-1-4939-3079-1_2.
8
Whole transcriptome expression profiling of mouse limb tendon development by using RNA-seq.利用RNA测序技术对小鼠肢体肌腱发育进行全转录组表达谱分析。
J Orthop Res. 2015 Jun;33(6):840-8. doi: 10.1002/jor.22886. Epub 2015 Apr 28.
9
ASPicDB: a database web tool for alternative splicing analysis.ASPicDB:用于可变剪接分析的数据库网络工具。
Methods Mol Biol. 2015;1269:365-78. doi: 10.1007/978-1-4939-2291-8_23.
10
UniProt: a hub for protein information.通用蛋白质数据库(UniProt):蛋白质信息中心。
Nucleic Acids Res. 2015 Jan;43(Database issue):D204-12. doi: 10.1093/nar/gku989. Epub 2014 Oct 27.