• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因组组装与注释入门的十个步骤。

Ten steps to get started in Genome Assembly and Annotation.

作者信息

Dominguez Del Angel Victoria, Hjerde Erik, Sterck Lieven, Capella-Gutierrez Salvadors, Notredame Cederic, Vinnere Pettersson Olga, Amselem Joelle, Bouri Laurent, Bocs Stephanie, Klopp Christophe, Gibrat Jean-Francois, Vlasova Anna, Leskosek Brane L, Soler Lucile, Binzer-Panchal Mahesh, Lantz Henrik

机构信息

Institut Français de Bioinformatique, UMS3601-CNRS, Université Paris-Saclay, Orsay, 91403, France.

Department of Chemistry, Norstruct, UiT The Arctic University of Norway, Tromsø, 9019, Norway.

出版信息

F1000Res. 2018 Feb 5;7. doi: 10.12688/f1000research.13598.1. eCollection 2018.

DOI:10.12688/f1000research.13598.1
PMID:29568489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5850084/
Abstract

As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).

摘要

作为ELIXIR-EXCELERATE能力建设工作的一部分,我们在此介绍10个步骤,以帮助研究人员开始进行基因组组装和基因组注释。所给出的指南具有广泛适用性,旨在长期保持稳定,并涵盖了一般组装和注释项目从开始到结束的各个方面。我们讨论了基因组的内在特性,以及使用高质量DNA的重要性。还详细介绍了不同的测序技术和一般适用的基因组组装工作流程。我们涵盖了结构和功能注释,并鼓励读者对转座元件进行注释,而这在注释工作流程中常常被忽略。强调了数据管理的重要性,并就数据提交地点以及如何使你的结果具备可查找、可访问、可互操作和可重用(FAIR)性提供了建议。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/1ec2b3955771/f1000research-7-14771-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/4f85b964d877/f1000research-7-14771-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/0b06e1c1b6c9/f1000research-7-14771-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/b513f88776fa/f1000research-7-14771-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/1ec2b3955771/f1000research-7-14771-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/4f85b964d877/f1000research-7-14771-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/0b06e1c1b6c9/f1000research-7-14771-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/b513f88776fa/f1000research-7-14771-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8410/5850084/1ec2b3955771/f1000research-7-14771-g0003.jpg

相似文献

1
Ten steps to get started in Genome Assembly and Annotation.基因组组装与注释入门的十个步骤。
F1000Res. 2018 Feb 5;7. doi: 10.12688/f1000research.13598.1. eCollection 2018.
2
Twelve quick steps for genome assembly and annotation in the classroom.课堂上进行基因组组装和注释的 12 个快速步骤。
PLoS Comput Biol. 2020 Nov 12;16(11):e1008325. doi: 10.1371/journal.pcbi.1008325. eCollection 2020 Nov.
3
Workflows for Rapid Functional Annotation of Diverse Arthropod Genomes.多种节肢动物基因组快速功能注释的工作流程
Insects. 2021 Aug 19;12(8):748. doi: 10.3390/insects12080748.
4
5
The FAANG Data Portal: Global, Open-Access, "FAIR", and Richly Validated Genotype to Phenotype Data for High-Quality Functional Annotation of Animal Genomes.FAANG数据门户:用于动物基因组高质量功能注释的全球开放获取、“FAIR”且经过充分验证的基因型到表型数据
Front Genet. 2021 Jun 17;12:639238. doi: 10.3389/fgene.2021.639238. eCollection 2021.
6
Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review.健康数据治理中可发现性、可访问性、互操作性和可重用性数据原则的举措、概念和实施实践:范围综述。
J Med Internet Res. 2023 Aug 28;25:e45013. doi: 10.2196/45013.
7
A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.转座元件注释流水线和表达分析揭示微藻新月菱形藻中潜在活跃的元件。
BMC Genomics. 2018 May 22;19(1):378. doi: 10.1186/s12864-018-4763-1.
8
Phage Genome Annotation: Where to Begin and End.噬菌体基因组注释:从何处开始与结束
Phage (New Rochelle). 2021 Dec 1;2(4):183-193. doi: 10.1089/phage.2021.0015. Epub 2021 Dec 16.
9
BG7: a new approach for bacterial genome annotation designed for next generation sequencing data.BG7:一种专为下一代测序数据设计的细菌基因组注释新方法。
PLoS One. 2012;7(11):e49239. doi: 10.1371/journal.pone.0049239. Epub 2012 Nov 21.
10
JWES: a new pipeline for whole genome/exome sequence data processing, management, and gene-variant discovery, annotation, prediction, and genotyping.JWES:一个用于全基因组/外显子组序列数据处理、管理以及基因变异发现、注释、预测和基因分型的新管道。
FEBS Open Bio. 2021 Sep;11(9):2441-2452. doi: 10.1002/2211-5463.13261. Epub 2021 Aug 11.

引用本文的文献

1
Establishing genome sequencing and assembly for non-model and emerging model organisms: a brief guide.为非模式生物和新兴模式生物建立基因组测序与组装:简要指南
Front Zool. 2025 Apr 17;22(1):7. doi: 10.1186/s12983-025-00561-7.
2
Insights from draft genomes of Heterodera species isolated from field soil samples.从田间土壤样本中分离出的异皮线虫属物种基因组草图中获得的见解。
BMC Genomics. 2025 Feb 18;26(1):158. doi: 10.1186/s12864-025-11351-0.
3
The Ribosomal Operon Database: A Full-Length rDNA Operon Database Derived From Genome Assemblies.

本文引用的文献

1
A manifesto for reproducible science.可重复科学宣言。
Nat Hum Behav. 2017 Jan 10;1(1):0021. doi: 10.1038/s41562-016-0021.
2
Comparative plastome genomics and phylogenomics of Brachypodium: flowering time signatures, introgression and recombination in recently diverged ecotypes.比较短柄草的质体基因组学和系统发育基因组学:开花时间特征、在最近分化的生态型中的渐渗和重组。
New Phytol. 2018 Jun;218(4):1631-1644. doi: 10.1111/nph.14926. Epub 2017 Dec 5.
3
Evaluation of the impact of Illumina error correction tools on de novo genome assembly.
核糖体操纵子数据库:一个源自基因组组装的全长核糖体DNA操纵子数据库。
Mol Ecol Resour. 2025 Jan;25(1):e14031. doi: 10.1111/1755-0998.14031. Epub 2024 Oct 21.
4
Rapid Targeted Assembly of the Proteome Reveals Evolutionary Variation of GC Content in Avian Lice.蛋白质组的快速靶向组装揭示了鸟类虱子基因组中GC含量的进化变异。
Bioinform Biol Insights. 2024 Jun 9;18:11779322241257991. doi: 10.1177/11779322241257991. eCollection 2024.
5
CD59 gene: 143 haplotypes of 22,718 nucleotides length by computational phasing in 113 individuals from different ethnicities.CD59 基因:通过计算相位在来自不同种族的 113 个人中计算出 22718 个核苷酸长度的 143 个单倍型。
Transfusion. 2024 Jul;64(7):1296-1305. doi: 10.1111/trf.17869. Epub 2024 May 30.
6
Approaches to increase the validity of gene family identification using manual homology search tools.采用手动同源搜索工具提高基因家族鉴定有效性的方法。
Genetica. 2023 Dec;151(6):325-338. doi: 10.1007/s10709-023-00196-8. Epub 2023 Oct 10.
7
Bioinformatics and its role in the study of the evolution and probiotic potential of lactic acid bacteria.生物信息学及其在乳酸菌进化与益生菌潜力研究中的作用。
Food Sci Biotechnol. 2022 Aug 10;32(4):389-412. doi: 10.1007/s10068-022-01142-8. eCollection 2023 Mar.
8
Genomic profiling of dioecious Amaranthus species provides novel insights into species relatedness and sex genes.雌雄异株苋菜属物种的基因组分析为物种亲缘关系和性基因提供了新的见解。
BMC Biol. 2023 Feb 20;21(1):37. doi: 10.1186/s12915-023-01539-9.
9
Advances in experimental and computational methodologies for the study of microbial-surface interactions at different omics levels.不同组学水平下微生物-表面相互作用研究的实验和计算方法进展。
Front Microbiol. 2022 Nov 28;13:1006946. doi: 10.3389/fmicb.2022.1006946. eCollection 2022.
10
Characterization, Comparison of Two New Mitogenomes of Crocodile Newts (Caudata: Salamandridae), and Phylogenetic Implications.两型中国小鲵线粒体基因组特征、比较及其系统发育关系分析
Genes (Basel). 2022 Oct 17;13(10):1878. doi: 10.3390/genes13101878.
评估Illumina纠错工具对从头基因组组装的影响。
BMC Bioinformatics. 2017 Aug 18;18(1):374. doi: 10.1186/s12859-017-1784-8.
4
Rapid de novo assembly of the European eel genome from nanopore sequencing reads.欧洲鳗鲡基因组从头快速组装来自纳米孔测序reads。
Sci Rep. 2017 Aug 3;7(1):7213. doi: 10.1038/s41598-017-07650-6.
5
De novo yeast genome assemblies from MinION, PacBio and MiSeq platforms.基于 MinION、PacBio 和 MiSeq 平台的从头酵母基因组组装。
Sci Rep. 2017 Jun 21;7(1):3935. doi: 10.1038/s41598-017-03996-z.
6
High-quality de novo assembly of the apple genome and methylome dynamics of early fruit development.高质量的苹果基因组从头组装和早期果实发育的甲基组动态。
Nat Genet. 2017 Jul;49(7):1099-1106. doi: 10.1038/ng.3886. Epub 2017 Jun 5.
7
The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution.向日葵基因组为油脂代谢、开花和菊类植物进化提供了线索。
Nature. 2017 Jun 1;546(7656):148-152. doi: 10.1038/nature22380. Epub 2017 May 22.
8
Nextflow enables reproducible computational workflows.Nextflow支持可重复的计算工作流程。
Nat Biotechnol. 2017 Apr 11;35(4):316-319. doi: 10.1038/nbt.3820.
9
Double trouble: taxonomy and definitions of polyploidy.双重麻烦:多倍体的分类学与定义
New Phytol. 2017 Jan;213(2):487-493. doi: 10.1111/nph.14276. Epub 2016 Nov 7.
10
Database Resources of the National Center for Biotechnology Information.美国国立医学图书馆国家生物技术信息中心数据库资源
Nucleic Acids Res. 2017 Jan 4;45(D1):D12-D17. doi: 10.1093/nar/gkw1071. Epub 2016 Nov 28.