Long-Read Annotation: Automated Eukaryotic Genome Annotation Based on Long-Read cDNA Sequencing.

Affiliations

Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands.

Laboratory of Molecular Biology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB Wageningen, the Netherlands.

Publication information

Plant Physiol. 2019 Jan;179(1):38-54. doi: 10.1104/pp.18.00848. Epub 2018 Nov 6.

DOI: 10.1104/pp.18.00848
PMID: 30401722
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6324239/
Abstract

Single-molecule full-length complementary DNA (cDNA) sequencing can aid genome annotation by revealing transcript structure and alternative splice forms, yet current annotation pipelines do not incorporate such information. Here we present long-read annotation (LoReAn) software, an automated annotation pipeline utilizing short- and long-read cDNA sequencing, protein evidence, and ab initio prediction to generate accurate genome annotations. Based on annotations of two fungal genomes ( and ) and two plant genomes (Arabidopsis [] and ), we show that LoReAn outperforms popular annotation pipelines by integrating single-molecule cDNA-sequencing data generated from either the Pacific Biosciences or MinION sequencing platforms, correctly predicting gene structure, and capturing genes missed by other annotation pipelines.
