• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 Iso-seq 分析的新方法进行高分辨率的单个分子测序的拟南芥转录组。

A high-resolution single-molecule sequencing-based Arabidopsis transcriptome using novel methods of Iso-seq analysis.

机构信息

Information and Computational Sciences, James Hutton Institute, Dundee, DD2 5DA, Scotland, UK.

The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.

出版信息

Genome Biol. 2022 Jul 7;23(1):149. doi: 10.1186/s13059-022-02711-0.

DOI:10.1186/s13059-022-02711-0
PMID:35799267
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9264592/
Abstract

BACKGROUND

Accurate and comprehensive annotation of transcript sequences is essential for transcript quantification and differential gene and transcript expression analysis. Single-molecule long-read sequencing technologies provide improved integrity of transcript structures including alternative splicing, and transcription start and polyadenylation sites. However, accuracy is significantly affected by sequencing errors, mRNA degradation, or incomplete cDNA synthesis.

RESULTS

We present a new and comprehensive Arabidopsis thaliana Reference Transcript Dataset 3 (AtRTD3). AtRTD3 contains over 169,000 transcripts-twice that of the best current Arabidopsis transcriptome and including over 1500 novel genes. Seventy-eight percent of transcripts are from Iso-seq with accurately defined splice junctions and transcription start and end sites. We develop novel methods to determine splice junctions and transcription start and end sites accurately. Mismatch profiles around splice junctions provide a powerful feature to distinguish correct splice junctions and remove false splice junctions. Stratified approaches identify high-confidence transcription start and end sites and remove fragmentary transcripts due to degradation. AtRTD3 is a major improvement over existing transcriptomes as demonstrated by analysis of an Arabidopsis cold response RNA-seq time-series. AtRTD3 provides higher resolution of transcript expression profiling and identifies cold-induced differential transcription start and polyadenylation site usage.

CONCLUSIONS

AtRTD3 is the most comprehensive Arabidopsis transcriptome currently. It improves the precision of differential gene and transcript expression, differential alternative splicing, and transcription start/end site usage analysis from RNA-seq data. The novel methods for identifying accurate splice junctions and transcription start/end sites are widely applicable and will improve single-molecule sequencing analysis from any species.

摘要

背景

准确而全面的转录本序列注释对于转录本定量和差异基因及转录本表达分析至关重要。单分子长读测序技术提供了改进的转录本结构完整性,包括可变剪接以及转录起始和聚腺苷酸化位点。然而,准确性受到测序错误、mRNA 降解或不完全 cDNA 合成的显著影响。

结果

我们提出了一个新的、全面的拟南芥参考转录数据集 3(AtRTD3)。AtRTD3 包含超过 169000 个转录本,是目前最好的拟南芥转录组的两倍,其中包括 1500 多个新基因。78%的转录本来自 Iso-seq,具有准确定义的剪接接头以及转录起始和结束位点。我们开发了新的方法来准确确定剪接接头以及转录起始和结束位点。剪接接头周围的错配图谱提供了一种强大的特征,可以区分正确的剪接接头并去除错误的剪接接头。分层方法可以识别高可信度的转录起始和结束位点,并去除由于降解而产生的片段化转录本。AtRTD3 比现有的转录组有了很大的改进,这在对拟南芥冷响应 RNA-seq 时间序列的分析中得到了证明。AtRTD3 提供了更高分辨率的转录本表达谱,并确定了冷诱导的差异转录起始和多聚腺苷酸化位点使用。

结论

AtRTD3 是目前最全面的拟南芥转录组。它提高了从 RNA-seq 数据中进行差异基因和转录本表达、差异可变剪接以及转录起始/结束位点使用分析的精度。用于识别准确剪接接头和转录起始/结束位点的新方法具有广泛的适用性,将提高来自任何物种的单分子测序分析的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/82a1f5121bef/13059_2022_2711_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/6a9808623678/13059_2022_2711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/428d14f2d631/13059_2022_2711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/16b2f6f68cea/13059_2022_2711_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/a79094d873e7/13059_2022_2711_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/c95ee627eacf/13059_2022_2711_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/82a1f5121bef/13059_2022_2711_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/6a9808623678/13059_2022_2711_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/428d14f2d631/13059_2022_2711_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/16b2f6f68cea/13059_2022_2711_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/a79094d873e7/13059_2022_2711_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/c95ee627eacf/13059_2022_2711_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/52c9/9264592/82a1f5121bef/13059_2022_2711_Fig6_HTML.jpg

相似文献

1
A high-resolution single-molecule sequencing-based Arabidopsis transcriptome using novel methods of Iso-seq analysis.利用 Iso-seq 分析的新方法进行高分辨率的单个分子测序的拟南芥转录组。
Genome Biol. 2022 Jul 7;23(1):149. doi: 10.1186/s13059-022-02711-0.
2
AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana.AtRTD——一个用于准确量化拟南芥转录本特异性表达的全面参考转录本数据集资源。
New Phytol. 2015 Oct;208(1):96-101. doi: 10.1111/nph.13545. Epub 2015 Jun 25.
3
BaRTv2: a highly resolved barley reference transcriptome for accurate transcript-specific RNA-seq quantification.BaRTv2:一个高度解析的大麦参考转录组,用于准确的转录特异性 RNA-seq 定量。
Plant J. 2022 Aug;111(4):1183-1202. doi: 10.1111/tpj.15871. Epub 2022 Jul 19.
4
LsRTDv1, a reference transcript dataset for accurate transcript-specific expression analysis in lettuce.生菜转录特异性表达分析的参考转录数据集 LsRTDv1。
Plant J. 2024 Oct;120(1):370-386. doi: 10.1111/tpj.16978. Epub 2024 Aug 15.
5
Bookend: precise transcript reconstruction with end-guided assembly.Bookend:端引导组装的精确转录本重构。
Genome Biol. 2022 Jun 29;23(1):143. doi: 10.1186/s13059-022-02700-3.
6
Discover hidden splicing variations by mapping personal transcriptomes to personal genomes.通过将个人转录组与个人基因组进行比对,发现隐藏的剪接变异。
Nucleic Acids Res. 2015 Dec 15;43(22):10612-22. doi: 10.1093/nar/gkv1099. Epub 2015 Nov 17.
7
A high quality Arabidopsis transcriptome for accurate transcript-level analysis of alternative splicing.一个用于准确进行可变剪接转录本水平分析的高质量拟南芥转录组。
Nucleic Acids Res. 2017 May 19;45(9):5061-5073. doi: 10.1093/nar/gkx267.
8
Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach.从 RNA-seq 比对中识别新的剪接接头:一种深度学习方法。
BMC Genomics. 2018 Dec 27;19(1):971. doi: 10.1186/s12864-018-5350-1.
9
PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.PacBio 全长 cDNA 测序与 RNA-seq reads 相结合极大地提高了水稻剪接转录本的发现。
Plant J. 2019 Jan;97(2):296-305. doi: 10.1111/tpj.14120. Epub 2018 Dec 3.
10
BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq.BaRTv1.0:一个改良的大麦参考转录组数据集,用于使用 RNA-seq 确定大麦转录组中的精确变化。
BMC Genomics. 2019 Dec 11;20(1):968. doi: 10.1186/s12864-019-6243-7.

引用本文的文献

1
Long Noncoding RNAs as Emerging Regulators of Seed Development, Germination, and Senescence.长链非编码RNA作为种子发育、萌发和衰老的新兴调控因子
Int J Mol Sci. 2025 Sep 6;26(17):8702. doi: 10.3390/ijms26178702.
2
Genome analyses and breeding of polyploid crops.多倍体作物的基因组分析与育种
Nat Plants. 2025 Aug 28. doi: 10.1038/s41477-025-02088-5.
3
Navigating Eukaryotic Genome Annotation Pipelines: A Route Map to Using BRAKER, Galba, and TSEBRA.探索真核生物基因组注释流程:使用BRAKER、Galba和TSEBRA的路线图

本文引用的文献

1
TRAPID 2.0: a web application for taxonomic and functional analysis of de novo transcriptomes.TRAPID 2.0:一个用于从头转录组的分类学和功能分析的网络应用程序。
Nucleic Acids Res. 2021 Sep 27;49(17):e101. doi: 10.1093/nar/gkab565.
2
A chromosome-level Amaranthus cruentus genome assembly highlights gene family evolution and biosynthetic gene clusters that may underpin the nutritional value of this traditional crop.一份藜染色体水平基因组组装图谱揭示了基因家族进化和生物合成基因簇,这些可能是藜这种传统作物具有营养价值的基础。
Plant J. 2021 Jul;107(2):613-628. doi: 10.1111/tpj.15298. Epub 2021 Jun 24.
3
Widespread premature transcription termination of NLR genes by the spen protein FPA.
Methods Mol Biol. 2025;2935:67-107. doi: 10.1007/978-1-0716-4583-3_4.
4
The role of Arabidopsis Splicing Factor 30 in floral transition and the implications of its alternative splicing.拟南芥剪接因子30在花期转换中的作用及其可变剪接的影响
Plant Physiol. 2025 Aug 4;198(4). doi: 10.1093/plphys/kiaf335.
5
A million shades of green: understanding and harnessing plant metabolic diversity.绿色的百万种色调:理解与利用植物代谢多样性
EMBO J. 2025 Jul 3. doi: 10.1038/s44318-025-00496-z.
6
At-RS31 orchestrates hierarchical cross-regulation of splicing factors and integrates alternative splicing with TOR-ABA pathways.At-RS31协调剪接因子的分级交叉调控,并将可变剪接与TOR-ABA途径整合在一起。
New Phytol. 2025 Jul;247(2):738-759. doi: 10.1111/nph.70221. Epub 2025 May 26.
7
Comprehensive mapping of Arabidopsis alternative splicing landscape reveals key insights into plant development and immunity.拟南芥可变剪接图谱的全面绘制揭示了对植物发育和免疫的关键见解。
Plant Genome. 2025 Jun;18(2):e70022. doi: 10.1002/tpg2.70022.
8
Transcriptomics in the era of long-read sequencing.长读长测序时代的转录组学
Nat Rev Genet. 2025 Mar 28. doi: 10.1038/s41576-025-00828-z.
9
The Arabidopsis thaliana core splicing factor PORCUPINE/SmE1 requires intron-mediated expression.拟南芥核心剪接因子豪猪/SmE1需要内含子介导的表达。
PLoS One. 2025 Mar 26;20(3):e0318163. doi: 10.1371/journal.pone.0318163. eCollection 2025.
10
Global profiling of CPL3-mediated alternative splicing reveals regulatory mechanisms of DGK5 in plant immunity and phosphatidic acid homeostasis.CPL3介导的可变剪接的全局分析揭示了DGK5在植物免疫和磷脂酸稳态中的调控机制。
Genome Biol. 2025 Mar 21;26(1):65. doi: 10.1186/s13059-025-03529-2.
Spen 蛋白 FPA 广泛导致 NLR 基因过早转录终止。
Elife. 2021 Apr 27;10:e65537. doi: 10.7554/eLife.65537.
4
Alternative splicing landscapes in Arabidopsis thaliana across tissues and stress conditions highlight major functional differences with animals.拟南芥不同组织和胁迫条件下的选择性剪接图谱突出了与动物的主要功能差异。
Genome Biol. 2021 Jan 14;22(1):35. doi: 10.1186/s13059-020-02258-y.
5
Utilizing PacBio Iso-Seq for Novel Transcript and Gene Discovery of Abiotic Stress Responses in L.利用 PacBio Iso-Seq 进行非生物胁迫响应的 L. 新转录本和基因发现
Int J Mol Sci. 2020 Oct 31;21(21):8148. doi: 10.3390/ijms21218148.
6
Illuminating the dark side of the human transcriptome with long read transcript sequencing.利用长读转录组测序揭示人类转录组的暗面。
BMC Genomics. 2020 Oct 30;21(1):751. doi: 10.1186/s12864-020-07123-7.
7
High-resolution profile of transcriptomes reveals a role of alternative splicing for modulating response to nitrogen in maize.高分辨率转录组图谱揭示了可变剪接在调节玉米氮响应中的作用。
BMC Genomics. 2020 May 11;21(1):353. doi: 10.1186/s12864-020-6769-8.
8
Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and mA modification.纳米孔直接 RNA 测序绘制拟南芥 mRNA 加工和 mA 修饰的复杂性图谱。
Elife. 2020 Jan 14;9:e49658. doi: 10.7554/eLife.49658.
9
PacBio single-molecule long-read sequencing shed new light on the transcripts and splice isoforms of the perennial ryegrass.PacBio 单分子长读测序技术为多年生黑麦草的转录本和剪接异构体提供了新的见解。
Mol Genet Genomics. 2020 Mar;295(2):475-489. doi: 10.1007/s00438-019-01635-y. Epub 2020 Jan 1.
10
Transcriptome assembly from long-read RNA-seq alignments with StringTie2.基于长读 RNA-seq 比对的转录组组装与 StringTie2。
Genome Biol. 2019 Dec 16;20(1):278. doi: 10.1186/s13059-019-1910-1.