• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TranD 中实现的用于量化可变剪接的核苷酸水平距离度量。

Nucleotide-level distance metrics to quantify alternative splicing implemented in TranD.

机构信息

Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL 32611, USA.

University of Florida Genetics Institute, University of Florida, Gainesville, FL 32611, USA.

出版信息

Nucleic Acids Res. 2024 Mar 21;52(5):e28. doi: 10.1093/nar/gkae056.

DOI:10.1093/nar/gkae056
PMID:38340337
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10954468/
Abstract

Advances in affordable transcriptome sequencing combined with better exon and gene prediction has motivated many to compare transcription across the tree of life. We develop a mathematical framework to calculate complexity and compare transcript models. Structural features, i.e. intron retention (IR), donor/acceptor site variation, alternative exon cassettes, alternative 5'/3' UTRs, are compared and the distance between transcript models is calculated with nucleotide level precision. All metrics are implemented in a PyPi package, TranD and output can be used to summarize splicing patterns for a transcriptome (1GTF) and between transcriptomes (2GTF). TranD output enables quantitative comparisons between: annotations augmented by empirical RNA-seq data and the original transcript models; transcript model prediction tools for longread RNA-seq (e.g. FLAIR versus Isoseq3); alternate annotations for a species (e.g. RefSeq vs Ensembl); and between closely related species. In C. elegans, Z. mays, D. melanogaster, D. simulans and H. sapiens, alternative exons were observed more frequently in combination with an alternative donor/acceptor than alone. Transcript models in RefSeq and Ensembl are linked and both have unique transcript models with empirical support. D. melanogaster and D. simulans, share many transcript models and long-read RNAseq data suggests that both species are under-annotated. We recommend combined references.

摘要

经济实惠的转录组测序技术的进步,加上更好的外显子和基因预测技术,促使许多人开始比较整个生命之树的转录组。我们开发了一种数学框架来计算复杂性并比较转录本模型。结构特征,如内含子保留(IR)、供体/受体位点变异、可变外显子盒、可变 5'/3'UTR,都进行了比较,并以核苷酸精度计算了转录本模型之间的距离。所有指标都在 PyPi 包 TranD 中实现,输出可用于总结转录组(1GTF)和转录组之间(2GTF)的剪接模式。TranD 输出可实现以下方面的定量比较:用经验 RNA-seq 数据增强注释与原始转录本模型之间的比较;长读 RNA-seq 的转录本模型预测工具(例如 FLAIR 与 Isoseq3)之间的比较;物种的替代注释(例如 RefSeq 与 Ensembl)之间的比较;以及密切相关的物种之间的比较。在秀丽隐杆线虫、玉米、果蝇、拟南芥和人类中,与替代供体/受体结合的替代外显子比单独出现的频率更高。RefSeq 和 Ensembl 中的转录本模型是相关联的,并且都有具有经验支持的独特转录本模型。果蝇和拟南芥有许多共同的转录本模型,长读 RNA-seq 数据表明这两个物种的注释都不足。我们建议联合参考。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/e29f80f651ae/gkae056fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/1bb93cb78b64/gkae056figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/f87b7efd77b7/gkae056fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/cda5d59dab1c/gkae056fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/f6b4ccbaa13d/gkae056fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/47c9aa1d45b4/gkae056fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/7951278f8f75/gkae056fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/16916856254b/gkae056fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/e29f80f651ae/gkae056fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/1bb93cb78b64/gkae056figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/f87b7efd77b7/gkae056fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/cda5d59dab1c/gkae056fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/f6b4ccbaa13d/gkae056fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/47c9aa1d45b4/gkae056fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/7951278f8f75/gkae056fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/16916856254b/gkae056fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ba1c/10954468/e29f80f651ae/gkae056fig7.jpg

相似文献

1
Nucleotide-level distance metrics to quantify alternative splicing implemented in TranD.TranD 中实现的用于量化可变剪接的核苷酸水平距离度量。
Nucleic Acids Res. 2024 Mar 21;52(5):e28. doi: 10.1093/nar/gkae056.
2
Correspondence of D. melanogaster and C. elegans developmental stages revealed by alternative splicing characteristics of conserved exons.通过保守外显子的可变剪接特征揭示黑腹果蝇和秀丽隐杆线虫发育阶段的对应关系。
BMC Genomics. 2017 Mar 16;18(1):234. doi: 10.1186/s12864-017-3600-2.
3
Estimating transcriptome complexities across eukaryotes.估算真核生物转录组的复杂性。
BMC Genomics. 2023 May 11;24(1):254. doi: 10.1186/s12864-023-09326-0.
4
Design of RNA splicing analysis null models for post hoc filtering of Drosophila head RNA-Seq data with the splicing analysis kit (Spanki).利用剪接分析试剂盒(Spanki)对果蝇头部 RNA-Seq 数据进行事后过滤的 RNA 剪接分析零模型设计。
BMC Bioinformatics. 2013 Nov 9;14:320. doi: 10.1186/1471-2105-14-320.
5
Global regulatory features of alternative splicing across tissues and within the nervous system of .不同组织和神经系统中剪接的全球调控特征。
Genome Res. 2020 Dec;30(12):1766-1780. doi: 10.1101/gr.267328.120. Epub 2020 Oct 30.
6
Long-read RNA sequencing reveals widespread sex-specific alternative splicing in threespine stickleback fish.长读 RNA 测序揭示了三刺鱼中广泛存在的性别特异性可变剪接。
Genome Res. 2021 Aug;31(8):1486-1497. doi: 10.1101/gr.274282.120. Epub 2021 Jun 15.
7
spliceR: an R package for classification of alternative splicing and prediction of coding potential from RNA-seq data.spliceR:一个用于 RNA-seq 数据中可变剪接分类和编码潜力预测的 R 包。
BMC Bioinformatics. 2014 Mar 23;15:81. doi: 10.1186/1471-2105-15-81.
8
Multiple insert size paired-end sequencing for deconvolution of complex transcriptomes.多重插入片段大小的双端测序技术可用于复杂转录组的去卷积分析。
RNA Biol. 2012 May;9(5):596-609. doi: 10.4161/rna.19683. Epub 2012 May 1.
9
Alternative Splicing Signatures in RNA-seq Data: Percent Spliced in (PSI).RNA测序数据中的可变剪接特征:剪接百分率(PSI)
Curr Protoc Hum Genet. 2015 Oct 6;87:11.16.1-11.16.14. doi: 10.1002/0471142905.hg1116s87.
10
A general definition and nomenclature for alternative splicing events.可变剪接事件的一般定义和命名法。
PLoS Comput Biol. 2008 Aug 8;4(8):e1000147. doi: 10.1371/journal.pcbi.1000147.

引用本文的文献

1
Quality assessment of long read data in multisample lrRNA-seq experiments using SQANTI-reads.使用SQANTI-reads对多样本长读长RNA测序实验中的长读长数据进行质量评估。
Genome Res. 2025 Apr 14;35(4):987-998. doi: 10.1101/gr.280021.124.
2
Notable challenges posed by long-read sequencing for the study of transcriptional diversity and genome annotation.长读长测序在转录多样性研究和基因组注释方面带来的显著挑战。
Genome Res. 2025 Apr 14;35(4):583-592. doi: 10.1101/gr.279865.124.
3
Phylogenetic Analysis of 590 Species Reveals Distinct Evolutionary Patterns of Intron-Exon Gene Structures Across Eukaryotic Lineages.

本文引用的文献

1
Systematic assessment of long-read RNA-seq methods for transcript identification and quantification.系统评估长读 RNA-seq 方法在转录本鉴定和定量中的应用。
Nat Methods. 2024 Jul;21(7):1349-1363. doi: 10.1038/s41592-024-02298-3. Epub 2024 Jun 7.
2
Sex-specific splicing occurs genome-wide during early embryogenesis.性别的特异性剪接在胚胎早期发生在全基因组范围内。
Elife. 2023 Jul 19;12:e87865. doi: 10.7554/eLife.87865.
3
IsoTools: a flexible workflow for long-read transcriptome sequencing analysis.IsoTools:一种用于长读转录组测序分析的灵活工作流程。
对590个物种的系统发育分析揭示了真核生物谱系中内含子-外显子基因结构的独特进化模式。
Mol Biol Evol. 2024 Dec 6;41(12). doi: 10.1093/molbev/msae248.
4
SQANTI-reads: a tool for the quality assessment of long read data in multi-sample lrRNA-seq experiments.SQANTI-reads:一种用于多样本长读长核糖体RNA测序实验中长读长数据质量评估的工具。
bioRxiv. 2024 Sep 17:2024.08.23.609463. doi: 10.1101/2024.08.23.609463.
Bioinformatics. 2023 Jun 1;39(6). doi: 10.1093/bioinformatics/btad364.
4
Estimating transcriptome complexities across eukaryotes.估算真核生物转录组的复杂性。
BMC Genomics. 2023 May 11;24(1):254. doi: 10.1186/s12864-023-09326-0.
5
Sex-Biased Expression Is Associated With Chromatin State in Drosophila melanogaster and Drosophila simulans.性染色体偏性表达与黑腹果蝇和拟暗果蝇的染色质状态相关。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad078.
6
Two Forms of Sexual Dimorphism in Gene Expression in Drosophila melanogaster: Their Coincidence and Evolutionary Genetics.果蝇中两种形式的性二型基因表达及其偶合和进化遗传学。
Mol Biol Evol. 2023 May 2;40(5). doi: 10.1093/molbev/msad091.
7
The Gene Ontology knowledgebase in 2023.2023 版基因本体论知识库。
Genetics. 2023 May 4;224(1). doi: 10.1093/genetics/iyad031.
8
Approaching complete genomes, transcriptomes and epi-omes with accurate long-read sequencing.采用准确的长读测序技术获取完整的基因组、转录组和表观基因组。
Nat Methods. 2023 Jan;20(1):12-16. doi: 10.1038/s41592-022-01716-8.
9
OrthoDB v11: annotation of orthologs in the widest sampling of organismal diversity.OrthoDB v11:在最广泛的生物多样性样本中注释直系同源物。
Nucleic Acids Res. 2023 Jan 6;51(D1):D445-D451. doi: 10.1093/nar/gkac998.
10
Improved transcriptome assembly using a hybrid of long and short reads with StringTie.使用长读长和短读长混合的方法进行转录组组装,可提高组装质量。
PLoS Comput Biol. 2022 Jun 1;18(6):e1009730. doi: 10.1371/journal.pcbi.1009730. eCollection 2022 Jun.