SQANTI-reads: a tool for the quality assessment of long read data in multi-sample lrRNA-seq experiments.

Authors

Keil Netanya, Monzó Carolina, McIntyre Lauren, Conesa Ana

Affiliations

Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA, 32610.

University of Florida Genetics Institute, University of Florida, Gainesville, FL, USA, 32610.

Publication

bioRxiv. 2024 Sep 17:2024.08.23.609463. doi: 10.1101/2024.08.23.609463.

DOI: 10.1101/2024.08.23.609463
PMID: 39229095
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11370609/
Abstract

SQANTI-reads leverages SQANTI3, a tool for the analysis of the quality of transcript models, to develop a read-level quality control framework for replicated long-read RNA-seq experiments. The number and distribution of reads, as well as the number and distribution of unique junction chains (transcript splicing patterns), in SQANTI3 structural categories are informative of raw data quality. Multi-sample visualizations of QC metrics are presented by experimental design factors to identify outliers. We introduce new metrics for 1) the identification of potentially under-annotated genes and putative novel transcripts and for 2) quantifying variation in junction donors and acceptors. We applied SQANTI-reads to two datasets, a developmental experiment and a multi-platform dataset from the LRGASP project, and demonstrate that the tool effectively reveals the impact of read coverage on data quality and readily identifies strong and weak splicing sites. SQANTI-reads is open source and available for download on GitHub.
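
The read-level summaries named in the abstract (reads per structural category and unique junction chains per category) can be illustrated with a small sketch. This is not the SQANTI-reads implementation or its input format; the function name `summarize_sample`, the tuple layout, and the toy data are assumptions made for illustration only.

```python
from collections import Counter


def summarize_sample(reads):
    """Summarize one sample's reads (hypothetical layout, not SQANTI-reads' format).

    `reads` is an iterable of (structural_category, junction_chain) tuples,
    one per read. `junction_chain` is a hashable value such as a tuple of
    (donor, acceptor) coordinates, or None for reads without junctions.
    """
    reads = list(reads)

    # Number of reads in each structural category.
    cat_counts = Counter(cat for cat, _ in reads)
    total = sum(cat_counts.values())

    # Distribution of reads across categories (empty if there are no reads).
    proportions = {cat: n / total for cat, n in cat_counts.items()} if total else {}

    # Unique junction chains (distinct splicing patterns) seen per category.
    chains_by_cat = {}
    for cat, chain in reads:
        if chain is not None:
            chains_by_cat.setdefault(cat, set()).add(chain)
    ujc_counts = {cat: len(chains) for cat, chains in chains_by_cat.items()}

    return {"n_reads": total, "proportions": proportions, "ujc_counts": ujc_counts}


# Toy data: category labels and coordinates are illustrative only.
toy_sample = [
    ("full-splice_match", ((100, 200),)),
    ("full-splice_match", ((100, 200),)),
    ("novel_in_catalog", ((100, 250),)),
    ("incomplete-splice_match", None),
]
toy_summary = summarize_sample(toy_sample)
```

Computing such a summary for every sample and comparing the proportions across replicates or experimental groups is one simple way to look for outlying samples, in the spirit of the multi-sample QC the abstract describes.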

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c26/11412362/5c4f3ecf833d/nihpp-2024.08.23.609463v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c26/11412362/2db8d2c09b95/nihpp-2024.08.23.609463v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c26/11412362/95c2ad0366c5/nihpp-2024.08.23.609463v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c26/11412362/a02001a1c978/nihpp-2024.08.23.609463v2-f0003.jpg
