• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

InPAS:一个用于从大量RNA测序数据中识别新型聚腺苷酸化位点和可变聚腺苷酸化的R/Bioconductor软件包。

InPAS: An R/Bioconductor Package for Identifying Novel Polyadenylation Sites and Alternative Polyadenylation from Bulk RNA-seq Data.

作者信息

Ou Jianhong, Liu Haibo, Park Sungmi, Green Michael R, Zhu Lihua Julie

机构信息

Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA.

Regeneration Center, Duke University School of Medicine, Duke University, Durham, NC 27701, USA.

出版信息

Front Biosci (Schol Ed). 2024 Dec 17;16(4):21. doi: 10.31083/j.fbs1604021.

DOI:10.31083/j.fbs1604021
PMID:39736014
Abstract

BACKGROUND

Alternative cleavage and polyadenylation (APA) is a crucial post-transcriptional gene regulation mechanism that regulates gene expression in eukaryotes by increasing the diversity and complexity of both the transcriptome and proteome. Despite the development of more than a dozen experimental methods over the last decade to identify and quantify APA events, widespread adoption of these methods has been limited by technical, financial, and time constraints. Consequently, APA remains poorly understood in most eukaryotes. However, RNA sequencing (RNA-seq) technology has revolutionized transcriptome profiling and recent studies have shown that RNA-seq data can be leveraged to identify and quantify APA events.

RESULTS

To fully capitalize on the exponentially growing RNA-seq data, we developed InPAS (Identification of Novel alternative PolyAdenylation Sites), an R/Bioconductor package for accurate identification of novel and known cleavage and polyadenylation sites (CPSs), as well as quantification of APA from RNA-seq data of various experimental designs. Compared to other APA analysis tools, InPAS offers several important advantages, including the ability to detect both novel proximal and distal CPSs, to fine tune positions of CPSs using a naïve Bayes classifier based on flanking sequence features, and to identify APA events from RNA-seq data of complex experimental designs using linear models. We benchmarked the performance of InPAS and other leading tools using simulated and experimental RNA-seq data with matched 3'-end RNA-seq data. Our results reveal that InPAS frequently outperforms existing tools in terms of precision, sensitivity, and specificity. Furthermore, we demonstrate its scalability and versatility by applying it to large, diverse RNA-seq datasets.

CONCLUSIONS

InPAS is an efficient and robust tool for identifying and quantifying APA events using readily accessible conventional RNA-seq data. Its versatility opens doors to explore APA regulation across diverse eukaryotic systems with various experimental designs. We believe that InPAS will drive APA research forward, deepening our understanding of its role in regulating gene expression, and potentially leading to the discovery of biomarkers or therapeutics for diseases.

摘要

背景

可变剪接和多聚腺苷酸化(APA)是一种关键的转录后基因调控机制,通过增加转录组和蛋白质组的多样性和复杂性来调节真核生物中的基因表达。尽管在过去十年中已经开发了十多种实验方法来识别和量化APA事件,但这些方法的广泛应用受到技术、资金和时间限制。因此,在大多数真核生物中,APA仍然知之甚少。然而,RNA测序(RNA-seq)技术彻底改变了转录组分析,最近的研究表明,可以利用RNA-seq数据来识别和量化APA事件。

结果

为了充分利用呈指数增长的RNA-seq数据,我们开发了InPAS(新型可变多聚腺苷酸化位点识别),这是一个R/Bioconductor软件包,用于准确识别新的和已知的切割和多聚腺苷酸化位点(CPS),以及从各种实验设计的RNA-seq数据中量化APA。与其他APA分析工具相比,InPAS具有几个重要优势,包括能够检测新的近端和远端CPS,使用基于侧翼序列特征的朴素贝叶斯分类器微调CPS的位置,以及使用线性模型从复杂实验设计的RNA-seq数据中识别APA事件。我们使用模拟和实验RNA-seq数据以及匹配的3'端RNA-seq数据对InPAS和其他领先工具的性能进行了基准测试。我们的结果表明,InPAS在精度、灵敏度和特异性方面经常优于现有工具。此外,我们通过将其应用于大型、多样的RNA-seq数据集来证明其可扩展性和通用性。

结论

InPAS是一种高效且强大的工具,可使用易于获取的传统RNA-seq数据来识别和量化APA事件。它的通用性为探索各种实验设计的不同真核系统中的APA调控打开了大门。我们相信InPAS将推动APA研究向前发展,加深我们对其在调节基因表达中的作用的理解,并有可能导致发现疾病的生物标志物或治疗方法。

相似文献

1
InPAS: An R/Bioconductor Package for Identifying Novel Polyadenylation Sites and Alternative Polyadenylation from Bulk RNA-seq Data.InPAS:一个用于从大量RNA测序数据中识别新型聚腺苷酸化位点和可变聚腺苷酸化的R/Bioconductor软件包。
Front Biosci (Schol Ed). 2024 Dec 17;16(4):21. doi: 10.31083/j.fbs1604021.
2
PolyAMiner-Bulk is a deep learning-based algorithm that decodes alternative polyadenylation dynamics from bulk RNA-seq data.PolyAMiner-Bulk 是一种基于深度学习的算法,可从批量 RNA-seq 数据中解码可变多聚腺苷酸化动态。
Cell Rep Methods. 2024 Feb 26;4(2):100707. doi: 10.1016/j.crmeth.2024.100707. Epub 2024 Feb 6.
3
A survey on identification and quantification of alternative polyadenylation sites from RNA-seq data.基于 RNA-seq 数据的可变多聚腺苷酸化位点鉴定与定量研究综述
Brief Bioinform. 2020 Jul 15;21(4):1261-1276. doi: 10.1093/bib/bbz068.
4
Benchmarking sequencing methods and tools that facilitate the study of alternative polyadenylation.用于研究可变多聚腺苷酸化的测序方法和工具的基准测试。
Genome Biol. 2021 Oct 14;22(1):291. doi: 10.1186/s13059-021-02502-z.
5
Accurate transcriptome-wide identification and quantification of alternative polyadenylation from RNA-seq data with APAIQ.利用 APAIQ 从 RNA-seq 数据中准确识别和定量全转录组可变多聚腺苷酸化。
Genome Res. 2023 Apr;33(4):644-657. doi: 10.1101/gr.277177.122. Epub 2023 Apr 28.
6
APA-Scan: detection and visualization of 3'-UTR alternative polyadenylation with RNA-seq and 3'-end-seq data.APA-Scan:利用 RNA-seq 和 3'-端测序数据检测和可视化 3'-UTR 可变多聚腺苷酸化
BMC Bioinformatics. 2022 Sep 28;23(Suppl 3):396. doi: 10.1186/s12859-022-04939-w.
7
APAtrap: identification and quantification of alternative polyadenylation sites from RNA-seq data.APAtrap:从 RNA-seq 数据中鉴定和定量分析可变多聚腺苷酸化位点。
Bioinformatics. 2018 Jun 1;34(11):1841-1849. doi: 10.1093/bioinformatics/bty029.
8
TAPAS: tool for alternative polyadenylation site analysis.TAPAS:可变多聚腺苷酸化位点分析工具。
Bioinformatics. 2018 Aug 1;34(15):2521-2529. doi: 10.1093/bioinformatics/bty110.
9
Protocol for unlocking alternative polyadenylation insights from bulk RNA-seq data with PolyAMiner-Bulk.使用PolyAMiner-Bulk从大量RNA测序数据中解锁替代性聚腺苷酸化见解的方案。
STAR Protoc. 2025 Mar 21;6(1):103664. doi: 10.1016/j.xpro.2025.103664. Epub 2025 Feb 28.
10
Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data.从 RNA-seq 数据中识别和定量多聚腺苷酸化位点的方法的可扩展基准测试。
RNA. 2023 Dec;29(12):1839-1855. doi: 10.1261/rna.079849.123. Epub 2023 Oct 10.