PAcluster: Clustering polyadenylation site data using canonical correlation analysis.

Authors

Ji Guoli, Lin Qianmin, Long Yuqi, Ye Congting, Ye Wenbin, Wu Xiaohui

Affiliations

* Department of Automation, Xiamen University, Xiamen, Fujian, P. R. China.

† College of the Environment and Ecology, Xiamen University, Xiamen, Fujian, P. R. China.

Publication information

J Bioinform Comput Biol. 2017 Oct;15(5):1750018. doi: 10.1142/S0219720017500184. Epub 2017 Aug 16.

DOI: 10.1142/S0219720017500184
PMID: 28874086

Abstract

Alternative polyadenylation (APA) is a pervasive mechanism that contributes to gene regulation. The growing number of sequenced poly(A) sites places new demands on computational methods for investigating APA regulation. Cluster analysis is important for identifying groups of co-expressed genes. However, clustering of poly(A) sites has not been extensively studied, and most APA studies have not considered the distribution, abundance, and variation of APA sites in each gene. Here we constructed a two-layer model based on canonical correlation analysis (CCA) to explore the underlying biological mechanisms in APA regulation. The first layer quantifies the general correlation of APA sites across various conditions between genes, and the second layer identifies genes with statistically significant correlation in their APA patterns to infer APA-specific gene clusters. Using hierarchical clustering, we comprehensively compared our method with four other widely used distance measures based on three performance indexes. Results showed that our method significantly enhanced the clustering performance for both synthetic and real poly(A) site data and could generate clusters with more biological meaning. We have implemented the CCA-based method as a publicly available R package called PAcluster, which provides an efficient solution for clustering large APA-specific biological datasets.
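
The general idea of a CCA-based distance between genes, followed by hierarchical clustering, can be illustrated with a minimal Python sketch. This is not the PAcluster API and not necessarily the authors' exact formulation. The function names, the choice of `1 - (largest canonical correlation)` as the distance, the average-linkage rule, and the synthetic data are all assumptions made for illustration. The second-layer significance test mentioned in the abstract is not covered here.

```python
import numpy as np

def canonical_correlations(x, y, tol=1e-10):
    """Canonical correlations between two (conditions x sites) matrices.

    Each matrix is column-centered, an orthonormal basis of its column
    space is taken from the SVD, and the singular values of the product
    of the two bases are the canonical correlations.
    """
    def orthonormal_basis(m):
        m = m - m.mean(axis=0)                 # center each site across conditions
        u, s, _ = np.linalg.svd(m, full_matrices=False)
        return u[:, s > tol]                   # drop numerically null directions

    qx, qy = orthonormal_basis(x), orthonormal_basis(y)
    if qx.shape[1] == 0 or qy.shape[1] == 0:   # a gene with no variation at all
        return np.zeros(1)
    rho = np.linalg.svd(qx.T @ qy, compute_uv=False)
    return np.clip(rho, 0.0, 1.0)

def cca_distance(x, y):
    """Assumed distance: one minus the largest canonical correlation."""
    return 1.0 - canonical_correlations(x, y)[0]

def distance_matrix(genes):
    """Symmetric matrix of pairwise CCA distances between genes."""
    n = len(genes)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = cca_distance(genes[i], genes[j])
    return dist

def average_linkage(dist, n_clusters):
    """Naive agglomerative clustering with average linkage."""
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = np.mean([dist[i, j] for i in clusters[a] for j in clusters[b]])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters.pop(b)   # merge the closest pair
    labels = np.empty(len(dist), dtype=int)
    for k, members in enumerate(clusters):
        labels[members] = k
    return labels

# Synthetic example (hypothetical data): six genes with two poly(A) sites each,
# measured over 40 conditions. Three genes follow a sine profile, three a cosine.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 2.0 * np.pi, 40, endpoint=False)
genes = []
for profile in (np.sin(t), np.cos(t)):
    for _ in range(3):
        weights = rng.uniform(0.5, 2.0, size=2)          # per-site usage level
        noise = rng.normal(scale=0.01, size=(40, 2))
        genes.append(np.outer(profile, weights) + noise)

labels = average_linkage(distance_matrix(genes), n_clusters=2)
print(labels)
```

One caveat of this simple distance: when the number of poly(A) sites in a gene approaches the number of conditions, the largest canonical correlation saturates at 1 regardless of the data, so a sketch like this is only meaningful when conditions clearly outnumber sites.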

Similar articles

1. PAcluster: Clustering polyadenylation site data using canonical correlation analysis.
J Bioinform Comput Biol. 2017 Oct;15(5):1750018. doi: 10.1142/S0219720017500184. Epub 2017 Aug 16.
2. Cluster analysis of replicated alternative polyadenylation data using canonical correlation analysis.
BMC Genomics. 2019 Jan 22;20(1):75. doi: 10.1186/s12864-019-5433-7.
3. movAPA: modeling and visualization of dynamics of alternative polyadenylation across biological samples.
Bioinformatics. 2021 Aug 25;37(16):2470-2472. doi: 10.1093/bioinformatics/btaa997.
4. VAAPA: a web platform for visualization and analysis of alternative polyadenylation.
Comput Biol Med. 2015 Feb;57:20-5. doi: 10.1016/j.compbiomed.2014.11.010. Epub 2014 Nov 24.
5. Alternative polyadenylation and gene expression regulation in plants.
Wiley Interdiscip Rev RNA. 2011 May-Jun;2(3):445-58. doi: 10.1002/wrna.59. Epub 2010 Nov 9.
6. TAPAS: tool for alternative polyadenylation site analysis.
Bioinformatics. 2018 Aug 1;34(15):2521-2529. doi: 10.1093/bioinformatics/bty110.
7. PlantAPAdb: A Comprehensive Database for Alternative Polyadenylation Sites in Plants.
Plant Physiol. 2020 Jan;182(1):228-242. doi: 10.1104/pp.19.00943. Epub 2019 Nov 25.
8. APAtrap: identification and quantification of alternative polyadenylation sites from RNA-seq data.
Bioinformatics. 2018 Jun 1;34(11):1841-1849. doi: 10.1093/bioinformatics/bty029.
9. Identification and Characterization of Transcripts Regulated by Circadian Alternative Polyadenylation in Mouse Liver.
G3 (Bethesda). 2018 Nov 6;8(11):3539-3548. doi: 10.1534/g3.118.200559.
10. PlantAPA: A Portal for Visualization and Analysis of Alternative Polyadenylation in Plants.
Front Plant Sci. 2016 Jun 21;7:889. doi: 10.3389/fpls.2016.00889. eCollection 2016.

Cited by

1. Alternative polyadenylation analysis in animals and plants: newly developed strategies for profiling, processing and validation.
Int J Biol Sci. 2018 Sep 7;14(12):1709-1714. doi: 10.7150/ijbs.27168. eCollection 2018.
2. Alternative polyadenylation drives genome-to-phenome information detours in the AMPKα1 and AMPKα2 knockout mice.
Sci Rep. 2018 Apr 24;8(1):6462. doi: 10.1038/s41598-018-24683-7.