• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组鉴定和组织特异性可变多聚腺苷酸化的预测建模。

Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.

机构信息

Department of Computer Science, Duke University, Durham, NC 27708, USA.

出版信息

Bioinformatics. 2013 Jul 1;29(13):i108-16. doi: 10.1093/bioinformatics/btt233.

DOI:10.1093/bioinformatics/btt233
PMID:23812974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3694680/
Abstract

MOTIVATION

Pre-mRNA cleavage and polyadenylation are essential steps for 3'-end maturation and subsequent stability and degradation of mRNAs. This process is highly controlled by cis-regulatory elements surrounding the cleavage/polyadenylation sites (polyA sites), which are frequently constrained by sequence content and position. More than 50% of human transcripts have multiple functional polyA sites, and the specific use of alternative polyA sites (APA) results in isoforms with variable 3'-untranslated regions, thus potentially affecting gene regulation. Elucidating the regulatory mechanisms underlying differential polyA preferences in multiple cell types has been hindered both by the lack of suitable data on the precise location of cleavage sites, as well as of appropriate tests for determining APAs with significant differences across multiple libraries.

RESULTS

We applied a tailored paired-end RNA-seq protocol to specifically probe the position of polyA sites in three human adult tissue types. We specified a linear-effects regression model to identify tissue-specific biases indicating regulated APA; the significance of differences between tissue types was assessed by an appropriately designed permutation test. This combination allowed to identify highly specific subsets of APA events in the individual tissue types. Predictive models successfully classified constitutive polyA sites from a biologically relevant background (auROC = 99.6%), as well as tissue-specific regulated sets from each other. We found that the main cis-regulatory elements described for polyadenylation are a strong, and highly informative, hallmark for constitutive sites only. Tissue-specific regulated sites were found to contain other regulatory motifs, with the canonical polyadenylation signal being nearly absent at brain-specific polyA sites. Together, our results contribute to the understanding of the diversity of post-transcriptional gene regulation.

AVAILABILITY

Raw data are deposited on SRA, accession numbers: brain SRX208132, kidney SRX208087 and liver SRX208134. Processed datasets as well as model code are published on our website: http://www.genome.duke.edu/labs/ohler/research/UTR/.

CONTACT

uwe.ohler@duke.edu.

摘要

动机

前体 mRNA 的切割和多聚腺苷酸化是 3'端成熟以及随后 mRNA 的稳定性和降解所必需的步骤。这个过程受到切割/多聚腺苷酸化位点(多聚 A 位点)周围顺式调控元件的高度调控,这些元件经常受到序列内容和位置的限制。超过 50%的人类转录本具有多个功能多聚 A 位点,而替代多聚 A 位点(APA)的特异性使用导致具有可变 3'非翻译区的异构体,从而可能影响基因调控。阐明多种细胞类型中不同多聚 A 偏好的调控机制,既受到缺乏关于切割位点精确位置的合适数据的限制,也受到缺乏在多个文库中具有显著差异的 APA 的适当测试的限制。

结果

我们应用了一种定制的配对末端 RNA-seq 方案,专门探测三个人类成年组织类型中多聚 A 位点的位置。我们指定了一个线性效应回归模型来识别指示调节 APA 的组织特异性偏差;通过适当设计的置换检验来评估组织类型之间差异的显著性。这种组合允许在单个组织类型中识别出高度特异性的 APA 事件子集。预测模型成功地将组成型多聚 A 位点从生物学相关背景中分类(auROC = 99.6%),并将彼此的组织特异性调节集分类。我们发现,多聚腺苷酸化描述的主要顺式调控元件是组成型位点的一个强有力且高度信息丰富的特征。组织特异性调节位点包含其他调控基序,而在脑特异性多聚 A 位点中几乎不存在典型的多聚腺苷酸化信号。总之,我们的结果有助于理解转录后基因调控的多样性。

可用性

原始数据已存储在 SRA 中, accession numbers:brain SRX208132,kidney SRX208087 和 liver SRX208134。处理后的数据集以及模型代码已发布在我们的网站上:http://www.genome.duke.edu/labs/ohler/research/UTR/。

联系

uwe.ohler@duke.edu。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/d435fcd860fc/btt233f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/e8ea0dc8f391/btt233f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/38d72e14d0d1/btt233f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/e5d1ba90fbfb/btt233f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/fa45ff48b59c/btt233f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/480615b1938e/btt233f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/d435fcd860fc/btt233f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/e8ea0dc8f391/btt233f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/38d72e14d0d1/btt233f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/e5d1ba90fbfb/btt233f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/fa45ff48b59c/btt233f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/480615b1938e/btt233f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/062d/3694680/d435fcd860fc/btt233f6p.jpg

相似文献

1
Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.全基因组鉴定和组织特异性可变多聚腺苷酸化的预测建模。
Bioinformatics. 2013 Jul 1;29(13):i108-16. doi: 10.1093/bioinformatics/btt233.
2
Implications of polyadenylation in health and disease.多聚腺苷酸化在健康与疾病中的意义。
Nucleus. 2014;5(6):508-19. doi: 10.4161/nucl.36360. Epub 2014 Oct 31.
3
PolyA_DB: a database for mammalian mRNA polyadenylation.PolyA_DB:一个哺乳动物mRNA多聚腺苷酸化数据库。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D116-20. doi: 10.1093/nar/gki055.
4
PolyA-miner: accurate assessment of differential alternative poly-adenylation from 3'Seq data using vector projections and non-negative matrix factorization.PolyA-miner:利用向量投影和非负矩阵分解从3'Seq数据中准确评估差异可变聚腺苷酸化
Nucleic Acids Res. 2020 Jul 9;48(12):e69. doi: 10.1093/nar/gkaa398.
5
A quantitative atlas of polyadenylation in five mammals.五个哺乳动物中多聚腺苷酸化的定量图谱。
Genome Res. 2012 Jun;22(6):1173-83. doi: 10.1101/gr.132563.111. Epub 2012 Mar 27.
6
DeepPASTA: deep neural network based polyadenylation site analysis.DeepPASTA:基于深度神经网络的聚腺苷酸化位点分析。
Bioinformatics. 2019 Nov 1;35(22):4577-4585. doi: 10.1093/bioinformatics/btz283.
7
Alternative polyadenylation analysis in animals and plants: newly developed strategies for profiling, processing and validation.动植物中的可变多聚腺苷酸化分析:新兴的谱分析、处理和验证策略。
Int J Biol Sci. 2018 Sep 7;14(12):1709-1714. doi: 10.7150/ijbs.27168. eCollection 2018.
8
A computational pipeline to infer alternative poly-adenylation from 3' sequencing data.从 3' 测序数据推断可变多聚腺苷酸化的计算流程。
Methods Enzymol. 2021;655:185-204. doi: 10.1016/bs.mie.2021.04.001. Epub 2021 Jun 5.
9
Alternative polyadenylation by sequential activation of distal and proximal PolyA sites.通过顺序激活远端和近端 PolyA 位点进行交替多聚腺苷酸化。
Nat Struct Mol Biol. 2022 Jan;29(1):21-31. doi: 10.1038/s41594-021-00709-z. Epub 2022 Jan 10.
10
PolyA_DB 2: mRNA polyadenylation sites in vertebrate genes.PolyA_DB 2:脊椎动物基因中的mRNA聚腺苷酸化位点
Nucleic Acids Res. 2007 Jan;35(Database issue):D165-8. doi: 10.1093/nar/gkl870.

引用本文的文献

1
Alternative polyadenylation and metabolic profiling in young panicle development of hybrid rice and its parents.杂交水稻及其亲本幼穗发育过程中的可变多聚腺苷酸化与代谢谱分析
Rice (N Y). 2025 Aug 21;18(1):80. doi: 10.1186/s12284-025-00834-z.
2
Alternative polyadenylation regulates the translation of metabolic and inflammation-related proteins in adipose tissue of gestational diabetes mellitus.可变聚腺苷酸化调节妊娠期糖尿病患者脂肪组织中代谢和炎症相关蛋白的翻译。
Comput Struct Biotechnol J. 2024 Mar 15;23:1298-1310. doi: 10.1016/j.csbj.2024.03.013. eCollection 2024 Dec.
3
Known sequence features explain half of all human gene ends.

本文引用的文献

1
A quantitative atlas of polyadenylation in five mammals.五个哺乳动物中多聚腺苷酸化的定量图谱。
Genome Res. 2012 Jun;22(6):1173-83. doi: 10.1101/gr.132563.111. Epub 2012 Mar 27.
2
Dragon PolyA Spotter: predictor of poly(A) motifs within human genomic DNA sequences.Dragon PolyA Spotter:在人类基因组 DNA 序列中预测多聚(A)基序的工具。
Bioinformatics. 2012 Jan 1;28(1):127-9. doi: 10.1093/bioinformatics/btr602. Epub 2011 Nov 15.
3
Signals for pre-mRNA cleavage and polyadenylation.前体 mRNA 切割和多聚腺苷酸化信号。
已知的序列特征解释了所有人类基因末端的一半。
NAR Genom Bioinform. 2023 Apr 5;5(2):lqad031. doi: 10.1093/nargab/lqad031. eCollection 2023 Jun.
4
A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq.基于 DNA 序列、bulk RNA-seq 和单细胞 RNA-seq 预测多聚腺苷酸化位点的方法综述
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):67-83. doi: 10.1016/j.gpb.2022.09.005. Epub 2022 Sep 24.
5
Profiling Alternative 3' Untranslated Regions in Sorghum using RNA-seq Data.利用RNA测序数据分析高粱中可变的3'非翻译区
Front Genet. 2020 Oct 26;11:556749. doi: 10.3389/fgene.2020.556749. eCollection 2020.
6
Poly(A)-DG: A deep-learning-based domain generalization method to identify cross-species Poly(A) signal without prior knowledge from target species.聚(A)-DG:一种基于深度学习的领域泛化方法,可在没有目标物种先验知识的情况下识别跨物种聚(A)信号。
PLoS Comput Biol. 2020 Nov 5;16(11):e1008297. doi: 10.1371/journal.pcbi.1008297. eCollection 2020 Nov.
7
DeepPASTA: deep neural network based polyadenylation site analysis.DeepPASTA:基于深度神经网络的聚腺苷酸化位点分析。
Bioinformatics. 2019 Nov 1;35(22):4577-4585. doi: 10.1093/bioinformatics/btz283.
8
A Genome-Wide Epstein-Barr Virus Polyadenylation Map and Its Antisense RNA to EBNA.一张全基因组 Epstein-Barr 病毒聚腺苷酸化图谱及其与 EBNA 的反义 RNA
J Virol. 2019 Jan 4;93(2). doi: 10.1128/JVI.01593-18. Print 2019 Jan 15.
9
Inference of the human polyadenylation code.人类多聚腺苷酸化代码推断。
Bioinformatics. 2018 Sep 1;34(17):2889-2898. doi: 10.1093/bioinformatics/bty211.
10
QAPA: a new method for the systematic analysis of alternative polyadenylation from RNA-seq data.QAPA:一种从 RNA-seq 数据中系统分析可变多聚腺苷酸化的新方法。
Genome Biol. 2018 Mar 28;19(1):45. doi: 10.1186/s13059-018-1414-4.
Wiley Interdiscip Rev RNA. 2012 May-Jun;3(3):385-96. doi: 10.1002/wrna.116. Epub 2011 Oct 19.
4
Differential genome-wide profiling of tandem 3' UTRs among human breast cancer and normal cells by high-throughput sequencing.高通量测序分析人类乳腺癌细胞和正常细胞中串联 3'UTR 的差异全基因组图谱。
Genome Res. 2011 May;21(5):741-7. doi: 10.1101/gr.115295.110. Epub 2011 Apr 7.
5
Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq.PAS-Seq 揭示的 RNA 多聚腺苷酸化的复杂和动态景观。
RNA. 2011 Apr;17(4):761-72. doi: 10.1261/rna.2581711. Epub 2011 Feb 22.
6
Characterization and prediction of mRNA polyadenylation sites in human genes.人基因中 mRNA 多聚腺苷酸化位点的特征分析与预测。
Med Biol Eng Comput. 2011 Apr;49(4):463-72. doi: 10.1007/s11517-011-0732-4. Epub 2011 Feb 1.
7
Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation.酵母和人类的综合多聚腺苷酸化位点图谱揭示了普遍存在的可变多聚腺苷酸化。
Cell. 2010 Dec 10;143(6):1018-29. doi: 10.1016/j.cell.2010.11.020.
8
POLYAR, a new computer program for prediction of poly(A) sites in human sequences.POLYAR,一个用于预测人类序列中多聚腺苷酸位点的新计算机程序。
BMC Genomics. 2010 Nov 19;11:646. doi: 10.1186/1471-2164-11-646.
9
Formation, regulation and evolution of Caenorhabditis elegans 3'UTRs.秀丽隐杆线虫 3'UTR 的形成、调控和进化。
Nature. 2011 Jan 6;469(7328):97-101. doi: 10.1038/nature09616. Epub 2010 Nov 17.
10
The landscape of C. elegans 3'UTRs.秀丽隐杆线虫 3'UTR 景观。
Science. 2010 Jul 23;329(5990):432-5. doi: 10.1126/science.1191244. Epub 2010 Jun 3.