• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测剪接因子 SRSF1 识别的 RNA 结合区域的序列和结构特异性。

Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1.

机构信息

Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, IN 46202, USA.

出版信息

BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2164-12-S5-S8.

DOI:10.1186/1471-2164-12-S5-S8
PMID:22369183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287504/
Abstract

BACKGROUND

RNA-binding proteins (RBPs) play diverse roles in eukaryotic RNA processing. Despite their pervasive functions in coding and noncoding RNA biogenesis and regulation, elucidating the sequence specificities that define protein-RNA interactions remains a major challenge. Recently, CLIP-seq (Cross-linking immunoprecipitation followed by high-throughput sequencing) has been successfully implemented to study the transcriptome-wide binding patterns of SRSF1, PTBP1, NOVA and fox2 proteins. These studies either adopted traditional methods like Multiple EM for Motif Elicitation (MEME) to discover the sequence consensus of RBP's binding sites or used Z-score statistics to search for the overrepresented nucleotides of a certain size. We argue that most of these methods are not well-suited for RNA motif identification, as they are unable to incorporate the RNA structural context of protein-RNA interactions, which may affect to binding specificity. Here, we describe a novel model-based approach--RNAMotifModeler to identify the consensus of protein-RNA binding regions by integrating sequence features and RNA secondary structures.

RESULTS

As an example, we implemented RNAMotifModeler on SRSF1 (SF2/ASF) CLIP-seq data. The sequence-structural consensus we identified is a purine-rich octamer 'AGAAGAAG' in a highly single-stranded RNA context. The unpaired probabilities, the probabilities of not forming pairs, are significantly higher than negative controls and the flanking sequence surrounding the binding site, indicating that SRSF1 proteins tend to bind on single-stranded RNA. Further statistical evaluations revealed that the second and fifth bases of SRSF1octamer motif have much stronger sequence specificities, but weaker single-strandedness, while the third, fourth, sixth and seventh bases are far more likely to be single-stranded, but have more degenerate sequence specificities. Therefore, we hypothesize that nucleotide specificity and secondary structure play complementary roles during binding site recognition by SRSF1.

CONCLUSION

In this study, we presented a computational model to predict the sequence consensus and optimal RNA secondary structure for protein-RNA binding regions. The successful implementation on SRSF1 CLIP-seq data demonstrates great potential to improve our understanding on the binding specificity of RNA binding proteins.

摘要

背景

RNA 结合蛋白(RBPs)在真核生物 RNA 加工过程中发挥着多样化的作用。尽管它们在编码和非编码 RNA 的生物发生和调控中具有普遍的功能,但阐明定义蛋白-RNA 相互作用的序列特异性仍然是一个主要挑战。最近,CLIP-seq(交联免疫沉淀 followed by 高通量测序)已成功用于研究 SRSF1、PTBP1、NOVA 和 fox2 蛋白的转录组范围结合模式。这些研究要么采用传统方法,如多模式启发式 for 基序提取(MEME)来发现 RBP 结合位点的序列共识,要么使用 Z 分数统计来搜索特定大小的过代表核苷酸。我们认为,这些方法中的大多数都不适合 RNA 基序识别,因为它们无法将蛋白-RNA 相互作用的 RNA 结构上下文纳入考虑,而这可能会影响结合特异性。在这里,我们描述了一种新的基于模型的方法——RNAMotifModeler,通过整合序列特征和 RNA 二级结构来识别蛋白-RNA 结合区域的共识。

结果

作为一个例子,我们在 SRSF1(SF2/ASF)CLIP-seq 数据上实现了 RNAMotifModeler。我们确定的序列-结构共识是一个富含嘌呤的八聚体'AGAAGAAG',在高度单链 RNA 环境中。未配对的概率,即不形成对的概率,明显高于阴性对照和结合位点周围的侧翼序列,表明 SRSF1 蛋白倾向于结合单链 RNA。进一步的统计评估表明,SRSF1 八聚体基序的第二和第五个碱基具有更强的序列特异性,但较弱的单链性,而第三个、第四个、第六个和第七个碱基更有可能是单链的,但具有更多的简并序列特异性。因此,我们假设核苷酸特异性和二级结构在 SRSF1 结合位点识别过程中发挥互补作用。

结论

在这项研究中,我们提出了一种计算模型来预测蛋白-RNA 结合区域的序列共识和最佳 RNA 二级结构。在 SRSF1 CLIP-seq 数据上的成功实施表明,该模型具有很大的潜力,可以提高我们对 RNA 结合蛋白结合特异性的理解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/ffb54524bd7a/1471-2164-12-S5-S8-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/b4f8a3796e5a/1471-2164-12-S5-S8-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/06f9b6db1937/1471-2164-12-S5-S8-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/4ac47519e02d/1471-2164-12-S5-S8-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/82c86b1f6e8f/1471-2164-12-S5-S8-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/17248c7eb7d4/1471-2164-12-S5-S8-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/764b68ded3cc/1471-2164-12-S5-S8-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/ffb54524bd7a/1471-2164-12-S5-S8-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/b4f8a3796e5a/1471-2164-12-S5-S8-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/06f9b6db1937/1471-2164-12-S5-S8-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/4ac47519e02d/1471-2164-12-S5-S8-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/82c86b1f6e8f/1471-2164-12-S5-S8-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/17248c7eb7d4/1471-2164-12-S5-S8-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/764b68ded3cc/1471-2164-12-S5-S8-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/148c/3287504/ffb54524bd7a/1471-2164-12-S5-S8-7.jpg

相似文献

1
Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1.预测剪接因子 SRSF1 识别的 RNA 结合区域的序列和结构特异性。
BMC Genomics. 2011 Dec 23;12 Suppl 5(Suppl 5):S8. doi: 10.1186/1471-2164-12-S5-S8.
2
A combined sequence and structure based method for discovering enriched motifs in RNA from in vivo binding data.一种基于序列和结构相结合的方法,用于从体内结合数据中发现RNA中富集的基序。
Methods. 2017 Apr 15;118-119:73-81. doi: 10.1016/j.ymeth.2017.03.003. Epub 2017 Mar 6.
3
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.剪接因子SFRS1识别功能多样的RNA转录本景观。
Genome Res. 2009 Mar;19(3):381-94. doi: 10.1101/gr.082503.108. Epub 2008 Dec 30.
4
RNA Bind-n-Seq: Measuring the Binding Affinity Landscape of RNA-Binding Proteins.RNA结合测序:测量RNA结合蛋白的结合亲和力图谱。
Methods Enzymol. 2015;558:465-493. doi: 10.1016/bs.mie.2015.02.007. Epub 2015 May 12.
5
Modeling RNA-Binding Protein Specificity In Vivo by Precisely Registering Protein-RNA Crosslink Sites.通过精确记录蛋白 - RNA 交联位点来模拟体内 RNA 结合蛋白的特异性。
Mol Cell. 2019 Jun 20;74(6):1189-1204.e6. doi: 10.1016/j.molcel.2019.02.002.
6
Leveraging cross-link modification events in CLIP-seq for motif discovery.利用CLIP-seq中的交联修饰事件进行基序发现。
Nucleic Acids Res. 2015 Jan;43(1):95-103. doi: 10.1093/nar/gku1288. Epub 2014 Dec 10.
7
Activation-induced cytidine deaminase (AID)-dependent somatic hypermutation requires a splice isoform of the serine/arginine-rich (SR) protein SRSF1.激活诱导胞嘧啶脱氨酶(AID)依赖性体细胞超突变需要丝氨酸/精氨酸丰富(SR)蛋白 SRSF1 的剪接异构体。
Proc Natl Acad Sci U S A. 2012 Jan 24;109(4):1216-21. doi: 10.1073/pnas.1120368109. Epub 2012 Jan 9.
8
CapR: revealing structural specificities of RNA-binding protein target recognition using CLIP-seq data.CapR:利用CLIP-seq数据揭示RNA结合蛋白靶点识别的结构特异性
Genome Biol. 2014 Jan 21;15(1):R16. doi: 10.1186/gb-2014-15-1-r16.
9
Global regulation of alternative RNA splicing by the SR-rich protein RBM39.富含SR的蛋白质RBM39对可变RNA剪接的全局调控。
Biochim Biophys Acta. 2016 Aug;1859(8):1014-24. doi: 10.1016/j.bbagrm.2016.06.007. Epub 2016 Jun 21.
10
SARNAclust: Semi-automatic detection of RNA protein binding motifs from immunoprecipitation data.SARNAclust:从免疫沉淀数据中半自动检测 RNA 蛋白质结合基序。
PLoS Comput Biol. 2018 Mar 29;14(3):e1006078. doi: 10.1371/journal.pcbi.1006078. eCollection 2018 Mar.

引用本文的文献

1
The FANCI/FANCD2 complex links DNA damage response to R-loop regulation through SRSF1-mediated mRNA export.FANCI/FANCD2 复合物通过 SRSF1 介导的 mRNA 输出将 DNA 损伤反应与 R 环调控联系起来。
Cell Rep. 2024 Jan 23;43(1):113610. doi: 10.1016/j.celrep.2023.113610. Epub 2024 Jan 1.
2
Transfer Learning Allows Accurate RBP Target Site Prediction with Limited Sample Sizes.迁移学习可在样本量有限的情况下实现准确的RNA结合蛋白靶位点预测。
Biology (Basel). 2023 Sep 25;12(10):1276. doi: 10.3390/biology12101276.
3
Pre-mRNA splicing order is predetermined and maintains splicing fidelity across multi-intronic transcripts.

本文引用的文献

1
RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.RNAcontext:一种学习 RNA 结合蛋白序列和结构结合偏好的新方法。
PLoS Comput Biol. 2010 Jul 1;6(7):e1000832. doi: 10.1371/journal.pcbi.1000832.
2
Genome-wide analysis of PTB-RNA interactions reveals a strategy used by the general splicing repressor to modulate exon inclusion or skipping.全基因组分析表明 PTB-RNA 相互作用揭示了一般剪接抑制剂用来调节外显子包含或跳过的策略。
Mol Cell. 2009 Dec 25;36(6):996-1006. doi: 10.1016/j.molcel.2009.12.003.
3
Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins.
前体 mRNA 剪接顺序是预先确定的,并在多内含子转录本中保持剪接保真度。
Nat Struct Mol Biol. 2023 Aug;30(8):1064-1076. doi: 10.1038/s41594-023-01035-2. Epub 2023 Jul 13.
4
Differential analysis of RNA structure probing experiments at nucleotide resolution: uncovering regulatory functions of RNA structure.核苷酸分辨率下 RNA 结构探测实验的差异分析:揭示 RNA 结构的调控功能。
Nat Commun. 2022 Jul 22;13(1):4227. doi: 10.1038/s41467-022-31875-3.
5
The GAUGAA Motif Is Responsible for the Binding between circSMARCA5 and SRSF1 and Related Downstream Effects on Glioblastoma Multiforme Cell Migration and Angiogenic Potential.GAUGAA 基序负责 circSMARCA5 与 SRSF1 之间的结合,并对多形性胶质母细胞瘤细胞迁移和血管生成潜能产生相关的下游影响。
Int J Mol Sci. 2021 Feb 7;22(4):1678. doi: 10.3390/ijms22041678.
6
Structure of SRSF1 RRM1 bound to RNA reveals an unexpected bimodal mode of interaction and explains its involvement in SMN1 exon7 splicing.SRSF1 RRM1 结合 RNA 的结构揭示了一种意想不到的双模态相互作用模式,并解释了其在 SMN1 外显子 7 剪接中的参与。
Nat Commun. 2021 Jan 18;12(1):428. doi: 10.1038/s41467-020-20481-w.
7
Splicing Enhancers at Intron-Exon Borders Participate in Acceptor Splice Sites Recognition.剪接增强子在内含子-外显子边界参与供体位点识别。
Int J Mol Sci. 2020 Sep 8;21(18):6553. doi: 10.3390/ijms21186553.
8
DeepCLIP: predicting the effect of mutations on protein-RNA binding with deep learning.DeepCLIP:利用深度学习预测突变对蛋白质-RNA 结合的影响。
Nucleic Acids Res. 2020 Jul 27;48(13):7099-7118. doi: 10.1093/nar/gkaa530.
9
SRSF1 and PTBP1 Are -Acting Factors That Suppress the Formation of a CD33 Splicing Isoform Linked to Alzheimer's Disease Risk.SRSF1 和 PTBP1 是抑制与阿尔茨海默病风险相关的 CD33 剪接异构体形成的反式作用因子。
Mol Cell Biol. 2019 Aug 27;39(18). doi: 10.1128/MCB.00568-18. Print 2019 Sep 15.
10
A CLK3-HMGA2 Alternative Splicing Axis Impacts Human Hematopoietic Stem Cell Molecular Identity throughout Development.CLK3-HMGA2 可变剪接轴影响人类造血干细胞在整个发育过程中的分子特征。
Cell Stem Cell. 2018 Apr 5;22(4):575-588.e7. doi: 10.1016/j.stem.2018.03.012.
对RNA结合蛋白的RNA识别特异性进行快速系统分析。
Nat Biotechnol. 2009 Jul;27(7):667-70. doi: 10.1038/nbt.1550. Epub 2009 Jun 28.
4
An RNA code for the FOX2 splicing regulator revealed by mapping RNA-protein interactions in stem cells.通过绘制干细胞中的RNA-蛋白质相互作用图谱揭示的FOX2剪接调节因子的RNA编码
Nat Struct Mol Biol. 2009 Feb;16(2):130-7. doi: 10.1038/nsmb.1545. Epub 2009 Jan 11.
5
Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.剪接因子SFRS1识别功能多样的RNA转录本景观。
Genome Res. 2009 Mar;19(3):381-94. doi: 10.1101/gr.082503.108. Epub 2008 Dec 30.
6
Identification of nuclear and cytoplasmic mRNA targets for the shuttling protein SF2/ASF.穿梭蛋白SF2/ASF的细胞核和细胞质mRNA靶标的鉴定
PLoS One. 2008 Oct 8;3(10):e3369. doi: 10.1371/journal.pone.0003369.
7
Adaptable molecular interactions guide phosphorylation of the SR protein ASF/SF2 by SRPK1.适应性分子相互作用指导SR蛋白激酶1(SRPK1)对SR蛋白ASF/SF2的磷酸化作用。
J Mol Biol. 2008 Oct 17;382(4):894-909. doi: 10.1016/j.jmb.2008.07.055. Epub 2008 Jul 26.
8
RNA secondary structure analysis using the Vienna RNA package.使用维也纳RNA软件包进行RNA二级结构分析。
Curr Protoc Bioinformatics. 2004 Feb;Chapter 12:Unit 12.2. doi: 10.1002/0471250953.bi1202s04.
9
RNA-binding proteins and post-transcriptional gene regulation.RNA结合蛋白与转录后基因调控
FEBS Lett. 2008 Jun 18;582(14):1977-86. doi: 10.1016/j.febslet.2008.03.004. Epub 2008 Mar 13.
10
A sliding docking interaction is essential for sequential and processive phosphorylation of an SR protein by SRPK1.滑动对接相互作用对于SRPK1对SR蛋白的顺序性和持续性磷酸化至关重要。
Mol Cell. 2008 Mar 14;29(5):563-76. doi: 10.1016/j.molcel.2007.12.017.