• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用核苷酸频率预测人类DNA序列中的聚腺苷酸化信号。

Prediction of polyadenylation signals in human DNA sequences using nucleotide frequencies.

作者信息

Ahmed Firoz, Kumar Manish, Raghava Gajendra P S

机构信息

Bioinformatics Centre, Institute of Microbial Technology, Chandigarh, India.

出版信息

In Silico Biol. 2009;9(3):135-48.

PMID:19795571
Abstract

The polyadenylation signal plays a key role in determining the site for addition of a polyadenylated tail to nascent mRNA and its mutation(s) are reported in many diseases. Thus, identifying poly(A) sites is important for understanding the regulation and stability of mRNA. In this study, Support Vector Machine (SVM) models have been developed for predicting poly(A) signals in a DNA sequence using 100 nucleotides, each upstream and downstream of this signal. Here, we introduced a novel split nucleotide frequency technique, and the models thus developed achieved maximum Matthews correlation coefficients (MCC) of 0.58, 0.69, 0.70 and 0.69 using mononucleotide, dinucleotide, trinucleotide, and tetranucleotide frequencies, respectively. Finally, a hybrid model developed using a combination of dinucleotide, 2nd order dinucleotide and tetranucleotide frequencies, achieved a maximum MCC of 0.72. Moreover, for independent datasets this model achieved a precision ranging from 75.8-95.7% with a sensitivity of 57%, which is better than any other known methods.

摘要

聚腺苷酸化信号在确定向新生mRNA添加聚腺苷酸化尾巴的位点方面起着关键作用,并且在许多疾病中都报道了其突变情况。因此,识别聚腺苷酸位点对于理解mRNA的调控和稳定性很重要。在本研究中,已开发出支持向量机(SVM)模型,用于使用该信号上游和下游各100个核苷酸的DNA序列预测聚腺苷酸信号。在此,我们引入了一种新颖的分裂核苷酸频率技术,使用单核苷酸、二核苷酸、三核苷酸和四核苷酸频率分别开发的模型,其最大马修斯相关系数(MCC)分别达到了0.58、0.69、0.70和0.69。最后,使用二核苷酸、二阶二核苷酸和四核苷酸频率组合开发的混合模型,其最大MCC达到了0.72。此外,对于独立数据集,该模型的精度范围为75.8 - 95.7%,灵敏度为57%,优于任何其他已知方法。

相似文献

1
Prediction of polyadenylation signals in human DNA sequences using nucleotide frequencies.利用核苷酸频率预测人类DNA序列中的聚腺苷酸化信号。
In Silico Biol. 2009;9(3):135-48.
2
Prediction of mRNA polyadenylation sites by support vector machine.利用支持向量机预测mRNA聚腺苷酸化位点
Bioinformatics. 2006 Oct 1;22(19):2320-5. doi: 10.1093/bioinformatics/btl394. Epub 2006 Jul 26.
3
An in-silico method for prediction of polyadenylation signals in human sequences.
Genome Inform. 2003;14:84-93.
4
Prediction of RNA binding sites in a protein using SVM and PSSM profile.使用支持向量机和位置特异性得分矩阵预测蛋白质中的RNA结合位点。
Proteins. 2008 Apr;71(1):189-94. doi: 10.1002/prot.21677.
5
Predicting methylation status of human DNA sequences by pseudo-trinucleotide composition.基于伪三核苷酸组成预测人类 DNA 序列的甲基化状态。
Talanta. 2011 Aug 15;85(2):1143-7. doi: 10.1016/j.talanta.2011.05.043. Epub 2011 May 27.
6
Identification and characterization of polyadenylation signal (PAS) variants in human genomic sequences based on modified EST clustering.基于改进的EST聚类对人类基因组序列中聚腺苷酸化信号(PAS)变体进行鉴定和表征。
In Silico Biol. 2008;8(3-4):347-61.
7
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.基于序列的DNA中蛋白质结合位点预测:两种支持向量机模型的比较研究
Comput Methods Programs Biomed. 2014 Nov;117(2):158-67. doi: 10.1016/j.cmpb.2014.07.009. Epub 2014 Aug 1.
8
Prediction of non-canonical polyadenylation signals in human genomic sequences based on a novel algorithm using a fuzzy membership function.基于使用模糊隶属函数的新算法对人类基因组序列中非经典聚腺苷酸化信号的预测。
J Biosci Bioeng. 2009 May;107(5):569-78. doi: 10.1016/j.jbiosc.2009.01.001.
9
An improved poly(A) motifs recognition method based on decision level fusion.一种基于决策级融合的改进型聚腺苷酸(poly(A))基序识别方法。
Comput Biol Chem. 2015 Feb;54:49-56. doi: 10.1016/j.compbiolchem.2014.12.001. Epub 2014 Dec 30.
10
Improved translation efficiency of injected mRNA during early embryonic development.
Dev Dyn. 2006 Dec;235(12):3370-8. doi: 10.1002/dvdy.20995.

引用本文的文献

1
Deciphering dysregulation and CpG methylation in hepatocellular carcinoma using multi-omics and machine learning.利用多组学和机器学习破译肝细胞癌中的失调和 CpG 甲基化。
Epigenomics. 2024;16(13):961-983. doi: 10.1080/17501911.2024.2374702. Epub 2024 Jul 29.
2
A Novel Dependoparvovirus Identified in Cloacal Swabs of Monk Parakeet () from Urban Areas of Spain.在西班牙城市地区的玄凤鹦鹉的泄殖腔拭子中发现了一种新型依赖细小病毒。
Viruses. 2023 Mar 26;15(4):850. doi: 10.3390/v15040850.
3
A Systems Biology and LASSO-Based Approach to Decipher the Transcriptome-Interactome Signature for Predicting Non-Small Cell Lung Cancer.
一种基于系统生物学和LASSO的方法来破译转录组-相互作用组特征以预测非小细胞肺癌。
Biology (Basel). 2022 Nov 30;11(12):1752. doi: 10.3390/biology11121752.
4
A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq.基于 DNA 序列、bulk RNA-seq 和单细胞 RNA-seq 预测多聚腺苷酸化位点的方法综述
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):67-83. doi: 10.1016/j.gpb.2022.09.005. Epub 2022 Sep 24.
5
From shallow to deep: some lessons learned from application of machine learning for recognition of functional genomic elements in human genome.从浅入深:机器学习在人类基因组功能基因组元件识别应用中的一些经验教训。
Hum Genomics. 2022 Feb 18;16(1):7. doi: 10.1186/s40246-022-00376-1.
6
Pax-5 Protein Expression Is Regulated by Transcriptional 3'UTR Editing.Pax-5 蛋白表达受转录 3'UTR 编辑调控。
Cells. 2021 Dec 28;11(1):76. doi: 10.3390/cells11010076.
7
Biochemical characterization of the cyclooxygenase enzyme in penaeid shrimp.对虾中环氧化酶酶的生化特性分析。
PLoS One. 2021 Apr 22;16(4):e0250276. doi: 10.1371/journal.pone.0250276. eCollection 2021.
8
A Comprehensive Analysis of -Acting RNA Elements in the SARS-CoV-2 Genome by a Bioinformatics Approach.通过生物信息学方法对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)基因组中起作用的RNA元件进行全面分析。
Front Genet. 2020 Dec 23;11:572702. doi: 10.3389/fgene.2020.572702. eCollection 2020.
9
Investigating the pathogenic SNPs in BLM helicase and their biological consequences by computational approach.通过计算方法研究 BLM 解旋酶中的致病 SNP 及其生物学后果。
Sci Rep. 2020 Jul 23;10(1):12377. doi: 10.1038/s41598-020-69033-8.
10
Integrated Network Analysis Reveals FOXM1 and MYBL2 as Key Regulators of Cell Proliferation in Non-small Cell Lung Cancer.整合网络分析揭示FOXM1和MYBL2是非小细胞肺癌细胞增殖的关键调节因子。
Front Oncol. 2019 Oct 15;9:1011. doi: 10.3389/fonc.2019.01011. eCollection 2019.