Suppr超能文献

POLYAR,一个用于预测人类序列中多聚腺苷酸位点的新计算机程序。

POLYAR, a new computer program for prediction of poly(A) sites in human sequences.

机构信息

Department of Biosciences, COMSATS Institute of Information Technology, Islamabad, Pakistan.

出版信息

BMC Genomics. 2010 Nov 19;11:646. doi: 10.1186/1471-2164-11-646.

Abstract

BACKGROUND

mRNA polyadenylation is an essential step of pre-mRNA processing in eukaryotes. Accurate prediction of the pre-mRNA 3'-end cleavage/polyadenylation sites is important for defining the gene boundaries and understanding gene expression mechanisms.

RESULTS

28761 human mapped poly(A) sites have been classified into three classes containing different known forms of polyadenylation signal (PAS) or none of them (PAS-strong, PAS-weak and PAS-less, respectively) and a new computer program POLYAR for the prediction of poly(A) sites of each class was developed. In comparison with polya_svm (till date the most accurate computer program for prediction of poly(A) sites) while searching for PAS-strong poly(A) sites in human sequences, POLYAR had a significantly higher prediction sensitivity (80.8% versus 65.7%) and specificity (66.4% versus 51.7%) However, when a similar sort of search was conducted for PAS-weak and PAS-less poly(A) sites, both programs had a very low prediction accuracy, which indicates that our knowledge about factors involved in the determination of the poly(A) sites is not sufficient to identify such polyadenylation regions.

CONCLUSIONS

We present a new classification of polyadenylation sites into three classes and a novel computer program POLYAR for prediction of poly(A) sites/regions of each of the class. In tests, POLYAR shows high accuracy of prediction of the PAS-strong poly(A) sites, though this program's efficiency in searching for PAS-weak and PAS-less poly(A) sites is not very high but is comparable to other available programs. These findings suggest that additional characteristics of such poly(A) sites remain to be elucidated. POLYAR program with a stand-alone version for downloading is available at http://cub.comsats.edu.pk/polyapredict.htm.

摘要

背景

mRNA 多聚腺苷酸化是真核生物中前体 mRNA 加工的一个必要步骤。准确预测前体 mRNA 的 3'-末端切割/多聚腺苷酸化位点对于定义基因边界和理解基因表达机制非常重要。

结果

28761 个人类映射的 poly(A) 位点已分为三类,分别包含不同形式的多聚腺苷酸化信号(PAS)或没有(分别为 PAS-强、PAS-弱和 PAS-无),并开发了一种新的计算机程序 POLYAR 用于预测每个类别的 poly(A) 位点。与 polya_svm(迄今为止最准确的预测 poly(A) 位点的计算机程序)相比,在搜索人类序列中的 PAS-强 poly(A) 位点时,POLYAR 的预测灵敏度(80.8% 对 65.7%)和特异性(66.4% 对 51.7%)显著更高。然而,当对 PAS-弱和 PAS-无 poly(A) 位点进行类似的搜索时,两个程序的预测准确性都非常低,这表明我们对决定 poly(A) 位点的因素的了解还不足以识别这些多聚腺苷酸化区域。

结论

我们提出了一种新的 polyadenylation 位点分类方法,分为三类,并提出了一种新的计算机程序 POLYAR,用于预测每一类的 poly(A) 位点/区域。在测试中,POLYAR 显示出 PAS-强 poly(A) 位点预测的高精度,尽管该程序搜索 PAS-弱和 PAS-无 poly(A) 位点的效率不是很高,但与其他可用程序相当。这些发现表明,这些 poly(A) 位点的其他特征仍有待阐明。可从 http://cub.comsats.edu.pk/polyapredict.htm 下载 POLYAR 程序的独立版本。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验