• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

后生动物基因组中的计算机模拟miRNA预测:敏感性与特异性之间的平衡

In silico miRNA prediction in metazoan genomes: balancing between sensitivity and specificity.

作者信息

van der Burgt Ate, Fiers Mark W J E, Nap Jan-Peter, van Ham Roeland C H J

机构信息

Applied Bioinformatics, Plant Research International, Wageningen University & Research Centre, PO Box 16, 6700 AA Wageningen, The Netherlands.

出版信息

BMC Genomics. 2009 Apr 30;10:204. doi: 10.1186/1471-2164-10-204.

DOI:10.1186/1471-2164-10-204
PMID:19405940
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2688010/
Abstract

BACKGROUND

MicroRNAs (miRNAs), short approximately 21-nucleotide RNA molecules, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal or tissue-specific expression remain undiscovered. Various strategies for in silico miRNA identification have been proposed to facilitate miRNA discovery. Notably support vector machine (SVM) methods have recently gained popularity. However, a drawback of these methods is that they do not provide insight into the biological properties of miRNA sequences.

RESULTS

We here propose a new strategy for miRNA hairpin prediction in which the likelihood that a genomic hairpin is a true miRNA hairpin is evaluated based on statistical distributions of observed biological variation of properties (descriptors) of known miRNA hairpins. These distributions are transformed into a single and continuous outcome classifier called the L score. Using a dataset of known miRNA hairpins from the miRBase database and an exhaustive set of genomic hairpins identified in the genome of Caenorhabditis elegans, a subset of 18 most informative descriptors was selected after detailed analysis of correlation among and discriminative power of individual descriptors. We show that the majority of previously identified miRNA hairpins have high L scores, that the method outperforms miRNA prediction by threshold filtering and that it is more transparent than SVM classifiers.

CONCLUSION

The L score is applicable as a prediction classifier with high sensitivity for novel miRNA hairpins. The L-score approach can be used to rank and select interesting miRNA hairpin candidates for downstream experimental analysis when coupled to a genome-wide set of in silico-identified hairpins or to facilitate the analysis of large sets of putative miRNA hairpin loci obtained in deep-sequencing efforts of small RNAs. Moreover, the in-depth analyses of miRNA hairpins descriptors preceding and determining the L score outcome could be used as an extension to miRBase entries to help increase the reliability and biological relevance of the miRNA registry.

摘要

背景

微小RNA(miRNA)是长度约为21个核苷酸的短RNA分子,在基因表达的转录后调控中发挥重要作用。miRBase数据库中注册的已知miRNA发夹数量正在迅速增加,但最近的报告表明,许多具有受限时间或组织特异性表达的miRNA仍未被发现。为促进miRNA的发现,人们提出了各种基于计算机的miRNA识别策略。值得注意的是,支持向量机(SVM)方法最近受到了广泛关注。然而,这些方法的一个缺点是它们无法深入了解miRNA序列的生物学特性。

结果

我们在此提出一种新的miRNA发夹预测策略,即根据已知miRNA发夹的特性(描述符)的观察生物学变异的统计分布,评估基因组发夹是真正miRNA发夹的可能性。这些分布被转换为一个单一的连续结果分类器,称为L分数。使用来自miRBase数据库的已知miRNA发夹数据集以及在秀丽隐杆线虫基因组中鉴定出的一组详尽的基因组发夹,在详细分析各个描述符之间的相关性和判别力后,选择了18个最具信息性的描述符子集。我们表明,大多数先前鉴定的miRNA发夹具有较高的L分数,该方法优于通过阈值过滤进行的miRNA预测,并且比SVM分类器更具透明度。

结论

L分数可作为对新型miRNA发夹具有高灵敏度的预测分类器。当与全基因组范围内基于计算机识别的发夹集结合使用时,L分数方法可用于对有趣的miRNA发夹候选物进行排名和选择,以进行下游实验分析,或促进对在小RNA深度测序工作中获得的大量假定miRNA发夹位点的分析。此外,对决定L分数结果的miRNA发夹描述符的深入分析可作为miRBase条目的扩展,以帮助提高miRNA登记的可靠性和生物学相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/15cbc56a4afc/1471-2164-10-204-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/a4333c3fb695/1471-2164-10-204-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/5a624cc640f4/1471-2164-10-204-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2e64ce03d808/1471-2164-10-204-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/02a24d3228f0/1471-2164-10-204-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2b786ecf1df7/1471-2164-10-204-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2542118b049e/1471-2164-10-204-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/5a3266b78796/1471-2164-10-204-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/005a7caeb14f/1471-2164-10-204-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/d85536dc970f/1471-2164-10-204-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/39c0b334bc35/1471-2164-10-204-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/15cbc56a4afc/1471-2164-10-204-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/a4333c3fb695/1471-2164-10-204-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/5a624cc640f4/1471-2164-10-204-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2e64ce03d808/1471-2164-10-204-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/02a24d3228f0/1471-2164-10-204-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2b786ecf1df7/1471-2164-10-204-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/2542118b049e/1471-2164-10-204-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/5a3266b78796/1471-2164-10-204-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/005a7caeb14f/1471-2164-10-204-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/d85536dc970f/1471-2164-10-204-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/39c0b334bc35/1471-2164-10-204-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a144/2688010/15cbc56a4afc/1471-2164-10-204-11.jpg

相似文献

1
In silico miRNA prediction in metazoan genomes: balancing between sensitivity and specificity.后生动物基因组中的计算机模拟miRNA预测:敏感性与特异性之间的平衡
BMC Genomics. 2009 Apr 30;10:204. doi: 10.1186/1471-2164-10-204.
2
MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans.MiRFinder:一种用于全基因组快速微小RNA前体扫描的改进方法及软件实现
BMC Bioinformatics. 2007 Sep 17;8:341. doi: 10.1186/1471-2105-8-341.
3
Ensemble-based classification approach for micro-RNA mining applied on diverse metagenomic sequences.基于集成的微RNA挖掘分类方法应用于多样的宏基因组序列。
BMC Res Notes. 2014 May 6;7:286. doi: 10.1186/1756-0500-7-286.
4
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine.利用局部结构序列特征和支持向量机对真实和伪微小RNA前体进行分类
BMC Bioinformatics. 2005 Dec 29;6:310. doi: 10.1186/1471-2105-6-310.
5
Genome-wide pre-miRNA discovery from few labeled examples.从少量标记的样本中进行全基因组预 miRNA 发现。
Bioinformatics. 2018 Feb 15;34(4):541-549. doi: 10.1093/bioinformatics/btx612.
6
Computational prediction of the localization of microRNAs within their pre-miRNA.计算预测 microRNA 在其 pre-miRNA 中的定位。
Nucleic Acids Res. 2013 Aug;41(15):7200-11. doi: 10.1093/nar/gkt466. Epub 2013 Jun 8.
7
Mammalian microRNA prediction through a support vector machine model of sequence and structure.通过序列和结构的支持向量机模型预测哺乳动物微小RNA
PLoS One. 2007 Sep 26;2(9):e946. doi: 10.1371/journal.pone.0000946.
8
Identification of Schistosoma mansoni microRNAs.鉴定曼氏血吸虫 microRNAs。
BMC Genomics. 2011 Jan 19;12:47. doi: 10.1186/1471-2164-12-47.
9
De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures.利用全局和内在折叠度量对来自基因组假发夹结构的前体微小RNA进行从头支持向量机分类。
Bioinformatics. 2007 Jun 1;23(11):1321-30. doi: 10.1093/bioinformatics/btm026. Epub 2007 Jan 31.
10
MiRTif: a support vector machine-based microRNA target interaction filter.MiRTif:一种基于支持向量机的微小RNA靶标相互作用筛选工具
BMC Bioinformatics. 2008 Dec 12;9 Suppl 12(Suppl 12):S4. doi: 10.1186/1471-2105-9-S12-S4.

引用本文的文献

1
On the performance of pre-microRNA detection algorithms.论前体微小RNA检测算法的性能
Nat Commun. 2017 Aug 24;8(1):330. doi: 10.1038/s41467-017-00403-z.
2
Delineating the impact of machine learning elements in pre-microRNA detection.描绘机器学习元素在pre-miRNA检测中的影响。
PeerJ. 2017 Mar 29;5:e3131. doi: 10.7717/peerj.3131. eCollection 2017.
3
MicroRNA categorization using sequence motifs and k-mers.使用序列基序和k-mer对微小RNA进行分类。

本文引用的文献

1
MicroRNA profile of Marek's disease virus-transformed T-cell line MSB-1: predominance of virus-encoded microRNAs.马立克氏病病毒转化的T细胞系MSB-1的微小RNA谱:病毒编码微小RNA占主导地位
J Virol. 2008 Apr;82(8):4007-15. doi: 10.1128/JVI.02659-07. Epub 2008 Feb 6.
2
Functionally distinct regulatory RNAs generated by bidirectional transcription and processing of microRNA loci.由微小RNA基因座双向转录和加工产生的功能不同的调控RNA。
Genes Dev. 2008 Jan 1;22(1):26-36. doi: 10.1101/gad.1615208.
3
A single Hox locus in Drosophila produces functional microRNAs from opposite DNA strands.
BMC Bioinformatics. 2017 Mar 14;18(1):170. doi: 10.1186/s12859-017-1584-1.
4
Prediction and characterization of microRNAs from eleven fish species by computational methods.利用计算方法对十一种鱼类的微小RNA进行预测与表征。
Saudi J Biol Sci. 2015 Jul;22(4):374-81. doi: 10.1016/j.sjbs.2014.10.005. Epub 2014 Oct 23.
5
Fast selection of miRNA candidates based on large-scale pre-computed MFE sets of randomized sequences.基于大规模预先计算的随机序列最小自由能(MFE)集快速选择微小RNA(miRNA)候选序列
BMC Res Notes. 2014 Jan 13;7:34. doi: 10.1186/1756-0500-7-34.
6
Computational methods for ab initio detection of microRNAs.从头预测 microRNAs 的计算方法。
Front Genet. 2012 Oct 10;3:209. doi: 10.3389/fgene.2012.00209. eCollection 2012.
7
Discovery of hundreds of mirtrons in mouse and human small RNA data.在小鼠和人类小 RNA 数据中发现数百个 mirtrons。
Genome Res. 2012 Sep;22(9):1634-45. doi: 10.1101/gr.133553.111.
8
A fast ab-initio method for predicting miRNA precursors in genomes.一种快速从头预测基因组中 miRNA 前体的方法。
Nucleic Acids Res. 2012 Jun;40(11):e80. doi: 10.1093/nar/gks146. Epub 2012 Feb 22.
9
Mirtrons: microRNA biogenesis via splicing.Mirtrons:通过剪接进行 microRNA 生物发生。
Biochimie. 2011 Nov;93(11):1897-904. doi: 10.1016/j.biochi.2011.06.017. Epub 2011 Jun 21.
10
Drosha processing controls the specificity and efficiency of global microRNA expression.Drosha加工过程控制着整体微小RNA表达的特异性和效率。
Biochim Biophys Acta. 2011 Nov-Dec;1809(11-12):700-7. doi: 10.1016/j.bbagrm.2011.05.015. Epub 2011 Jun 13.
果蝇中的单个Hox基因座从相反的DNA链产生功能性微小RNA。
Genes Dev. 2008 Jan 1;22(1):8-13. doi: 10.1101/gad.1613108.
4
Intragenomic matching reveals a huge potential for miRNA-mediated regulation in plants.基因组内匹配揭示了植物中miRNA介导调控的巨大潜力。
PLoS Comput Biol. 2007 Nov;3(11):e238. doi: 10.1371/journal.pcbi.0030238.
5
Systematic identification of C. elegans miRISC proteins, miRNAs, and mRNA targets by their interactions with GW182 proteins AIN-1 and AIN-2.通过与GW182蛋白AIN-1和AIN-2的相互作用对秀丽隐杆线虫miRISC蛋白、miRNA和mRNA靶标进行系统鉴定。
Mol Cell. 2007 Nov 30;28(4):598-613. doi: 10.1016/j.molcel.2007.09.014.
6
Ensembl 2008.Ensembl 2008。
Nucleic Acids Res. 2008 Jan;36(Database issue):D707-14. doi: 10.1093/nar/gkm988. Epub 2007 Nov 13.
7
Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes.利用12种果蝇基因组对果蝇微小RNA进行系统发现与特征分析。
Genome Res. 2007 Dec;17(12):1865-79. doi: 10.1101/gr.6593807. Epub 2007 Nov 7.
8
Principles and limitations of computational microRNA gene and target finding.计算方法寻找微小RNA基因及其靶标的原理与局限性
DNA Cell Biol. 2007 May;26(5):339-51. doi: 10.1089/dna.2006.0551.
9
Origin and evolution of human microRNAs from transposable elements.人类微小RNA源自转座元件的起源与进化
Genetics. 2007 Jun;176(2):1323-37. doi: 10.1534/genetics.107.072553. Epub 2007 Apr 15.
10
A novel microarray approach reveals new tissue-specific signatures of known and predicted mammalian microRNAs.一种新型微阵列方法揭示了已知和预测的哺乳动物微小RNA的新组织特异性特征。
Nucleic Acids Res. 2007;35(7):e52. doi: 10.1093/nar/gkl1118. Epub 2007 Mar 13.