Kandhro Abdul H, Shoombuatong Watshara, Nantasenamat Chanin, Prachayasittikul Virapong, Nuchnoi Pornlada
Center for Research and Innovation, Faculty of Medical Technology, Mahidol UniversityBangkok, Thailand.
Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol UniversityBangkok, Thailand.
Front Genet. 2017 Sep 22;8:116. doi: 10.3389/fgene.2017.00116. eCollection 2017.
Dyslipidemia is one of the major forms of lipid disorder, characterized by increased triglycerides (TGs), increased low-density lipoprotein-cholesterol (LDL-C), and decreased high-density lipoprotein-cholesterol (HDL-C) levels in blood. Recently, MicroRNAs (miRNAs) have been reported to involve in various biological processes; their potential usage being a biomarkers and in diagnosis of various diseases. Computational approaches including text mining have been used recently to analyze abstracts from the public databases to observe the relationships/associations between the biological molecules, miRNAs, and disease phenotypes. In the present study, significance of text mined extracted pair associations (miRNA-lipid disease) were estimated by one-sided Fisher's exact test. The top 20 significant miRNA-disease associations were visualized on Cytoscape. The CyTargetLinker plug-in tool on Cytoscape was used to extend the network and predicts new miRNA target genes. The Biological Networks Gene Ontology (BiNGO) plug-in tool on Cytoscape was used to retrieve gene ontology (GO) annotations for the targeted genes. We retrieved 227 miRNA-lipid disease associations including 148 miRNAs. The top 20 significant miRNAs analysis on CyTargetLinker provides defined, predicted and validated gene targets, further targeted genes analyzed by BiNGO showed targeted genes were significantly associated with lipid, cholesterol, apolipoprotein, and fatty acids GO terms. We are the first to provide a reliable miRNA-lipid disease association network based on text mining. This could help future experimental studies that aim to validate predicted gene targets.
血脂异常是脂质紊乱的主要形式之一,其特征是血液中甘油三酯(TGs)升高、低密度脂蛋白胆固醇(LDL-C)升高以及高密度脂蛋白胆固醇(HDL-C)水平降低。最近,据报道微小RNA(miRNAs)参与各种生物学过程;它们作为生物标志物和用于各种疾病诊断的潜在用途。包括文本挖掘在内的计算方法最近已被用于分析来自公共数据库的摘要,以观察生物分子、miRNAs和疾病表型之间的关系/关联。在本研究中,通过单侧Fisher精确检验估计文本挖掘提取的配对关联(miRNA-脂质疾病)的显著性。前20个显著的miRNA-疾病关联在Cytoscape上可视化。使用Cytoscape上的CyTargetLinker插件工具扩展网络并预测新的miRNA靶基因。使用Cytoscape上的生物网络基因本体(BiNGO)插件工具检索靶基因的基因本体(GO)注释。我们检索到227个miRNA-脂质疾病关联,包括148个miRNAs。在CyTargetLinker上对前20个显著miRNAs的分析提供了定义的、预测的和经过验证的基因靶标,通过BiNGO进一步分析的靶基因显示靶基因与脂质、胆固醇、载脂蛋白和脂肪酸GO术语显著相关。我们首次基于文本挖掘提供了一个可靠的miRNA-脂质疾病关联网络。这可能有助于未来旨在验证预测基因靶标的实验研究。