• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用医学在线(MEDLINE)摘要的潜在语义索引对微小RNA进行优先级排序、聚类和功能注释。

Prioritization, clustering and functional annotation of MicroRNAs using latent semantic indexing of MEDLINE abstracts.

作者信息

Roy Sujoy, Curry Brandon C, Madahian Behrouz, Homayouni Ramin

机构信息

Bioinformatics Program, University of Memphis, Memphis, 38152, USA.

Center for Translational Informatics, University of Memphis, Memphis, 38152, USA.

出版信息

BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):350. doi: 10.1186/s12859-016-1223-2.

DOI:10.1186/s12859-016-1223-2
PMID:27766940
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5073981/
Abstract

BACKGROUND

The amount of scientific information about MicroRNAs (miRNAs) is growing exponentially, making it difficult for researchers to interpret experimental results. In this study, we present an automated text mining approach using Latent Semantic Indexing (LSI) for prioritization, clustering and functional annotation of miRNAs.

RESULTS

For approximately 900 human miRNAs indexed in miRBase, text documents were created by concatenating titles and abstracts of MEDLINE citations which refer to the miRNAs. The documents were parsed and a weighted term-by-miRNA frequency matrix was created, which was subsequently factorized via singular value decomposition to extract pair-wise cosine values between the term (keyword) and miRNA vectors in reduced rank semantic space. LSI enables derivation of both explicit and implicit associations between entities based on word usage patterns. Using miR2Disease as a gold standard, we found that LSI identified keyword-to-miRNA relationships with high accuracy. In addition, we demonstrate that pair-wise associations between miRNAs can be used to group them into categories which are functionally aligned. Finally, term ranking by querying the LSI space with a group of miRNAs enabled annotation of the clusters with functionally related terms.

CONCLUSIONS

LSI modeling of MEDLINE abstracts provides a robust and automated method for miRNA related knowledge discovery. The latest collection of miRNA abstracts and LSI model can be accessed through the web tool miRNA Literature Network (miRLiN) at http://bioinfo.memphis.edu/mirlin .

摘要

背景

关于微小RNA(miRNA)的科学信息量呈指数级增长,这使得研究人员难以解读实验结果。在本研究中,我们提出了一种使用潜在语义索引(LSI)的自动化文本挖掘方法,用于miRNA的优先级排序、聚类和功能注释。

结果

对于miRBase中索引的约900个人类miRNA,通过拼接引用这些miRNA的MEDLINE文献的标题和摘要创建了文本文件。对这些文件进行解析,并创建了一个加权的词- miRNA频率矩阵,随后通过奇异值分解对其进行分解,以提取降维语义空间中词(关键词)和miRNA向量之间的成对余弦值。LSI能够基于词的使用模式推导实体之间的显式和隐式关联。以miR2Disease作为金标准,我们发现LSI能高精度地识别关键词与miRNA的关系。此外,我们证明了miRNA之间的成对关联可用于将它们分组到功能上一致的类别中。最后,通过用一组miRNA查询LSI空间进行词排序,能够用功能相关的词对聚类进行注释。

结论

MEDLINE摘要的LSI建模为miRNA相关知识发现提供了一种强大的自动化方法。最新的miRNA摘要集合和LSI模型可通过网络工具miRNA文献网络(miRLiN)在http://bioinfo.memphis.edu/mirlin上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/8372a41ea955/12859_2016_1223_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/9fa221dbf1d4/12859_2016_1223_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/a743bd0d5bcf/12859_2016_1223_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/344065240480/12859_2016_1223_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/c737b8f32ed8/12859_2016_1223_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/ab867ac8029f/12859_2016_1223_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/6083533ca5b8/12859_2016_1223_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/8372a41ea955/12859_2016_1223_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/9fa221dbf1d4/12859_2016_1223_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/a743bd0d5bcf/12859_2016_1223_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/344065240480/12859_2016_1223_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/c737b8f32ed8/12859_2016_1223_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/ab867ac8029f/12859_2016_1223_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/6083533ca5b8/12859_2016_1223_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bc3/5073981/8372a41ea955/12859_2016_1223_Fig7_HTML.jpg

相似文献

1
Prioritization, clustering and functional annotation of MicroRNAs using latent semantic indexing of MEDLINE abstracts.使用医学在线(MEDLINE)摘要的潜在语义索引对微小RNA进行优先级排序、聚类和功能注释。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):350. doi: 10.1186/s12859-016-1223-2.
2
Gene clustering by latent semantic indexing of MEDLINE abstracts.通过MEDLINE摘要的潜在语义索引进行基因聚类。
Bioinformatics. 2005 Jan 1;21(1):104-15. doi: 10.1093/bioinformatics/bth464. Epub 2004 Aug 12.
3
Latent Semantic Indexing of PubMed abstracts for identification of transcription factor candidates from microarray derived gene sets.基于PubMed 摘要的潜在语义索引从微阵列基因集中识别转录因子候选物。
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S19. doi: 10.1186/1471-2105-12-S10-S19.
4
Software Suite for Gene and Protein Annotation Prediction and Similarity Search.用于基因和蛋白质注释预测及相似性搜索的软件套件。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):837-43. doi: 10.1109/TCBB.2014.2382127.
5
Text Influenced Molecular Indexing (TIMI): a literature database mining approach that handles text and chemistry.文本影响分子索引(TIMI):一种处理文本和化学信息的文献数据库挖掘方法。
J Chem Inf Comput Sci. 2003 May-Jun;43(3):743-52. doi: 10.1021/ci025587a.
6
miRBaseConverter: an R/Bioconductor package for converting and retrieving miRNA name, accession, sequence and family information in different versions of miRBase.miRBaseConverter:一个 R/Bioconductor 软件包,用于在不同版本的 miRBase 中转换和检索 miRNA 名称、标识符、序列和家族信息。
BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):514. doi: 10.1186/s12859-018-2531-5.
7
EmDL: Extracting miRNA-Drug Interactions from Literature.EmDL:从文献中提取 miRNA-药物相互作用。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1722-1728. doi: 10.1109/TCBB.2017.2723394. Epub 2017 Jul 6.
8
miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.miRiaD:一种用于检测微小RNA与疾病关联的文本挖掘工具。
J Biomed Semantics. 2016 Apr 29;7(1):9. doi: 10.1186/s13326-015-0044-y.
9
miRTex: A Text Mining System for miRNA-Gene Relation Extraction.miRTex:一种用于提取miRNA与基因关系的文本挖掘系统。
PLoS Comput Biol. 2015 Sep 25;11(9):e1004391. doi: 10.1371/journal.pcbi.1004391. eCollection 2015.
10
miRClassify: an advanced web server for miRNA family classification and annotation.miRClassify:一个用于 miRNA 家族分类和注释的高级网络服务器。
Comput Biol Med. 2014 Feb;45:157-60. doi: 10.1016/j.compbiomed.2013.12.007. Epub 2013 Dec 21.

引用本文的文献

1
Large-scale identification of social and behavioral determinants of health from clinical notes: comparison of Latent Semantic Indexing and Generative Pretrained Transformer (GPT) models.从临床记录中大规模识别健康的社会和行为决定因素:潜在语义索引和生成式预训练转换器 (GPT) 模型的比较。
BMC Med Inform Decis Mak. 2024 Oct 10;24(1):296. doi: 10.1186/s12911-024-02705-x.
2
Literature Mining of Disease Associated Noncoding RNA in the Omics Era.组学时代疾病相关非编码 RNA 的文献挖掘。
Molecules. 2022 Jul 23;27(15):4710. doi: 10.3390/molecules27154710.
3
Evaluation of Sirtuin-3 probe quality and co-expressed genes using literature cohesion.

本文引用的文献

1
miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases.miRiaD:一种用于检测微小RNA与疾病关联的文本挖掘工具。
J Biomed Semantics. 2016 Apr 29;7(1):9. doi: 10.1186/s13326-015-0044-y.
2
Macrophage miRNAs in atherosclerosis.动脉粥样硬化中的巨噬细胞微小RNA
Biochim Biophys Acta. 2016 Dec;1861(12 Pt B):2087-2093. doi: 10.1016/j.bbalip.2016.02.006. Epub 2016 Feb 18.
3
miRNA regulation of white and brown adipose tissue differentiation and function.微小RNA对白色和棕色脂肪组织分化及功能的调控
使用文献内聚度评价 Sirtuin-3 探针质量和共表达基因。
BMC Bioinformatics. 2019 Mar 14;20(Suppl 2):104. doi: 10.1186/s12859-019-2621-z.
4
Navigating the Functional Landscape of Transcription Factors via Non-Negative Tensor Factorization Analysis of MEDLINE Abstracts.通过对MEDLINE摘要进行非负张量分解分析来探索转录因子的功能格局
Front Bioeng Biotechnol. 2017 Aug 28;5:48. doi: 10.3389/fbioe.2017.00048. eCollection 2017.
5
An index-based algorithm for fast on-line query processing of latent semantic analysis.一种基于索引的用于潜在语义分析快速在线查询处理的算法。
PLoS One. 2017 May 16;12(5):e0177523. doi: 10.1371/journal.pone.0177523. eCollection 2017.
6
Proceedings of the 2016 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) Conference.2016年中南计算生物学与生物信息学学会(MCBIOS)会议论文集
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):356. doi: 10.1186/s12859-016-1213-4.
Biochim Biophys Acta. 2016 Dec;1861(12 Pt B):2104-2110. doi: 10.1016/j.bbalip.2016.02.010. Epub 2016 Feb 16.
4
Targeting MicroRNA Function in Respiratory Diseases: Mini-Review.针对呼吸道疾病中微小RNA功能的综述
Front Physiol. 2016 Feb 4;7:21. doi: 10.3389/fphys.2016.00021. eCollection 2016.
5
miRNAs and High-Density Lipoprotein metabolism.微小RNA与高密度脂蛋白代谢
Biochim Biophys Acta. 2016 Dec;1861(12 Pt B):2053-2061. doi: 10.1016/j.bbalip.2016.01.021. Epub 2016 Feb 9.
6
MicroRNAs as regulators of endothelial cell functions in cardiometabolic diseases.微小RNA作为心脏代谢疾病中内皮细胞功能的调节因子
Biochim Biophys Acta. 2016 Dec;1861(12 Pt B):2094-2103. doi: 10.1016/j.bbalip.2016.01.013. Epub 2016 Jan 26.
7
miRNA and cholesterol homeostasis.微小RNA与胆固醇稳态
Biochim Biophys Acta. 2016 Dec;1861(12 Pt B):2041-2046. doi: 10.1016/j.bbalip.2016.01.005. Epub 2016 Jan 15.
8
miRTex: A Text Mining System for miRNA-Gene Relation Extraction.miRTex:一种用于提取miRNA与基因关系的文本挖掘系统。
PLoS Comput Biol. 2015 Sep 25;11(9):e1004391. doi: 10.1371/journal.pcbi.1004391. eCollection 2015.
9
miRBase: annotating high confidence microRNAs using deep sequencing data.miRBase:利用深度测序数据注释高可信度 microRNAs。
Nucleic Acids Res. 2014 Jan;42(Database issue):D68-73. doi: 10.1093/nar/gkt1181. Epub 2013 Nov 25.
10
miRCancer: a microRNA-cancer association database constructed by text mining on literature.miRCancer:通过对文献进行文本挖掘构建的 miRNA-癌症关联数据库。
Bioinformatics. 2013 Mar 1;29(5):638-44. doi: 10.1093/bioinformatics/btt014. Epub 2013 Jan 16.