• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于总结微小RNA在调节mRNA表达中作用的大型语言模型与基因组学

Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression.

作者信息

Bhasuran Balu, Manoharan Sharanya, Iyyappan Oviya Ramalakshmi, Murugesan Gurusamy, Prabahar Archana, Raja Kalpana

机构信息

School of Information, Florida State University, Tallahassee, FL 32306, USA.

Department of Bioinformatics, Stella Maris College, Chennai 600086, Tamil Nadu, India.

出版信息

Biomedicines. 2024 Jul 10;12(7):1535. doi: 10.3390/biomedicines12071535.

DOI:10.3390/biomedicines12071535
PMID:39062108
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11274411/
Abstract

microRNA (miRNA)-messenger RNA (mRNA or gene) interactions are pivotal in various biological processes, including the regulation of gene expression, cellular differentiation, proliferation, apoptosis, and development, as well as the maintenance of cellular homeostasis and pathogenesis of numerous diseases, such as cancer, cardiovascular diseases, neurological disorders, and metabolic conditions. Understanding the mechanisms of miRNA-mRNA interactions can provide insights into disease mechanisms and potential therapeutic targets. However, extracting these interactions efficiently from a huge collection of published articles in PubMed is challenging. In the current study, we annotated a miRNA-mRNA Interaction Corpus (MMIC) and used it for evaluating the performance of a variety of machine learning (ML) models, deep learning-based transformer (DLT) models, and large language models (LLMs) in extracting the miRNA-mRNA interactions mentioned in PubMed. We used the genomics approaches for validating the extracted miRNA-mRNA interactions. Among the ML, DLT, and LLM models, PubMedBERT showed the highest precision, recall, and F-score, with all equal to 0.783. Among the LLM models, the performance of Llama-2 is better when compared to others. Llama 2 achieved 0.56 precision, 0.86 recall, and 0.68 F-score in a zero-shot experiment and 0.56 precision, 0.87 recall, and 0.68 F-score in a three-shot experiment. Our study shows that Llama 2 achieves better recall than ML and DLT models and leaves space for further improvement in terms of precision and F-score.

摘要

微小RNA(miRNA)与信使核糖核酸(mRNA或基因)的相互作用在各种生物过程中起着关键作用,包括基因表达调控、细胞分化、增殖、凋亡和发育,以及细胞稳态的维持和众多疾病(如癌症、心血管疾病、神经疾病和代谢疾病)的发病机制。了解miRNA与mRNA相互作用的机制可以为疾病机制和潜在治疗靶点提供见解。然而,从PubMed上大量已发表文章中高效提取这些相互作用具有挑战性。在本研究中,我们注释了一个miRNA - mRNA相互作用语料库(MMIC),并将其用于评估各种机器学习(ML)模型、基于深度学习的Transformer(DLT)模型和大语言模型(LLM)在提取PubMed中提到的miRNA - mRNA相互作用方面的性能。我们使用基因组学方法来验证提取的miRNA - mRNA相互作用。在ML、DLT和LLM模型中,PubMedBERT的精确率、召回率和F值最高,均为0.783。在LLM模型中,Llama - 2的性能比其他模型更好。Llama 2在零样本实验中实现了0.56的精确率、0.86的召回率和0.68的F值,在三样本实验中实现了0.56的精确率、0.87的召回率和0.68的F值。我们的研究表明,Llama 2的召回率比ML和DLT模型更好,在精确率和F值方面还有进一步改进的空间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/8b1d7ef9c25c/biomedicines-12-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/f5716b1faad6/biomedicines-12-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/638be7c5c388/biomedicines-12-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/615a86539613/biomedicines-12-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/a19502930be8/biomedicines-12-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/b29d6ce34d22/biomedicines-12-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/4d9b1dd9c6f3/biomedicines-12-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/8b1d7ef9c25c/biomedicines-12-01535-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/f5716b1faad6/biomedicines-12-01535-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/638be7c5c388/biomedicines-12-01535-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/615a86539613/biomedicines-12-01535-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/a19502930be8/biomedicines-12-01535-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/b29d6ce34d22/biomedicines-12-01535-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/4d9b1dd9c6f3/biomedicines-12-01535-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1f1/11274411/8b1d7ef9c25c/biomedicines-12-01535-g007.jpg

相似文献

1
Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression.用于总结微小RNA在调节mRNA表达中作用的大型语言模型与基因组学
Biomedicines. 2024 Jul 10;12(7):1535. doi: 10.3390/biomedicines12071535.
2
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing: Algorithm Development and Validation Study.零样本临床自然语言处理中大型语言模型提示策略的实证评估:算法开发与验证研究
JMIR Med Inform. 2024 Apr 8;12:e55318. doi: 10.2196/55318.
3
Few-Shot Learning for Clinical Natural Language Processing Using Siamese Neural Networks: Algorithm Development and Validation Study.使用暹罗神经网络的临床自然语言处理少样本学习:算法开发与验证研究
JMIR AI. 2023 May 4;2:e44293. doi: 10.2196/44293.
4
A comprehensive evaluation of large Language models on benchmark biomedical text processing tasks.对基准生物医学文本处理任务中大型语言模型的全面评估。
Comput Biol Med. 2024 Mar;171:108189. doi: 10.1016/j.compbiomed.2024.108189. Epub 2024 Feb 20.
5
Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction.模型调优还是提示调优?大型语言模型在临床概念和关系抽取中的应用研究。
J Biomed Inform. 2024 May;153:104630. doi: 10.1016/j.jbi.2024.104630. Epub 2024 Mar 26.
6
Evaluating Large Language Models in Extracting Cognitive Exam Dates and Scores.评估大型语言模型在提取认知测试日期和分数方面的能力。
medRxiv. 2024 Feb 13:2023.07.10.23292373. doi: 10.1101/2023.07.10.23292373.
7
TEC-miTarget: enhancing microRNA target prediction based on deep learning of ribonucleic acid sequences.TEC-miTarget:基于 RNA 序列深度学习的 miRNA 靶基因预测增强方法。
BMC Bioinformatics. 2024 Apr 20;25(1):159. doi: 10.1186/s12859-024-05780-z.
8
Quality of Answers of Generative Large Language Models Versus Peer Users for Interpreting Laboratory Test Results for Lay Patients: Evaluation Study.生成式大语言模型与同行用户对解释非专业患者实验室检测结果的答案质量比较:评估研究。
J Med Internet Res. 2024 Apr 17;26:e56655. doi: 10.2196/56655.
9
Triage Performance Across Large Language Models, ChatGPT, and Untrained Doctors in Emergency Medicine: Comparative Study.分诊表现比较:大型语言模型、ChatGPT 和未经训练的急诊医生:一项对比研究。
J Med Internet Res. 2024 Jun 14;26:e53297. doi: 10.2196/53297.
10
Machine Learning Techniques in Exploring MicroRNA Gene Discovery, Targets, and Functions.探索微小RNA基因发现、靶点及功能中的机器学习技术
Methods Mol Biol. 2017;1617:211-224. doi: 10.1007/978-1-4939-7046-9_16.

引用本文的文献

1
Exploring the Role of microRNAs as Blood Biomarkers in Alzheimer's Disease and Frontotemporal Dementia.探索微小RNA作为阿尔茨海默病和额颞叶痴呆血液生物标志物的作用。
Int J Mol Sci. 2025 Apr 5;26(7):3399. doi: 10.3390/ijms26073399.

本文引用的文献

1
A Study of Biomedical Relation Extraction Using GPT Models.一项使用GPT模型进行生物医学关系提取的研究。
AMIA Jt Summits Transl Sci Proc. 2024 May 31;2024:391-400. eCollection 2024.
2
RDscan: Extracting RNA-disease relationship from the literature based on pre-training model.RDscan:基于预训练模型从文献中提取 RNA-疾病关系。
Methods. 2024 Aug;228:48-54. doi: 10.1016/j.ymeth.2024.05.012. Epub 2024 May 22.
3
Self-distillation framework for document-level relation extraction in low-resource environments.低资源环境下文档级关系抽取的自蒸馏框架
PeerJ Comput Sci. 2024 Mar 29;10:e1930. doi: 10.7717/peerj-cs.1930. eCollection 2024.
4
Open-source large language models in action: A bioinformatics chatbot for PRIDE database.开源大型语言模型应用实例:PRIDE 数据库的生物信息学聊天机器人。
Proteomics. 2024 Nov;24(21-22):e2400005. doi: 10.1002/pmic.202400005. Epub 2024 Mar 31.
5
Model tuning or prompt Tuning? a study of large language models for clinical concept and relation extraction.模型调优还是提示调优?大型语言模型在临床概念和关系抽取中的应用研究。
J Biomed Inform. 2024 May;153:104630. doi: 10.1016/j.jbi.2024.104630. Epub 2024 Mar 26.
6
Serial KinderMiner (SKiM) discovers and annotates biomedical knowledge using co-occurrence and transformer models.使用共现和转换器模型,串行 KinderMiner (SKiM) 发现和注释生物医学知识。
BMC Bioinformatics. 2023 Nov 1;24(1):412. doi: 10.1186/s12859-023-05539-y.
7
Evaluating ChatGPT-3.5 and Claude-2 in Answering and Explaining Conceptual Medical Physiology Multiple-Choice Questions.评估ChatGPT-3.5和Claude-2在回答和解释概念性医学生理学选择题方面的表现。
Cureus. 2023 Sep 29;15(9):e46222. doi: 10.7759/cureus.46222. eCollection 2023 Sep.
8
Revisiting Relation Extraction in the era of Large Language Models.重访大语言模型时代的关系抽取
Proc Conf Assoc Comput Linguist Meet. 2023 Jul;2023:15566-15589. doi: 10.18653/v1/2023.acl-long.868.
9
Relation extraction: advancements through deep learning and entity-related features.关系抽取:通过深度学习和实体相关特征取得的进展。
Soc Netw Anal Min. 2023;13(1):92. doi: 10.1007/s13278-023-01095-8. Epub 2023 Jun 10.
10
Extraction of microRNA-target interaction sentences from biomedical literature by deep learning approach.通过深度学习方法从生物医学文献中提取微小RNA-靶标相互作用句子。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac497.