• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发和验证一种基于人工智能的系统,用于自动文献分析和分子调控网络构建。

Development and Validation of an AI-Driven System for Automatic Literature Analysis and Molecular Regulatory Network Construction.

机构信息

Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, 430060, P. R. China.

Hubei Key Laboratory of Digestive System, Renmin Hospital of Wuhan University, Wuhan, Hubei, 430060, P. R. China.

出版信息

Adv Sci (Weinh). 2024 Nov;11(44):e2405395. doi: 10.1002/advs.202405395. Epub 2024 Oct 7.

DOI:10.1002/advs.202405395
PMID:39373342
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11600262/
Abstract

Decoding gene regulatory networks is essential for understanding the mechanisms underlying many complex diseases. GENET is developed, an automated system designed to extract and visualize extensive molecular relationships from published biomedical literature. Using natural language processing, entities and relations are identified from a randomly selected set of 1788 scientific articles, and visualized in a filterable knowledge graph. The performance of GENET is evaluated and compared with existing methods. The named entity recognition model has achieved an overall precision of 94.23% (4835/5131; 93.56-94.84%), recall of 97.72% (4835/4948; 97.27-98.10%), and an F1 score of 95.94%. The relation extraction model has demonstrated an overall precision of 91.63% (2593/2830; 90.55-92.59%), recall of 89.17% (2593/2908; 87.99-90.25%), and an F1 score of 90.38%. GENET significantly outperforms existing methods in extracting molecular relationships (P < 0.001). Additionally, GENET has successfully predicted WNT family member 4 regulates insulin-like growth factor 2 via signal transducer and activator of transcription 3 in colon cancer. With RNA sequencing data and multiple immunofluorescence, the authenticity of this prediction is validated, supporting the promising feasibility of GENET.

摘要

解码基因调控网络对于理解许多复杂疾病的机制至关重要。本文开发了一个名为 GENET 的自动化系统,旨在从已发表的生物医学文献中提取和可视化广泛的分子关系。该系统使用自然语言处理技术,从随机选择的 1788 篇科学文章中识别实体和关系,并以可过滤的知识图谱形式可视化。评估并比较了 GENET 的性能与现有方法。命名实体识别模型的整体精度为 94.23%(4835/5131;93.56-94.84%),召回率为 97.72%(4835/4948;97.27-98.10%),F1 得分为 95.94%。关系提取模型的整体精度为 91.63%(2593/2830;90.55-92.59%),召回率为 89.17%(2593/2908;87.99-90.25%),F1 得分为 90.38%。GENET 在提取分子关系方面明显优于现有方法(P<0.001)。此外,GENET 成功预测了 WNT 家族成员 4 通过信号转导和转录激活因子 3 调节结肠癌中的胰岛素样生长因子 2。使用 RNA 测序数据和多种免疫荧光技术,验证了这一预测的真实性,支持了 GENET 的有前途的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/635281541a93/ADVS-11-2405395-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/2852f8c048a1/ADVS-11-2405395-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/4ca2862e7f4e/ADVS-11-2405395-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/635281541a93/ADVS-11-2405395-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/2852f8c048a1/ADVS-11-2405395-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/4ca2862e7f4e/ADVS-11-2405395-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6a7/11600262/635281541a93/ADVS-11-2405395-g004.jpg

相似文献

1
Development and Validation of an AI-Driven System for Automatic Literature Analysis and Molecular Regulatory Network Construction.开发和验证一种基于人工智能的系统,用于自动文献分析和分子调控网络构建。
Adv Sci (Weinh). 2024 Nov;11(44):e2405395. doi: 10.1002/advs.202405395. Epub 2024 Oct 7.
2
An Automatic and End-to-End System for Rare Disease Knowledge Graph Construction Based on Ontology-Enhanced Large Language Models: Development Study.基于本体增强大语言模型的罕见病知识图谱构建自动端到端系统:开发研究
JMIR Med Inform. 2024 Dec 18;12:e60665. doi: 10.2196/60665.
3
BioGSF: a graph-driven semantic feature integration framework for biomedical relation extraction.BioGSF:一种用于生物医学关系提取的图驱动语义特征集成框架。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf025.
4
Enhancing Thyroid Pathology With Artificial Intelligence: Automated Data Extraction From Electronic Health Reports Using RUBY.利用人工智能增强甲状腺病理学:使用RUBY从电子健康报告中自动提取数据
JCO Clin Cancer Inform. 2024 Dec;8:e2300263. doi: 10.1200/CCI.23.00263. Epub 2024 Dec 10.
5
Towards discovery: an end-to-end system for uncovering novel biomedical relations.探索之路:一个端到端的系统,用于揭示新的生物医学关系。
Database (Oxford). 2024 Jul 11;2024. doi: 10.1093/database/baae057.
6
A Text Mining Protocol for Mining Biological Pathways and Regulatory Networks from Biomedical Literature.一种从生物医学文献中挖掘生物途径和调控网络的文本挖掘协议。
Methods Mol Biol. 2022;2496:141-157. doi: 10.1007/978-1-0716-2305-3_8.
7
BioCreative VI Precision Medicine Track system performance is constrained by entity recognition and variations in corpus characteristics.生物创意 VI 精准医疗轨道系统的性能受到实体识别和语料库特征变化的限制。
Database (Oxford). 2018 Jan 1;2018:bay122. doi: 10.1093/database/bay122.
8
Text mining biomedical literature for constructing gene regulatory networks.从生物医学文献中挖掘文本构建基因调控网络。
Interdiscip Sci. 2009 Sep;1(3):179-86. doi: 10.1007/s12539-009-0028-7. Epub 2009 Aug 7.
9
Automatic extraction of transcriptional regulatory interactions of bacteria from biomedical literature using a BERT-based approach.基于 BERT 的方法从生物医学文献中自动提取细菌的转录调控相互作用。
Database (Oxford). 2024 Aug 30;2024. doi: 10.1093/database/baae094.
10
Combining relation extraction with function detection for BEL statement extraction.将关系抽取与功能检测相结合进行 BEL 语句抽取。
Database (Oxford). 2019 Jan 1;2019:bay133. doi: 10.1093/database/bay133.

引用本文的文献

1
BioGSF: a graph-driven semantic feature integration framework for biomedical relation extraction.BioGSF:一种用于生物医学关系提取的图驱动语义特征集成框架。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf025.