大规模自动化机器阅读发现新的致癌驱动机制。

Large-scale automated machine reading discovers new cancer-driving mechanisms.

机构信息

Department of Computer Science, University of Arizona, Tucson, AZ, USA.

School of Medicine, Oregon Health & Science University, Portland, OR, USA.

出版信息

Database (Oxford). 2018 Jan 1;2018:bay098. doi: 10.1093/database/bay098.

DOI:10.1093/database/bay098

PMID:30256986

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6156821/

Abstract

PubMed, a repository and search engine for biomedical literature, now indexes >1 million articles each year. This exceeds the processing capacity of human domain experts, limiting our ability to truly understand many diseases. We present Reach, a system for automated, large-scale machine reading of biomedical papers that can extract mechanistic descriptions of biological processes with relatively high precision at high throughput. We demonstrate that combining the extracted pathway fragments with existing biological data analysis algorithms that rely on curated models helps identify and explain a large number of previously unidentified mutually exclusive altered signaling pathways in seven different cancer types. This work shows that combining human-curated 'big mechanisms' with extracted 'big data' can lead to a causal, predictive understanding of cellular processes and unlock important downstream applications.

摘要

PubMed 是一个生物医学文献的存储库和搜索引擎，现在每年索引超过 100 万篇文章。这超过了人类领域专家的处理能力，限制了我们真正理解许多疾病的能力。我们提出了 Reach，这是一个用于自动、大规模机器阅读生物医学论文的系统，可以以相对较高的精度和高通量提取生物过程的机制描述。我们证明，将提取的途径片段与依赖于精心设计模型的现有生物数据分析算法相结合，有助于识别和解释七种不同癌症类型中大量以前未被识别的相互排斥的改变的信号通路。这项工作表明，将人类精心设计的“大机制”与提取的“大数据”相结合，可以导致对细胞过程的因果预测理解，并解锁重要的下游应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c83/6156821/e57737064419/bay098f1.jpg

相似文献

Large-scale automated machine reading discovers new cancer-driving mechanisms.大规模自动化机器阅读发现新的致癌驱动机制。

Database (Oxford). 2018 Jan 1;2018:bay098. doi: 10.1093/database/bay098.

[Machine Learning Applications in Cancer Genome Medicine].[机器学习在癌症基因组医学中的应用]

Gan To Kagaku Ryoho. 2019 Mar;46(3):423-426.

Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。

Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.

A review of image analysis and machine learning techniques for automated cervical cancer screening from pap-smear images.基于巴氏涂片图像的宫颈癌自动筛查的图像分析和机器学习技术综述。

Comput Methods Programs Biomed. 2018 Oct;164:15-22. doi: 10.1016/j.cmpb.2018.05.034. Epub 2018 Jun 26.

PyBDA: a command line tool for automated analysis of big biological data sets.PyBDA：一个用于自动化分析大型生物数据集的命令行工具。

BMC Bioinformatics. 2019 Nov 12;20(1):564. doi: 10.1186/s12859-019-3087-8.

Automated detection of discourse segment and experimental types from the text of cancer pathway results sections.从癌症通路结果部分的文本中自动检测语篇片段和实验类型。

Database (Oxford). 2016 Aug 31;2016. doi: 10.1093/database/baw122. Print 2016.

Opportunities at the Intersection of Synthetic Biology, Machine Learning, and Automation.合成生物学、机器学习与自动化交叉领域的机遇。

ACS Synth Biol. 2019 Jul 19;8(7):1474-1477. doi: 10.1021/acssynbio.8b00540.

Using machine learning algorithms to identify genes essential for cell survival.使用机器学习算法识别细胞存活所必需的基因。

BMC Bioinformatics. 2017 Oct 3;18(Suppl 11):397. doi: 10.1186/s12859-017-1799-1.

Prediction of cancer proteins by integrating protein interaction, domain frequency, and domain interaction data using machine learning algorithms.利用机器学习算法整合蛋白质相互作用、结构域频率和结构域相互作用数据来预测癌症蛋白质。

Biomed Res Int. 2015;2015:312047. doi: 10.1155/2015/312047. Epub 2015 Mar 17.

Machine Learning Approaches in Cardiovascular Imaging.心血管成像中的机器学习方法

Circ Cardiovasc Imaging. 2017 Oct;10(10). doi: 10.1161/CIRCIMAGING.117.005614.

引用本文的文献

Context-aware knowledge selection and reliable model recommendation with ACCORDION.使用ACCORDION进行上下文感知知识选择和可靠模型推荐。

Front Syst Biol. 2024 Apr 18;4:1308292. doi: 10.3389/fsysb.2024.1308292. eCollection 2024.

Computational tools and data integration to accelerate vaccine development: challenges, opportunities, and future directions.加速疫苗开发的计算工具与数据整合：挑战、机遇及未来方向

Front Immunol. 2025 Mar 7;16:1502484. doi: 10.3389/fimmu.2025.1502484. eCollection 2025.

A Computational Protocol for the Knowledge-Based Assessment and Capture of Pathologies.基于知识的病理评估和捕获的计算方案。

Methods Mol Biol. 2025;2868:265-284. doi: 10.1007/978-1-0716-4200-9_14.

Beyond protein lists: AI-assisted interpretation of proteomic investigations in the context of evolving scientific knowledge.超越蛋白质列表：在不断发展的科学知识背景下，人工智能辅助蛋白质组学研究的解读

Nat Methods. 2024 Aug;21(8):1387-1389. doi: 10.1038/s41592-024-02324-4.

Semantics-enabled biomedical literature analytics.支持语义分析的生物医学文献分析

J Biomed Inform. 2024 Feb;150:104588. doi: 10.1016/j.jbi.2024.104588. Epub 2024 Jan 19.

Leveraging Structured Biological Knowledge for Counterfactual Inference: A Case Study of Viral Pathogenesis.利用结构化生物学知识进行反事实推理：病毒致病机制的案例研究

IEEE Trans Big Data. 2021 Jan 18;7(1):25-37. doi: 10.1109/TBDATA.2021.3050680. eCollection 2021 Mar 1.

Automated assembly of molecular mechanisms at scale from text mining and curated databases.从文本挖掘和经过整理的数据库中大规模自动组装分子机制。

Mol Syst Biol. 2023 May 9;19(5):e11325. doi: 10.15252/msb.202211325. Epub 2023 Mar 20.

Developing a Knowledge Graph for Pharmacokinetic Natural Product-Drug Interactions.开发药代动力学天然产物-药物相互作用知识库。

J Biomed Inform. 2023 Apr;140:104341. doi: 10.1016/j.jbi.2023.104341. Epub 2023 Mar 17.

NDEx IQuery: a multi-method network gene set analysis leveraging the Network Data Exchange.NDEx IQuery：一种利用网络数据交换的多方法网络基因集分析。

Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad118.

Gilda: biomedical entity text normalization with machine-learned disambiguation as a service.吉尔达：作为一种服务的、带有机器学习消歧功能的生物医学实体文本规范化。

Bioinform Adv. 2022 May 11;2(1):vbac034. doi: 10.1093/bioadv/vbac034. eCollection 2022.

本文引用的文献

Platelet procoagulant phenotype is modulated by a p38-MK2 axis that regulates RTN4/Nogo proximal to the endoplasmic reticulum: utility of pathway analysis.血小板促凝表型受 p38-MK2 轴调节，该轴靠近内质网调节 RTN4/Nogo：通路分析的实用性。

Am J Physiol Cell Physiol. 2018 May 1;314(5):C603-C615. doi: 10.1152/ajpcell.00177.2017. Epub 2018 Feb 7.

From word models to executable models of signaling networks using automated assembly.使用自动化装配从单词模型到信号网络的可执行模型。

Mol Syst Biol. 2017 Nov 24;13(11):954. doi: 10.15252/msb.20177651.

Inferring causal molecular networks: empirical assessment through a community-based effort.推断因果分子网络：通过基于社区的努力进行实证评估。

Nat Methods. 2016 Apr;13(4):310-8. doi: 10.1038/nmeth.3773. Epub 2016 Feb 22.

Perturbation biology nominates upstream-downstream drug combinations in RAF inhibitor resistant melanoma cells.扰动生物学确定了RAF抑制剂耐药黑色素瘤细胞中的上下游药物组合。

Elife. 2015 Aug 18;4:e04640. doi: 10.7554/eLife.04640.

Extending the evaluation of Genia Event task toward knowledge base construction and comparison to Gene Regulation Ontology task.将Genia事件任务的评估扩展到知识库构建，并与基因调控本体任务进行比较。

BMC Bioinformatics. 2015;16 Suppl 10(Suppl 10):S3. doi: 10.1186/1471-2105-16-S10-S3. Epub 2015 Jul 13.

Big Data: Astronomical or Genomical?大数据：天文学的还是基因组学的？

PLoS Biol. 2015 Jul 7;13(7):e1002195. doi: 10.1371/journal.pbio.1002195. eCollection 2015 Jul.

Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations.基于基因组改变的互斥性对癌症驱动信号通路进行系统鉴定。

Genome Biol. 2015 Feb 26;16(1):45. doi: 10.1186/s13059-015-0612-6.

Prediction of individualized therapeutic vulnerabilities in cancer from genomic profiles.从基因组图谱预测癌症的个体化治疗弱点。

Bioinformatics. 2014 Jul 15;30(14):2051-9. doi: 10.1093/bioinformatics/btu164. Epub 2014 Mar 24.

Pathway Commons at virtual cell: use of pathway data for mathematical modeling.通路公共知识库在虚拟细胞中的应用：通路数据在数学建模中的应用。

Bioinformatics. 2014 Jan 15;30(2):292-4. doi: 10.1093/bioinformatics/btt660. Epub 2013 Nov 22.

A robust approach to extract biomedical events from literature.从文献中提取生物医学事件的稳健方法。

Bioinformatics. 2012 Oct 15;28(20):2654-61. doi: 10.1093/bioinformatics/bts487. Epub 2012 Aug 1.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

大规模自动化机器阅读发现新的致癌驱动机制。

Large-scale automated machine reading discovers new cancer-driving mechanisms.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献