Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes.

Affiliations

Purdue University, West Lafayette, IN 47907, USA.

IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598, USA.

Publication information

Exp Biol Med (Maywood). 2022 Nov;247(22):2015-2024. doi: 10.1177/15353702221126671. Epub 2022 Nov 18.

Abstract

Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging because clinical phenotypes are often available through unstructured electronic health records. Moreover, in the absence of a standardized questionnaire, physicians could be biased toward selecting certain medical terms. In this article, we develop an efficient pipeline to address these challenges and help practitioners reveal the pathways associated with the disease. We use topological data analysis and redescriptions and propose a pipeline of four phases: (1) pre-processing the clinical notes to extract the salient concepts, (2) constructing a feature space of the patients to characterize the extracted concepts, (3) leveraging the topological properties to distill the available knowledge and visualize the extracted features, and finally, (4) investigating the bias in the clinical notes of the selected features and identifying possible pathways. Our experiments on a publicly available dataset of COVID-19 clinical notes show that our pipeline can indeed extract meaningful pathways.
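The idea of a redescription mentioned above can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes a toy, hypothetical patient-to-concept mapping (standing in for the feature space of phase 2), restricts each description to a single concept, skips the topological step entirely, and scores a pair of descriptions by the Jaccard similarity of the patient sets they cover, which is the usual accuracy measure in redescription mining. The names `patients`, `support`, `jaccard`, `redescriptions`, and the threshold `min_jaccard` are all illustrative assumptions.

```python
from itertools import combinations

# Hypothetical toy data: patient id -> concepts extracted from that patient's notes.
patients = {
    "p1": {"fever", "cough", "anosmia"},
    "p2": {"fever", "cough"},
    "p3": {"anosmia", "ageusia"},
    "p4": {"fever", "cough", "dyspnea"},
    "p5": {"anosmia", "ageusia", "fever"},
}


def support(concepts):
    """Return the patients whose notes mention every concept in the conjunction."""
    wanted = set(concepts)
    return {pid for pid, found in patients.items() if wanted <= found}


def jaccard(a, b):
    """Jaccard similarity of two patient sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def redescriptions(min_jaccard=0.6):
    """List concept pairs that describe nearly the same group of patients."""
    vocab = sorted(set().union(*patients.values()))
    results = []
    for left, right in combinations(vocab, 2):
        score = jaccard(support([left]), support([right]))
        if score >= min_jaccard:
            results.append((left, right, score))
    # Highest-scoring pairs first.
    return sorted(results, key=lambda r: -r[2])


if __name__ == "__main__":
    for left, right, score in redescriptions():
        print(f"{left} <-> {right}: Jaccard = {score:.2f}")
```

On this toy data, two pairs pass the threshold: `cough` and `fever` (Jaccard 0.75) and `ageusia` and `anosmia` (about 0.67). In a real setting, each side of a redescription would typically be a richer Boolean query over the features, and the matching patient groups would be candidates for further clinical interpretation.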

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5815/9791306/46a63102276d/10.1177_15353702221126671-fig1.jpg