• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

关系型数据集的模式发现与解缠。

Pattern discovery and disentanglement on relational datasets.

机构信息

Systems Design Engineering, University of Waterloo, Waterloo, ON, Canada.

School of Public Health and Health Systems, University of Waterloo, Waterloo, ON, Canada.

出版信息

Sci Rep. 2021 Mar 11;11(1):5688. doi: 10.1038/s41598-021-84869-4.

DOI:10.1038/s41598-021-84869-4
PMID:33707478
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7952710/
Abstract

Machine Learning has made impressive advances in many applications akin to human cognition for discernment. However, success has been limited in the areas of relational datasets, particularly for data with low volume, imbalanced groups, and mislabeled cases, with outputs that typically lack transparency and interpretability. The difficulties arise from the subtle overlapping and entanglement of functional and statistical relations at the source level. Hence, we have developed Pattern Discovery and Disentanglement System (PDD), which is able to discover explicit patterns from the data with various sizes, imbalanced groups, and screen out anomalies. We present herein four case studies on biomedical datasets to substantiate the efficacy of PDD. It improves prediction accuracy and facilitates transparent interpretation of discovered knowledge in an explicit representation framework PDD Knowledge Base that links the sources, the patterns, and individual patients. Hence, PDD promises broad and ground-breaking applications in genomic and biomedical machine learning.

摘要

机器学习在许多类似于人类认知的应用中取得了令人瞩目的进展,能够进行识别。然而,在关系型数据集方面,特别是在数据量少、不均衡群体和标记错误的情况下,其成功受到了限制,输出结果通常缺乏透明度和可解释性。这些困难源于源级别功能和统计关系的微妙重叠和纠缠。因此,我们开发了模式发现和分解系统(PDD),它能够从各种大小、不均衡群体的数据中发现显式模式,并筛选出异常值。我们在此介绍了四个关于生物医学数据集的案例研究,以证实 PDD 的功效。它提高了预测准确性,并在一个显式表示框架 PDD 知识库中促进了发现知识的透明解释,该知识库将源、模式和个体患者联系起来。因此,PDD 有望在基因组学和生物医学机器学习中得到广泛而突破性的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/e402acac54df/41598_2021_84869_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/eb48efd8ba9d/41598_2021_84869_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/9c493ddda870/41598_2021_84869_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/1a6e3fdcd551/41598_2021_84869_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/609f7b99b992/41598_2021_84869_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/e402acac54df/41598_2021_84869_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/eb48efd8ba9d/41598_2021_84869_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/9c493ddda870/41598_2021_84869_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/1a6e3fdcd551/41598_2021_84869_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/609f7b99b992/41598_2021_84869_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b76b/7952710/e402acac54df/41598_2021_84869_Fig5_HTML.jpg

相似文献

1
Pattern discovery and disentanglement on relational datasets.关系型数据集的模式发现与解缠。
Sci Rep. 2021 Mar 11;11(1):5688. doi: 10.1038/s41598-021-84869-4.
2
Explanation and prediction of clinical data with imbalanced class distribution based on pattern discovery and disentanglement.基于模式发现与解缠的具有不平衡类别分布的临床数据的解释与预测。
BMC Med Inform Decis Mak. 2021 Jan 9;21(1):16. doi: 10.1186/s12911-020-01356-y.
3
Theory and rationale of interpretable all-in-one pattern discovery and disentanglement system.可解释一体化模式发现与解缠系统的理论与原理
NPJ Digit Med. 2023 May 22;6(1):92. doi: 10.1038/s41746-023-00816-9.
4
Pattern Discovery and Disentanglement for Aligned Pattern Cluster Analysis and Protein Binding Complexes Detection用于对齐模式聚类分析和蛋白质结合复合物检测的模式发现与解缠
5
An Unsupervised Error Detection Methodology for Detecting Mislabels in Healthcare Analytics.一种用于医疗保健分析中检测错误标签的无监督错误检测方法。
Bioengineering (Basel). 2024 Jul 31;11(8):770. doi: 10.3390/bioengineering11080770.
6
Discovery and disentanglement of aligned residue associations from aligned pattern clusters to reveal subgroup characteristics.从比对模式簇中发现并解开比对残基关联以揭示亚组特征。
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):103. doi: 10.1186/s12920-018-0417-z.
7
Chained study and the discovery of relational structure.连锁研究与关系结构的发现。
Mem Cognit. 2022 Jan;50(1):95-111. doi: 10.3758/s13421-021-01201-1. Epub 2021 Jul 15.
8
Variational Disentanglement for Rare Event Modeling.用于罕见事件建模的变分解缠
ArXiv. 2020 Sep 17:arXiv:2009.08541v5.
9
Variational Disentanglement for Rare Event Modeling.用于罕见事件建模的变分解缠
Proc AAAI Conf Artif Intell. 2021 May 18;35(12):10469-10477.
10
A theory of the discovery and predication of relational concepts.一种关于关系概念的发现与预测的理论。
Psychol Rev. 2008 Jan;115(1):1-43. doi: 10.1037/0033-295X.115.1.1.

引用本文的文献

1
Benchmarking Interpretability in Healthcare Using Pattern Discovery and Disentanglement.利用模式发现和解缠结对医疗保健中的可解释性进行基准测试。
Bioengineering (Basel). 2025 Mar 18;12(3):308. doi: 10.3390/bioengineering12030308.
2
AI illuminates paths in oral cancer: transformative insights, diagnostic precision, and personalized strategies.人工智能照亮口腔癌治疗之路:变革性见解、诊断精准度与个性化策略。
EXCLI J. 2024 Sep 3;23:1091-1116. doi: 10.17179/excli2024-7253. eCollection 2024.
3
An Unsupervised Error Detection Methodology for Detecting Mislabels in Healthcare Analytics.

本文引用的文献

1
Explanation and prediction of clinical data with imbalanced class distribution based on pattern discovery and disentanglement.基于模式发现与解缠的具有不平衡类别分布的临床数据的解释与预测。
BMC Med Inform Decis Mak. 2021 Jan 9;21(1):16. doi: 10.1186/s12911-020-01356-y.
2
With an eye to AI and autonomous diagnosis.着眼于人工智能与自主诊断。
NPJ Digit Med. 2018 Aug 28;1:40. doi: 10.1038/s41746-018-0048-y. eCollection 2018.
3
Early Detection of Depression: Social Network Analysis and Random Forest Techniques.抑郁症的早期检测:社交网络分析与随机森林技术
一种用于医疗保健分析中检测错误标签的无监督错误检测方法。
Bioengineering (Basel). 2024 Jul 31;11(8):770. doi: 10.3390/bioengineering11080770.
4
Theory and rationale of interpretable all-in-one pattern discovery and disentanglement system.可解释一体化模式发现与解缠系统的理论与原理
NPJ Digit Med. 2023 May 22;6(1):92. doi: 10.1038/s41746-023-00816-9.
J Med Internet Res. 2019 Jun 10;21(6):e12554. doi: 10.2196/12554.
4
Artificial intelligence in healthcare.人工智能在医疗保健领域的应用。
Nat Biomed Eng. 2018 Oct;2(10):719-731. doi: 10.1038/s41551-018-0305-z. Epub 2018 Oct 10.
5
Enhancing Diagnosis of Autism With Optimized Machine Learning Models and Personal Characteristic Data.利用优化的机器学习模型和个人特征数据增强自闭症诊断
Front Comput Neurosci. 2019 Feb 15;13:9. doi: 10.3389/fncom.2019.00009. eCollection 2019.
6
Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence.人工智能在儿科疾病评估和精准诊断中的应用。
Nat Med. 2019 Mar;25(3):433-438. doi: 10.1038/s41591-018-0335-9. Epub 2019 Feb 11.
7
High-performance medicine: the convergence of human and artificial intelligence.高性能医学:人机智能融合。
Nat Med. 2019 Jan;25(1):44-56. doi: 10.1038/s41591-018-0300-7. Epub 2019 Jan 7.
8
Discovery and disentanglement of aligned residue associations from aligned pattern clusters to reveal subgroup characteristics.从比对模式簇中发现并解开比对残基关联以揭示亚组特征。
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):103. doi: 10.1186/s12920-018-0417-z.
9
Pattern to Knowledge: Deep Knowledge-Directed Machine Learning for Residue-Residue Interaction Prediction.模式到知识:用于残基-残基相互作用预测的深度知识导向的机器学习。
Sci Rep. 2018 Oct 4;8(1):14841. doi: 10.1038/s41598-018-32834-z.
10
Artificial intelligence in healthcare: past, present and future.人工智能在医疗保健中的应用:过去、现在和未来。
Stroke Vasc Neurol. 2017 Jun 21;2(4):230-243. doi: 10.1136/svn-2017-000101. eCollection 2017 Dec.