• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用基于密度的异常值识别方法,结合多个数据集对脑卒中临床结局进行验证。

Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes.

机构信息

Center for Information Technology, National Institutes of Health, Bethesda, MD, United States.

Bioinformatics Section, National Institute of Neurological Disorder and Stroke, National Institutes of Health, Bethesda, MD, United States; Department of Neurology, National Taiwan University Hospital, Taipei, Taiwan.

出版信息

Int J Med Inform. 2019 Dec;132:103988. doi: 10.1016/j.ijmedinf.2019.103988. Epub 2019 Oct 3.

DOI:10.1016/j.ijmedinf.2019.103988
PMID:31590140
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6880867/
Abstract

INTRODUCTION

Clinicians commonly use the modified Rankin Scale (mRS) and the Barthel Index (BI) to measure clinical outcome after stroke. These are potential targets in machine learning models for stroke outcome prediction. Therefore, the quality of the measurements is crucial for training and validation of these models. The objective of this study was to apply and evaluate density-based outlier detection methods for identifying potentially incorrect measurements in multiple large stroke datasets to assess the measurement quality.

METHOD

We applied three density-based outlier detection methods including density-based spatial clustering of applications (DBSCAN), hierarchical DBSCAN (HDBSCAN) and local outlier factor (LOF) based on a large dataset obtained from a nationwide prospective stroke registry in Taiwan. The testing of each method was done by using four different NINDS funded stroke datasets.

RESULT

The DBSCAN achieved a high performance across all mRS values where the highest average accuracy was 99.2 ± 0.7 at mRS of 4 and the lowest average accuracy was 92.0 ± 4.6 at mRS of 3. The LOF also achieved similar performance, however, the HDBSCAN with default parameters setting required further tuning improvement.

CONCLUSION

The density-based outlier detection methods were proven to be promising for validation of stroke outcome measures. The outlier detection algorithm developed from a large prospective registry dataset was effectively applied in four different NINDS stroke datasets with high performance results. The tool developed from this detection algorithm can be further applied to real world datasets to increase the data quality in stroke outcome measures.

摘要

简介

临床医生通常使用改良的 Rankin 量表(mRS)和巴氏指数(BI)来衡量中风后的临床结果。这些都是中风预后预测机器学习模型中的潜在目标。因此,测量的质量对于这些模型的训练和验证至关重要。本研究的目的是应用和评估基于密度的异常值检测方法,以识别多个大型中风数据集的潜在错误测量,从而评估测量质量。

方法

我们应用了三种基于密度的异常值检测方法,包括基于密度的应用空间聚类(DBSCAN)、层次 DBSCAN(HDBSCAN)和局部离群因子(LOF),这些方法基于从台湾全国前瞻性中风登记处获得的一个大型数据集。每种方法的测试都是通过使用四个不同的 NINDS 资助的中风数据集完成的。

结果

DBSCAN 在所有 mRS 值上都表现出了很高的性能,其中 mRS 为 4 时的最高平均准确率为 99.2±0.7,mRS 为 3 时的最低平均准确率为 92.0±4.6。LOF 也表现出了类似的性能,然而,HDBSCAN 在默认参数设置下需要进一步的调整改进。

结论

基于密度的异常值检测方法已被证明是验证中风预后测量的一种很有前途的方法。从大型前瞻性登记处数据集开发的异常值检测算法有效地应用于四个不同的 NINDS 中风数据集,结果性能较高。从该检测算法开发的工具可以进一步应用于真实世界的数据集,以提高中风预后测量中的数据质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/ffcea187a761/nihms-1544055-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/f1e48587209c/nihms-1544055-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/5dce81ea03f4/nihms-1544055-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/592ea17621fb/nihms-1544055-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/8592bbd53b02/nihms-1544055-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/f268fad5c012/nihms-1544055-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/ffcea187a761/nihms-1544055-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/f1e48587209c/nihms-1544055-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/5dce81ea03f4/nihms-1544055-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/592ea17621fb/nihms-1544055-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/8592bbd53b02/nihms-1544055-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/f268fad5c012/nihms-1544055-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4389/6880867/ffcea187a761/nihms-1544055-f0006.jpg

相似文献

1
Applying density-based outlier identifications using multiple datasets for validation of stroke clinical outcomes.应用基于密度的异常值识别方法,结合多个数据集对脑卒中临床结局进行验证。
Int J Med Inform. 2019 Dec;132:103988. doi: 10.1016/j.ijmedinf.2019.103988. Epub 2019 Oct 3.
2
Entropy-based grid approach for handling outliers: a case study to environmental monitoring data.基于熵的网格方法处理异常值:以环境监测数据为例。
Environ Sci Pollut Res Int. 2023 Dec;30(60):125138-125157. doi: 10.1007/s11356-023-26780-1. Epub 2023 Jun 12.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
How well do standard stroke outcome measures reflect quality of life? A retrospective analysis of clinical trial data.标准卒中结局测量指标在多大程度上反映了生活质量?临床试验数据的回顾性分析。
Stroke. 2013 Nov;44(11):3161-5. doi: 10.1161/STROKEAHA.113.001126. Epub 2013 Sep 19.
5
Differences in psychometric properties, cut-off scores, and outcomes between the Barthel Index and Modified Rankin Scale in pharmacotherapy-based stroke trials: systematic literature review.基于药物治疗的卒中试验中,巴氏指数和改良 Rankin 量表在心理测量特性、截断值和结局方面的差异:系统文献回顾。
Curr Med Res Opin. 2009 Jun;25(6):1329-41. doi: 10.1185/03007990902875877.
6
Shift analysis versus dichotomization of the modified Rankin scale outcome scores in the NINDS and ECASS-II trials.在国立神经疾病与中风研究所(NINDS)和欧洲急性卒中协作研究II(ECASS-II)试验中,改良Rankin量表结局评分的移位分析与二分法比较
Stroke. 2007 Dec;38(12):3205-12. doi: 10.1161/STROKEAHA.107.489351. Epub 2007 Nov 1.
7
An outlier removal method based on PCA-DBSCAN for blood-SERS data analysis.基于 PCA-DBSCAN 的血液 SERS 数据分析异常值去除方法。
Anal Methods. 2024 Feb 8;16(6):846-855. doi: 10.1039/d3ay02037a.
8
Population-based Stroke Atlas for outcome prediction: method and preliminary results for ischemic stroke from CT.基于人群的卒中预后预测图谱:CT 对缺血性卒中的预测方法及初步结果
PLoS One. 2014 Aug 14;9(8):e102048. doi: 10.1371/journal.pone.0102048. eCollection 2014.
9
Evaluation of machine learning methods to stroke outcome prediction using a nationwide disease registry.利用全国性疾病登记系统评估机器学习方法对脑卒中结局的预测。
Comput Methods Programs Biomed. 2020 Jul;190:105381. doi: 10.1016/j.cmpb.2020.105381. Epub 2020 Feb 1.
10
Disability status at 1 month is a reliable proxy for final ischemic stroke outcome.1 个月时的残疾状况是最终缺血性脑卒中结局的可靠替代指标。
Neurology. 2010 Aug 24;75(8):688-92. doi: 10.1212/WNL.0b013e3181eee426.

引用本文的文献

1
Predictive model for totally implanted venous access ports‑related long‑term complications in patients with lung cancer.肺癌患者完全植入式静脉输液港相关长期并发症的预测模型
Oncol Lett. 2024 May 15;28(1):326. doi: 10.3892/ol.2024.14459. eCollection 2024 Jul.
2
Selective ensemble method for anomaly detection based on parallel learning.基于并行学习的异常检测选择性集成方法
Sci Rep. 2024 Jan 16;14(1):1420. doi: 10.1038/s41598-024-51849-3.
3
Using Time Series Clustering to Segment and Infer Emergency Department Nursing Shifts from Electronic Health Record Log Files.

本文引用的文献

1
The utility of multivariate outlier detection techniques for data quality evaluation in large studies: an application within the ONDRI project.多元离群值检测技术在大型研究中数据质量评估的效用:ONDRI 项目中的应用。
BMC Med Res Methodol. 2019 May 15;19(1):102. doi: 10.1186/s12874-019-0737-5.
2
Effects of increasing IV tPA-treated stroke mimic rates at CT-based centers on clinical outcomes.在基于CT的中心提高静脉注射组织型纤溶酶原激活剂治疗的疑似中风发生率对临床结果的影响。
Neurology. 2017 Jul 25;89(4):343-348. doi: 10.1212/WNL.0000000000004149. Epub 2017 Jun 28.
3
Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.
使用时间序列聚类从电子健康记录日志文件中分割和推断急诊护理班次。
AMIA Annu Symp Proc. 2023 Apr 29;2022:805-814. eCollection 2022.
4
Cluster-Based Improved Isolation Forest.基于聚类的改进孤立森林
Entropy (Basel). 2022 Apr 27;24(5):611. doi: 10.3390/e24050611.
5
Comparison of outcome prediction models post-stroke for a population-based registry with clinical variables collected at admission . discharge.基于人群登记处的卒中后结局预测模型与入院、出院时收集的临床变量的比较 。 出院 。
Vessel Plus. 2021;5. Epub 2021 Jan 15.
6
A Review on Computer Aided Diagnosis of Acute Brain Stroke.急性脑卒中专研综述
Sensors (Basel). 2021 Dec 20;21(24):8507. doi: 10.3390/s21248507.
7
Construction and Use of Body Weight Measures from Administrative Data in a Large National Health System: A Systematic Review.大型国家卫生系统中基于行政数据的体重测量指标的构建与应用:一项系统评价
Obesity (Silver Spring). 2020 Jul;28(7):1205-1214. doi: 10.1002/oby.22790. Epub 2020 Jun 1.
预测未来——大数据、机器学习与临床医学。
N Engl J Med. 2016 Sep 29;375(13):1216-9. doi: 10.1056/NEJMp1606181.
4
Outlier detection and removal improves accuracy of machine learning approach to multispectral burn diagnostic imaging.异常值检测与去除提高了机器学习方法在多光谱烧伤诊断成像中的准确性。
J Biomed Opt. 2015 Dec;20(12):121305. doi: 10.1117/1.JBO.20.12.121305.
5
Prehospital use of magnesium sulfate as neuroprotection in acute stroke.院前使用硫酸镁对急性卒中进行神经保护。
N Engl J Med. 2015 Feb 5;372(6):528-36. doi: 10.1056/NEJMoa1408827.
6
High-dose albumin treatment for acute ischaemic stroke (ALIAS) Part 2: a randomised, double-blind, phase 3, placebo-controlled trial.高剂量白蛋白治疗急性缺血性脑卒中(ALIAS)第 2 部分:一项随机、双盲、3 期、安慰剂对照试验。
Lancet Neurol. 2013 Nov;12(11):1049-58. doi: 10.1016/S1474-4422(13)70223-0. Epub 2013 Sep 27.
7
Bayesian methods to determine performance differences and to quantify variability among centers in multi-center trials: the IHAST trial.贝叶斯方法用于确定多中心试验中中心间的性能差异和变异性:IHAST 试验。
BMC Med Res Methodol. 2013 Jan 16;13:5. doi: 10.1186/1471-2288-13-5.
8
Outlier detection for patient monitoring and alerting.患者监测和报警的异常值检测。
J Biomed Inform. 2013 Feb;46(1):47-55. doi: 10.1016/j.jbi.2012.08.004. Epub 2012 Aug 27.
9
The Albumin in Acute Stroke Part 1 Trial: an exploratory efficacy analysis.急性脑卒中白蛋白治疗试验第 1 部分:探索性疗效分析。
Stroke. 2011 Jun;42(6):1621-5. doi: 10.1161/STROKEAHA.110.610980. Epub 2011 May 5.
10
Barthel index for stroke trials: development, properties, and application.脑卒中临床试验中的巴氏指数:发展、特性及应用。
Stroke. 2011 Apr;42(4):1146-51. doi: 10.1161/STROKEAHA.110.598540. Epub 2011 Mar 3.