• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于电子健康记录(EHR)数据多队列分析的快速高效特征工程

Fast and Efficient Feature Engineering for Multi-Cohort Analysis of EHR Data.

作者信息

Ozery-Flato Michal, Yanover Chen, Gottlieb Assaf, Weissbrod Omer, Parush Shear-Yashuv Naama, Goldschmidt Yaara

机构信息

Healthcare Informatics Department, IBM Research - Haifa, Israel.

出版信息

Stud Health Technol Inform. 2017;235:181-185.

PMID:28423779
Abstract

We present a framework for feature engineering, tailored for longitudinal structured data, such as electronic health records (EHRs). To fast-track feature engineering and extraction, the framework combines general-use plug-in extractors, a multi-cohort management mechanism, and modular memoization. Using this framework, we rapidly extracted thousands of features from diverse and large healthcare data sources in multiple projects.

摘要

我们提出了一个针对纵向结构化数据(如电子健康记录(EHR))量身定制的特征工程框架。为了快速推进特征工程和提取,该框架结合了通用插件提取器、多队列管理机制和模块化记忆功能。使用此框架,我们在多个项目中从多样且庞大的医疗数据源中快速提取了数千个特征。

相似文献

1
Fast and Efficient Feature Engineering for Multi-Cohort Analysis of EHR Data.用于电子健康记录(EHR)数据多队列分析的快速高效特征工程
Stud Health Technol Inform. 2017;235:181-185.
2
A machine learning-based framework to identify type 2 diabetes through electronic health records.一种基于机器学习的通过电子健康记录识别2型糖尿病的框架。
Int J Med Inform. 2017 Jan;97:120-127. doi: 10.1016/j.ijmedinf.2016.09.014. Epub 2016 Oct 1.
3
CER Hub: An informatics platform for conducting comparative effectiveness research using multi-institutional, heterogeneous, electronic clinical data.CER中心:一个利用多机构、异构电子临床数据进行比较效果研究的信息学平台。
Int J Med Inform. 2015 Oct;84(10):763-73. doi: 10.1016/j.ijmedinf.2015.06.002. Epub 2015 Jun 10.
4
Importance of health information technology, electronic health records, and continuously aggregating data to comparative effectiveness research and learning health care.重视健康信息技术、电子健康记录以及不断积累数据,以进行比较效果研究和学习医疗保健。
J Clin Oncol. 2012 Dec 1;30(34):4243-8. doi: 10.1200/JCO.2012.42.8011. Epub 2012 Oct 15.
5
Machine learning for administrative health records: A systematic review of techniques and applications.基于管理健康记录的机器学习:技术与应用的系统综述。
Artif Intell Med. 2023 Oct;144:102642. doi: 10.1016/j.artmed.2023.102642. Epub 2023 Aug 27.
6
A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records.利用稀疏多元时间特征的分类框架及其在医疗记录中药物不良事件检测的应用。
BMC Med Inform Decis Mak. 2019 Jan 10;19(1):7. doi: 10.1186/s12911-018-0717-4.
7
The geographic distribution of cardiovascular health in the stroke prevention in healthcare delivery environments (SPHERE) study.医疗服务环境中卒中预防(SPHERE)研究中心血管健康的地理分布情况。
J Biomed Inform. 2016 Apr;60:95-103. doi: 10.1016/j.jbi.2016.01.013. Epub 2016 Jan 29.
8
Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records.深度患者:一种从电子健康记录中预测患者未来的无监督表示。
Sci Rep. 2016 May 17;6:26094. doi: 10.1038/srep26094.
9
Strategies for Primary Care Stakeholders to Improve Electronic Health Records (EHRs).基层医疗利益相关者改善电子健康记录(EHRs)的策略。
J Am Board Fam Med. 2016 Jan-Feb;29(1):126-34. doi: 10.3122/jabfm.2016.01.150212.
10
Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records.2型糖尿病进展预测模型的逆向工程与评估:基于电子健康记录的机器学习应用
J Diabetes Sci Technol. 2015 Dec 20;10(1):6-18. doi: 10.1177/1932296815620200.

引用本文的文献

1
Scalable information extraction from free text electronic health records using large language models.使用大语言模型从自由文本电子健康记录中进行可扩展的信息提取。
BMC Med Res Methodol. 2025 Jan 28;25(1):23. doi: 10.1186/s12874-025-02470-z.
2
Subtyping Gastrointestinal Surgical Outcomes from Real World Data: A Comprehensive Analysis of UK Biobank.从真实世界数据对胃肠外科手术结局进行亚分型:英国生物样本库的全面分析。
AMIA Annu Symp Proc. 2024 Jan 11;2023:426-435. eCollection 2023.
3
Leveraging Comprehensive Health Records for Breast Cancer Risk Prediction: A Binational Assessment.
利用综合健康记录进行乳腺癌风险预测:一项中澳评估。
AMIA Annu Symp Proc. 2023 Apr 29;2022:385-394. eCollection 2022.
4
Impact of the COVID-19 Pandemic on Clinical Findings in Medical Imaging Exams in a Nationwide Israeli Health Organization: Observational Study.新冠疫情对以色列全国性卫生组织医学影像检查临床结果的影响:观察性研究
JMIR Form Res. 2023 May 2;7:e42930. doi: 10.2196/42930.
5
Artificial intelligence-based prediction of overall survival in metastatic renal cell carcinoma.基于人工智能的转移性肾细胞癌总生存期预测
Front Oncol. 2023 Feb 16;13:1021684. doi: 10.3389/fonc.2023.1021684. eCollection 2023.
6
Predictive and Causal Analysis of No-Shows for Medical Exams During COVID-19: A Case Study of Breast Imaging in a Nationwide Israeli Health Organization.预测和因果分析 COVID-19 期间医疗检查的爽约情况:以以色列全国性医疗组织的乳腺成像为例。
AMIA Annu Symp Proc. 2022 Feb 21;2021:930-939. eCollection 2021.
7
Longitudinal K-means approaches to clustering and analyzing EHR opioid use trajectories for clinical subtypes.纵向 K 均值方法在聚类和分析电子健康记录阿片类药物使用轨迹中的临床亚型的应用。
J Biomed Inform. 2021 Oct;122:103889. doi: 10.1016/j.jbi.2021.103889. Epub 2021 Aug 16.
8
Emulated Clinical Trials from Longitudinal Real-World Data Efficiently Identify Candidates for Neurological Disease Modification: Examples from Parkinson's Disease.基于纵向真实世界数据的模拟临床试验能够有效识别神经疾病修饰治疗的候选对象:帕金森病的实例
Front Pharmacol. 2021 Apr 22;12:631584. doi: 10.3389/fphar.2021.631584. eCollection 2021.
9
Framework for identifying drug repurposing candidates from observational healthcare data.从观察性医疗保健数据中识别药物重新利用候选药物的框架。
JAMIA Open. 2020 Dec 31;3(4):536-544. doi: 10.1093/jamiaopen/ooaa048. eCollection 2020 Dec.
10
Performing an Informatics Consult: Methods and Challenges.开展信息学咨询:方法与挑战。
J Am Coll Radiol. 2018 Mar;15(3 Pt B):563-568. doi: 10.1016/j.jacr.2017.12.023. Epub 2018 Feb 13.