• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Risk prediction with imperfect survival outcome information from electronic health records.从电子健康记录中获取不完美的生存结局信息进行风险预测。
Biometrics. 2023 Mar;79(1):190-202. doi: 10.1111/biom.13599. Epub 2021 Nov 22.
2
Semisupervised Calibration of Risk with Noisy Event Times (SCORNET) using electronic health record data.基于电子健康记录数据的带噪声事件时间的半监督风险校准(SCORNET)。
Biostatistics. 2023 Jul 14;24(3):760-775. doi: 10.1093/biostatistics/kxac003.
3
Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) with Electronic Health Records.基于电子健康记录的半监督双深度学习时间风险预测(SeDDLeR)
J Biomed Inform. 2024 Sep;157:104685. doi: 10.1016/j.jbi.2024.104685. Epub 2024 Jul 14.
4
Semi-supervised calibration of noisy event risk (SCANER) with electronic health records.基于电子健康记录的带噪事件风险的半监督校准(SCANER)。
J Biomed Inform. 2023 Aug;144:104425. doi: 10.1016/j.jbi.2023.104425. Epub 2023 Jun 16.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Semiparametric estimation of the transformation model by leveraging external aggregate data in the presence of population heterogeneity.利用群体异质性下外部聚合数据对半参数变换模型进行估计。
Biometrics. 2023 Sep;79(3):1996-2009. doi: 10.1111/biom.13778. Epub 2022 Nov 10.
7
Semisupervised transfer learning for evaluation of model classification performance.半监督迁移学习在模型分类性能评估中的应用。
Biometrics. 2024 Jan 29;80(1). doi: 10.1093/biomtc/ujae002.
8
An augmented estimation procedure for EHR-based association studies accounting for differential misclassification.基于电子健康记录的关联研究的增强估计程序,考虑到差异误诊。
J Am Med Inform Assoc. 2020 Feb 1;27(2):244-253. doi: 10.1093/jamia/ocz180.
9
Semisupervised multicategory classification with imperfect model.使用不完美模型的半监督多类别分类
IEEE Trans Neural Netw. 2009 Oct;20(10):1594-603. doi: 10.1109/TNN.2009.2027320. Epub 2009 Aug 28.
10
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.

引用本文的文献

1
LATTE: Label-efficient incident phenotyping from longitudinal electronic health records.LATTE:从纵向电子健康记录中进行高效标签事件表型分析。
Patterns (N Y). 2023 Dec 27;5(1):100906. doi: 10.1016/j.patter.2023.100906. eCollection 2024 Jan 12.

本文引用的文献

1
Raking and regression calibration: Methods to address bias from correlated covariate and time-to-event error.耙平和回归校准:解决相关协变量和生存误差偏倚的方法。
Stat Med. 2021 Feb 10;40(3):631-649. doi: 10.1002/sim.8793. Epub 2020 Nov 2.
2
High-throughput multimodal automated phenotyping (MAP) with application to PheWAS.高通量多模态自动化表型分析 (MAP) 在 pheWAS 中的应用。
J Am Med Inform Assoc. 2019 Nov 1;26(11):1255-1262. doi: 10.1093/jamia/ocz066.
3
Determining the Time of Cancer Recurrence Using Claims or Electronic Medical Record Data.利用理赔数据或电子病历数据确定癌症复发时间
JCO Clin Cancer Inform. 2018 Dec;2:1-10. doi: 10.1200/CCI.17.00163.
4
Nonparametric Adjustment for Measurement Error in Time-to-Event Data: Application to Risk Prediction Models.生存时间数据测量误差的非参数调整:在风险预测模型中的应用
J Am Stat Assoc. 2018;113(521):14-25. doi: 10.1080/01621459.2017.1311261. Epub 2017 Sep 14.
5
A validation sampling approach for consistent estimation of adverse drug reaction risk with misclassified right-censored survival data.一种验证性抽样方法,用于一致估计右删失生存数据中错误分类的药物不良反应风险。
Stat Med. 2018 Nov 30;37(27):3887-3903. doi: 10.1002/sim.7854. Epub 2018 Aug 6.
6
Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX.考虑对存在误差的生存时间结局进行分析:SIMEX 法的偏倚和校正。
Stat Med. 2018 Apr 15;37(8):1276-1289. doi: 10.1002/sim.7554. Epub 2017 Nov 29.
7
Health Effects of Overweight and Obesity in 195 Countries over 25 Years.25年间195个国家超重和肥胖对健康的影响
N Engl J Med. 2017 Jul 6;377(1):13-27. doi: 10.1056/NEJMoa1614362. Epub 2017 Jun 12.
8
Using electronic health record collected clinical variables to predict medical intensive care unit mortality.利用电子健康记录收集的临床变量预测医学重症监护病房的死亡率。
Ann Med Surg (Lond). 2016 Sep 6;11:52-57. doi: 10.1016/j.amsu.2016.09.002. eCollection 2016 Nov.
9
Maximum likelihood estimation for semiparametric transformation models with interval-censored data.具有区间删失数据的半参数变换模型的极大似然估计
Biometrika. 2016 Jun;103(2):253-271. doi: 10.1093/biomet/asw013. Epub 2016 May 24.
10
A genetic risk score combining 32 SNPs is associated with body mass index and improves obesity prediction in people with major depressive disorder.一个结合了32个单核苷酸多态性的遗传风险评分与体重指数相关,并改善了重度抑郁症患者的肥胖预测。
BMC Med. 2015 Apr 17;13:86. doi: 10.1186/s12916-015-0334-3.

从电子健康记录中获取不完美的生存结局信息进行风险预测。

Risk prediction with imperfect survival outcome information from electronic health records.

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

出版信息

Biometrics. 2023 Mar;79(1):190-202. doi: 10.1111/biom.13599. Epub 2021 Nov 22.

DOI:10.1111/biom.13599
PMID:34747010
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9741856/
Abstract

Readily available proxies for the time of disease onset such as the time of the first diagnostic code can lead to substantial risk prediction error if performing analyses based on poor proxies. Due to the lack of detailed documentation and labor intensiveness of manual annotation, it is often only feasible to ascertain for a small subset the current status of the disease by a follow-up time rather than the exact time. In this paper, we aim to develop risk prediction models for the onset time efficiently leveraging both a small number of labels on the current status and a large number of unlabeled observations on imperfect proxies. Under a semiparametric transformation model for onset and a highly flexible measurement error model for proxy onset time, we propose the semisupervised risk prediction method by combining information from proxies and limited labels efficiently. From an initially estimator solely based on the labeled subset, we perform a one-step correction with the full data augmenting against a mean zero rank correlation score derived from the proxies. We establish the consistency and asymptotic normality of the proposed semisupervised estimator and provide a resampling procedure for interval estimation. Simulation studies demonstrate that the proposed estimator performs well in a finite sample. We illustrate the proposed estimator by developing a genetic risk prediction model for obesity using data from Mass General Brigham Healthcare Biobank.

摘要

易于获得的疾病发病时间的替代指标,如首次诊断代码的时间,如果基于这些较差的替代指标进行分析,可能会导致严重的风险预测误差。由于缺乏详细的文档记录和手动注释的劳动强度,通常只有一小部分可以通过随访时间而不是确切时间来确定疾病的当前状态。在本文中,我们旨在通过充分利用当前状态的少量标签和大量不完善替代指标的未标记观察值,有效地开发发病时间的风险预测模型。在发病的半参数转换模型和代理发病时间的高度灵活的测量误差模型下,我们提出了通过有效结合代理和有限标签的信息的半监督风险预测方法。从最初仅基于标记子集的估计器开始,我们通过从代理中获得的均值为零的秩相关得分的全数据扩充来执行一步校正。我们证明了所提出的半监督估计器的一致性和渐近正态性,并提供了一个用于区间估计的重采样过程。模拟研究表明,该估计器在有限样本中表现良好。我们通过使用 Mass General Brigham Healthcare Biobank 的数据开发肥胖的遗传风险预测模型来说明所提出的估计器。