• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ATLAS: an automated association test using probabilistically linked health records with application to genetic studies.ATLAS:一种使用概率链接健康记录进行自动关联测试的方法,应用于遗传研究。
J Am Med Inform Assoc. 2021 Nov 25;28(12):2582-2592. doi: 10.1093/jamia/ocab187.
2
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
3
Linking Electronic Health Record and Trauma Registry Data: Assessing the Value of Probabilistic Linkage.连接电子健康记录与创伤登记数据:评估概率性连接的价值。
Methods Inf Med. 2018 Nov;57(5-06):261-269. doi: 10.1055/s-0039-1681087. Epub 2019 Mar 15.
4
Evaluating bias due to data linkage error in electronic healthcare records.评估电子医疗记录中因数据链接错误导致的偏差。
BMC Med Res Methodol. 2014 Mar 5;14:36. doi: 10.1186/1471-2288-14-36.
5
Evaluation of record linkage of two large administrative databases in a middle income country: stillbirths and notifications of dengue during pregnancy in Brazil.中等收入国家两个大型行政数据库的记录关联评估:巴西的死产与孕期登革热通报情况
BMC Med Inform Decis Mak. 2017 Jul 17;17(1):108. doi: 10.1186/s12911-017-0506-5.
6
Evaluating the use of existing data sources, probabilistic linkage, and multiple imputation to build population-based injury databases across phases of trauma care.评估利用现有数据源、概率性链接和多重插补在创伤救治各阶段构建基于人群的伤害数据库。
Acad Emerg Med. 2012 Apr;19(4):469-80. doi: 10.1111/j.1553-2712.2012.01324.x.
7
Automated linkage of patient records from disparate sources.来自不同来源的患者记录的自动链接。
Stat Methods Med Res. 2018 Jan;27(1):172-184. doi: 10.1177/0962280215626180. Epub 2016 Jul 20.
8
Comparing Methods for Record Linkage for Public Health Action: Matching Algorithm Validation Study.比较公共卫生行动记录链接的方法:匹配算法验证研究。
JMIR Public Health Surveill. 2020 Apr 30;6(2):e15917. doi: 10.2196/15917.
9
Building A Longitudinal Cohort From 9-1-1 to 1-Year Using Existing Data Sources, Probabilistic Linkage, and Multiple Imputation: A Validation Study.从现有的数据源、概率链接和多重插补构建一个从 9-1-1 到 1 年的纵向队列:一项验证研究。
Acad Emerg Med. 2018 Nov;25(11):1268-1283. doi: 10.1111/acem.13512. Epub 2018 Jul 31.
10
Utilising identifier error variation in linkage of large administrative data sources.利用大型行政数据源链接中的标识符错误变异。
BMC Med Res Methodol. 2017 Feb 7;17(1):23. doi: 10.1186/s12874-017-0306-8.

本文引用的文献

1
A new hybrid record linkage process to make epidemiological databases interoperable: application to the GEMO and GENEPSO studies involving BRCA1 and BRCA2 mutation carriers.一种新的混合记录链接过程,使流行病学数据库具有互操作性:在涉及 BRCA1 和 BRCA2 突变携带者的 GEMO 和 GENEPSO 研究中的应用。
BMC Med Res Methodol. 2021 Jul 29;21(1):155. doi: 10.1186/s12874-021-01299-6.
2
C-reactive protein and implications in rheumatoid arthritis and associated comorbidities.C反应蛋白及其在类风湿关节炎和相关合并症中的意义。
Semin Arthritis Rheum. 2021 Feb;51(1):219-229. doi: 10.1016/j.semarthrit.2020.11.005. Epub 2020 Dec 17.
3
Impact of ICD10 and secular changes on electronic medical record rheumatoid arthritis algorithms.ICD10 和长期变化对电子病历类风湿关节炎算法的影响。
Rheumatology (Oxford). 2020 Dec 1;59(12):3759-3766. doi: 10.1093/rheumatology/keaa198.
4
Reflections on modern methods: linkage error bias.关于现代方法的思考:连锁错误偏差。
Int J Epidemiol. 2019 Dec 1;48(6):2050-2060. doi: 10.1093/ije/dyz203.
5
Probabilistic record linkage of de-identified research datasets with discrepancies using diagnosis codes.使用诊断代码对存在差异的去标识研究数据集进行概率性记录链接。
Sci Data. 2019 Jan 8;6:180298. doi: 10.1038/sdata.2018.298.
6
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
7
Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record.评估电子健康记录中全表型关联研究的疾病编码、临床分类软件和国际疾病分类第九版临床修订本编码。
PLoS One. 2017 Jul 7;12(7):e0175508. doi: 10.1371/journal.pone.0175508. eCollection 2017.
8
The Biobank Portal for Partners Personalized Medicine: A Query Tool for Working with Consented Biobank Samples, Genotypes, and Phenotypes Using i2b2.合作伙伴个性化医疗生物样本库门户:一个使用i2b2处理已获同意的生物样本库样本、基因型和表型的查询工具。
J Pers Med. 2016 Feb 26;6(1):11. doi: 10.3390/jpm6010011.
9
The Information Technology Infrastructure for the Translational Genomics Core and the Partners Biobank at Partners Personalized Medicine.转化基因组学核心以及合作伙伴个性化医疗生物样本库的信息技术基础设施。
J Pers Med. 2016 Jan 21;6(1):6. doi: 10.3390/jpm6010006.
10
Building the Partners HealthCare Biobank at Partners Personalized Medicine: Informed Consent, Return of Research Results, Recruitment Lessons and Operational Considerations.在合作伙伴个性化医疗中心建立合作伙伴医疗生物样本库:知情同意、研究结果反馈、招募经验及操作考量。
J Pers Med. 2016 Jan 14;6(1):2. doi: 10.3390/jpm6010002.

ATLAS:一种使用概率链接健康记录进行自动关联测试的方法,应用于遗传研究。

ATLAS: an automated association test using probabilistically linked health records with application to genetic studies.

机构信息

Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA.

Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, Massachusetts, USA.

出版信息

J Am Med Inform Assoc. 2021 Nov 25;28(12):2582-2592. doi: 10.1093/jamia/ocab187.

DOI:10.1093/jamia/ocab187
PMID:34608931
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8633652/
Abstract

OBJECTIVE

Large amounts of health data are becoming available for biomedical research. Synthesizing information across databases may capture more comprehensive pictures of patient health and enable novel research studies. When no gold standard mappings between patient records are available, researchers may probabilistically link records from separate databases and analyze the linked data. However, previous linked data inference methods are constrained to certain linkage settings and exhibit low power. Here, we present ATLAS, an automated, flexible, and robust association testing algorithm for probabilistically linked data.

MATERIALS AND METHODS

Missing variables are imputed at various thresholds using a weighted average method that propagates uncertainty from probabilistic linkage. Next, estimated effect sizes are obtained using a generalized linear model. ATLAS then conducts the threshold combination test by optimally combining P values obtained from data imputed at varying thresholds using Fisher's method and perturbation resampling.

RESULTS

In simulations, ATLAS controls for type I error and exhibits high power compared to previous methods. In a real-world genetic association study, meta-analysis of ATLAS-enabled analyses on a linked cohort with analyses using an existing cohort yielded additional significant associations between rheumatoid arthritis genetic risk score and laboratory biomarkers.

DISCUSSION

Weighted average imputation weathers false matches and increases contribution of true matches to mitigate linkage error-induced bias. The threshold combination test avoids arbitrarily choosing a threshold to rule a match, thus automating linked data-enabled analyses and preserving power.

CONCLUSION

ATLAS promises to enable novel and powerful research studies using linked data to capitalize on all available data sources.

摘要

目的

大量的健康数据可用于生物医学研究。在数据库之间综合信息可以更全面地了解患者的健康状况,并能够开展新的研究。当患者记录之间没有标准的映射关系时,研究人员可以通过概率链接来自不同数据库的记录,并对链接的数据进行分析。然而,以前的链接数据推断方法受到某些链接设置的限制,并且功效较低。本研究提出了 ATLAS,这是一种用于概率链接数据的自动、灵活和稳健的关联测试算法。

材料和方法

使用加权平均方法在各种阈值处对缺失变量进行插补,该方法从概率链接传播不确定性。接下来,使用广义线性模型获得估计的效应大小。然后,ATLAS 通过使用 Fisher 方法和扰动重采样最优地组合从在不同阈值处插补数据获得的 P 值来进行阈值组合检验。

结果

在模拟中,ATLAS 控制了Ⅰ型错误,与以前的方法相比具有更高的功效。在一项真实的遗传关联研究中,对链接队列进行了 ATLAS 支持的分析的荟萃分析,并对使用现有队列的分析进行了荟萃分析,结果表明类风湿关节炎遗传风险评分与实验室生物标志物之间存在更多的显著关联。

讨论

加权平均插补可以减轻错误匹配的影响,增加真实匹配的贡献,从而减轻链接错误引起的偏差。阈值组合检验避免了任意选择阈值来判断匹配的情况,从而实现了链接数据驱动的分析自动化,并保持了功效。

结论

ATLAS 有望通过使用链接数据来开展新的、强大的研究,从而充分利用所有可用的数据资源。