• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动识别儿科电子健康记录中生长数据的不合理值。

Automated identification of implausible values in growth data from pediatric electronic health records.

作者信息

Daymont Carrie, Ross Michelle E, Russell Localio A, Fiks Alexander G, Wasserman Richard C, Grundmeier Robert W

机构信息

Departments of Pediatrics and Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA.

Department of Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.

出版信息

J Am Med Inform Assoc. 2017 Nov 1;24(6):1080-1087. doi: 10.1093/jamia/ocx037.

DOI:10.1093/jamia/ocx037
PMID:28453637
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7651915/
Abstract

OBJECTIVE

Large electronic health record (EHR) datasets are increasingly used to facilitate research on growth, but measurement and recording errors can lead to biased results. We developed and tested an automated method for identifying implausible values in pediatric EHR growth data.

MATERIALS AND METHODS

Using deidentified data from 46 primary care sites, we developed an algorithm to identify weight and height values that should be excluded from analysis, including implausible values and values that were recorded repeatedly without remeasurement. The foundation of the algorithm is a comparison of each measurement, expressed as a standard deviation score, with a weighted moving average of a child's other measurements. We evaluated the performance of the algorithm by (1) comparing its results with the judgment of physician reviewers for a stratified random selection of 400 measurements and (2) evaluating its accuracy in a dataset with simulated errors.

RESULTS

Of 2 000 595 growth measurements from 280 610 patients 1 to 21 years old, 3.8% of weight and 4.5% of height values were identified as implausible or excluded for other reasons. The proportion excluded varied widely by primary care site. The automated method had a sensitivity of 97% (95% confidence interval [CI], 94-99%) and a specificity of 90% (95% CI, 85-94%) for identifying implausible values compared to physician judgment, and identified 95% (weight) and 98% (height) of simulated errors.

DISCUSSION AND CONCLUSION

This automated, flexible, and validated method for preparing large datasets will facilitate the use of pediatric EHR growth datasets for research.

摘要

目的

大型电子健康记录(EHR)数据集越来越多地用于促进生长研究,但测量和记录误差可能导致结果有偏差。我们开发并测试了一种自动方法,用于识别儿科EHR生长数据中不合理的值。

材料与方法

利用来自46个初级保健机构的去识别化数据,我们开发了一种算法,以识别应从分析中排除的体重和身高值,包括不合理的值以及未经重新测量而重复记录的值。该算法的基础是将每个测量值(表示为标准差分数)与儿童其他测量值的加权移动平均值进行比较。我们通过以下方式评估该算法的性能:(1)将其结果与医生评审员对分层随机选择的400个测量值的判断进行比较;(2)在一个带有模拟误差的数据集中评估其准确性。

结果

在来自280610名1至21岁患者的2000595次生长测量中,3.8%的体重值和4.5%的身高值被确定为不合理或因其他原因被排除。排除比例在不同初级保健机构之间差异很大。与医生的判断相比,该自动方法识别不合理值的灵敏度为97%(95%置信区间[CI],94 - 99%),特异性为90%(95%CI,85 - 94%),并且识别出了95%(体重)和98%(身高)的模拟误差。

讨论与结论

这种用于准备大型数据集的自动化、灵活且经过验证的方法将有助于利用儿科EHR生长数据集进行研究。

相似文献

1
Automated identification of implausible values in growth data from pediatric electronic health records.自动识别儿科电子健康记录中生长数据的不合理值。
J Am Med Inform Assoc. 2017 Nov 1;24(6):1080-1087. doi: 10.1093/jamia/ocx037.
2
A better performing algorithm for identification of implausible growth data from longitudinal pediatric medical records.一种能够更好地识别纵向儿科医疗记录中不合理增长数据的算法。
Sci Rep. 2024 Aug 6;14(1):18276. doi: 10.1038/s41598-024-69161-5.
3
Identifying biologically implausible values in big longitudinal data: an example applied to child growth data from the Brazilian food and nutrition surveillance system.识别大型纵向数据中的生物学上不合理的值:应用于巴西食品和营养监测系统儿童生长数据的示例。
BMC Med Res Methodol. 2024 Feb 15;24(1):38. doi: 10.1186/s12874-024-02161-1.
4
Identifying erroneous height and weight values from adult electronic health records in the All of Us research program.从“我们所有人”研究计划中的成人电子健康记录中识别错误的身高和体重值。
J Biomed Inform. 2024 Jul;155:104660. doi: 10.1016/j.jbi.2024.104660. Epub 2024 May 23.
5
Body weight and height data in electronic medical records of children.儿童电子病历中的体重和身高数据。
Int J Pediatr Obes. 2010 May 3;5(3):237-42. doi: 10.3109/17477160903268308.
6
Automated data cleaning of paediatric anthropometric data from longitudinal electronic health records: protocol and application to a large patient cohort.从纵向电子健康记录中自动清理儿科人体测量数据:方案和在大型患者队列中的应用。
Sci Rep. 2020 Jun 23;10(1):10164. doi: 10.1038/s41598-020-66925-7.
7
A clustering approach for detecting implausible observation values in electronic health records data.一种用于检测电子健康记录数据中不合理观测值的聚类方法。
BMC Med Inform Decis Mak. 2019 Jul 23;19(1):142. doi: 10.1186/s12911-019-0852-6.
8
New approach for the identification of implausible values and outliers in longitudinal childhood anthropometric data.用于鉴定纵向儿童人体测量数据中不合理值和离群值的新方法。
Ann Epidemiol. 2018 Mar;28(3):204-211.e3. doi: 10.1016/j.annepidem.2018.01.007. Epub 2018 Jan 11.
9
Completeness and accuracy of anthropometric measurements in electronic medical records for children attending primary care.基层医疗中儿童电子病历中人体测量数据的完整性和准确性。
J Innov Health Inform. 2018 Mar 9;25(1):963. doi: 10.14236/jhi.v25i1.963.
10
A big-data approach to producing descriptive anthropometric references: a feasibility and validation study of paediatric growth charts.大数据方法生成描述性人体测量参考值:儿科生长图表的可行性和验证研究。
Lancet Digit Health. 2019 Dec;1(8):e413-e423. doi: 10.1016/S2589-7500(19)30149-9. Epub 2019 Nov 7.

引用本文的文献

1
Universal free school meals and child body mass index: impact among children in community-based health care organizations.普及免费学校餐与儿童体重指数:对社区医疗保健机构中儿童的影响
Health Aff Sch. 2025 Jul 18;3(8):qxaf144. doi: 10.1093/haschl/qxaf144. eCollection 2025 Aug.
2
Classification of childhood obesity using longitudinal clinical body mass index and its validation.利用纵向临床体重指数对儿童肥胖进行分类及其验证。
Int J Obes (Lond). 2025 Jul 17. doi: 10.1038/s41366-025-01836-z.
3
Early Infancy Weight Gain Patterns Are Associated with Child Body Mass Index in Singletons but Not Twins.单胎而非双胎的早期婴儿体重增加模式与儿童体重指数相关。
J Pediatr. 2025 Jun 12;285:114687. doi: 10.1016/j.jpeds.2025.114687.
4
Weight status of children and adolescents with autism spectrum disorder: A cross-sectional analysis of primary care electronic medical records and linked health administrative datasets in Ontario, Canada.自闭症谱系障碍儿童和青少年的体重状况:加拿大安大略省初级保健电子病历及相关健康管理数据集的横断面分析
Pediatr Obes. 2025 Jun;20(6):e70000. doi: 10.1111/ijpo.70000. Epub 2025 Mar 18.
5
Utilization of locally estimated scatterplot smoothing (LOESS) regression to estimate missing weights in a longitudinal cohort of breast cancer patients.利用局部估计散点图平滑(LOESS)回归来估计乳腺癌患者纵向队列中缺失的权重。
Ann Epidemiol. 2025 Apr;104:55-60. doi: 10.1016/j.annepidem.2025.02.013. Epub 2025 Mar 4.
6
Early childhood weight trajectory differences in twins, singletons, and gestational-age matched singletons.双胞胎、单胎以及孕周匹配单胎的幼儿期体重轨迹差异
Prev Med Rep. 2025 Jan 24;50:102984. doi: 10.1016/j.pmedr.2025.102984. eCollection 2025 Feb.
7
Classification of childhood obesity using longitudinal clinical body mass index and its validation.利用纵向临床体重指数对儿童肥胖进行分类及其验证
Res Sq. 2024 Dec 16:rs.3.rs-5392188. doi: 10.21203/rs.3.rs-5392188/v1.
8
The Philadelphia Beverage Tax and Pediatric Weight Outcomes.费城饮料税与儿童体重结果
JAMA Pediatr. 2025 Jan 1;179(1):46-54. doi: 10.1001/jamapediatrics.2024.4782.
9
Associations of the Philadelphia sweetened beverage tax with changes in adult body weight: an interrupted time series analysis.费城甜味饮料税与成人体重变化的关联:一项中断时间序列分析。
Lancet Reg Health Am. 2024 Oct 12;39:100906. doi: 10.1016/j.lana.2024.100906. eCollection 2024 Nov.
10
A framework for conducting GWAS using repeated measures data with an application to childhood BMI.使用重复测量数据进行 GWAS 的框架及其在儿童 BMI 中的应用。
Nat Commun. 2024 Nov 20;15(1):10067. doi: 10.1038/s41467-024-53687-3.

本文引用的文献

1
Antibiotic Exposure During the First 6 Months of Life and Weight Gain During Childhood.生命最初 6 个月抗生素暴露与儿童期体重增加。
JAMA. 2016;315(12):1258-65. doi: 10.1001/jama.2016.2395.
2
Identifying outliers and implausible values in growth trajectory data.识别生长轨迹数据中的异常值和不合理值。
Ann Epidemiol. 2016 Jan;26(1):77-80.e1-2. doi: 10.1016/j.annepidem.2015.10.002. Epub 2015 Oct 19.
3
Validity of the WHO cutoffs for biologically implausible values of weight, height, and BMI in children and adolescents in NHANES from 1999 through 2012.1999年至2012年美国国家健康与营养检查调查(NHANES)中世界卫生组织(WHO)针对儿童和青少年体重、身高及体重指数(BMI)生物学上不合理值的临界值的有效性。
Am J Clin Nutr. 2015 Nov;102(5):1000-6. doi: 10.3945/ajcn.115.115576. Epub 2015 Sep 16.
4
Comparing Methods for Identifying Biologically Implausible Values in Height, Weight, and Body Mass Index Among Youth.比较青少年身高、体重和体重指数中生物学上不合理值的识别方法。
Am J Epidemiol. 2015 Aug 15;182(4):359-65. doi: 10.1093/aje/kwv057. Epub 2015 Jul 15.
5
Comparative Effectiveness Research Through a Collaborative Electronic Reporting Consortium.通过协作电子报告联盟开展的比较效果研究。
Pediatrics. 2015 Jul;136(1):e215-24. doi: 10.1542/peds.2015-0673. Epub 2015 Jun 22.
6
Systematic growth monitoring for the early detection of celiac disease in children.系统生长监测用于早期发现儿童乳糜泻。
JAMA Pediatr. 2015 Mar;169(3):e1525. doi: 10.1001/jamapediatrics.2015.25. Epub 2015 Mar 2.
7
Negation's not solved: generalizability versus optimizability in clinical natural language processing.否定问题尚未解决:临床自然语言处理中的可推广性与可优化性
PLoS One. 2014 Nov 13;9(11):e112774. doi: 10.1371/journal.pone.0112774. eCollection 2014.
8
Trends in relative weight over 1 year in low-income urban youth.低收入城市青年一年来相对体重的变化趋势。
Obesity (Silver Spring). 2015 Feb;23(2):436-42. doi: 10.1002/oby.20928. Epub 2014 Oct 30.
9
Launching PCORnet, a national patient-centered clinical research network.启动 PCORnet,一个全国性的以患者为中心的临床研究网络。
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):578-82. doi: 10.1136/amiajnl-2014-002747. Epub 2014 May 12.
10
Growth in children with congenital heart disease.儿童先天性心脏病的生长发育。
Pediatrics. 2013 Jan;131(1):e236-42. doi: 10.1542/peds.2012-1157. Epub 2012 Dec 10.