Sheikh Zayed Institute for Pediatric Surgical Innovation, Children's National Hospital, Washington, DC, USA; Department of Biostatistics & Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Rare Disease Institute, Department of Genetics and Metabolism, Children's National Hospital, Washington, DC, USA.
Lancet Digit Health. 2021 Oct;3(10):e635-e643. doi: 10.1016/S2589-7500(21)00137-0. Epub 2021 Sep 1.
Delays in the diagnosis of genetic syndromes are common, particularly in low and middle-income countries with limited access to genetic screening services. We, therefore, aimed to develop and evaluate a machine learning-based screening technology using facial photographs to evaluate a child's risk of presenting with a genetic syndrome for use at the point of care.
In this retrospective study, we developed a facial deep phenotyping technology based on deep neural networks and facial statistical shape models to screen children for genetic syndromes. We trained the machine learning models on facial photographs from children (aged <21 years) with a clinical or molecular diagnosis of a genetic syndrome and controls without a genetic syndrome matched for age, sex, and race or ethnicity. Images were obtained from three publicly available databases (the Atlas of Human Malformations in Diverse Populations of the National Human Genome Research Institute, Face2Gene, and the dataset available from Ferry and colleagues) and the archives of the Children's National Hospital (Washington, DC, USA), in addition to photographs taken on a standard smartphone at the Children's National Hospital. We designed a deep learning architecture structured into three neural networks, which performed image standardisation (Network A), facial morphology detection (Network B), and genetic syndrome risk estimation, accounting for phenotypic variations due to age, sex, and race or ethnicity (Network C). Data were divided randomly into 40 groups for cross validation, and the performance of the model was evaluated in terms of accuracy, sensitivity, and specificity in both the total population and stratified by race or ethnicity, age, and sex.
Our dataset included 2800 facial photographs of children (1318 [47%] female and 1482 [53%] male; 1576 [56%] White, 432 [15%] African, 430 [15%] Hispanic, and 362 [13%] Asian). 1400 children with 128 genetic conditions were included (the most prevalent being Williams-Beuren syndrome [19%], Cornelia de Lange syndrome [17%], Down syndrome [16%], 22q11.2 deletion [13%], and Noonan syndrome [12%] syndrome) in addition to 1400 photographs of matched controls. In the total population, our deep learning-based model had an accuracy of 88% (95% CI 87-89) for the detection of a genetic syndrome, with 90% sensitivity (95% CI 88-92) and 86% specificity (95% CI 84-88). Accuracy was greater in White (90%, 89-91) and Hispanic populations (91%, 88-94) than in African (84%, 81-87) and Asian populations (82%, 78-86). Accuracy was also similar in male (89%, 87-91) and female children (87%, 85-89), and similar in children younger than 2 years (86%, 84-88) and children aged 2 years or older (eg, 89% [87-91] for those aged 2 years to <5 years).
This genetic screening technology could support early risk stratification at the point of care in global populations, which has the potential accelerate diagnosis and reduce mortality and morbidity through preventive care.
Children's National Hospital and Government of Abu Dhabi.
遗传综合征的诊断延迟很常见,尤其是在获得遗传筛查服务有限的中低收入国家。因此,我们旨在开发和评估一种基于机器学习的筛查技术,该技术使用面部照片来评估儿童患有遗传综合征的风险,以便在护理点使用。
在这项回顾性研究中,我们开发了一种基于深度神经网络和面部统计形状模型的面部深度表型技术,用于筛查儿童遗传综合征。我们使用来自患有遗传综合征的儿童(<21 岁)的临床或分子诊断以及与年龄、性别、种族或民族相匹配的无遗传综合征的对照的面部照片来训练机器学习模型。图像来自三个公开可用的数据库(国家人类基因组研究所的多样化人群的人类畸形图集、Face2Gene 和 Ferry 及其同事提供的数据集)和华盛顿特区儿童国家医院的档案,除了在儿童国家医院的标准智能手机上拍摄的照片。我们设计了一个深度学习架构,该架构分为三个神经网络,分别执行图像标准化(网络 A)、面部形态检测(网络 B)和遗传综合征风险估计,同时考虑了由于年龄、性别和种族或民族而导致的表型变化(网络 C)。数据随机分为 40 组进行交叉验证,并根据总人群和按种族或民族、年龄和性别分层的准确性、敏感性和特异性来评估模型的性能。
我们的数据集包括 2800 张儿童面部照片(1318 张[47%]女性和 1482 张[53%]男性;1576 张[56%]白人,432 张[15%]非洲人,430 张[15%]西班牙裔,362 张[13%]亚洲人)。包括 1400 名患有 128 种遗传疾病的儿童(最常见的是威廉姆斯-贝伦综合征[19%]、科恩利德朗格综合征[17%]、唐氏综合征[16%]、22q11.2 缺失[13%]和努南综合征[12%]综合征),以及 1400 名匹配的对照照片。在总人群中,我们的基于深度学习的模型对遗传综合征的检测准确率为 88%(95%CI 87-89),敏感性为 90%(95%CI 88-92),特异性为 86%(95%CI 84-88)。白人(90%,89-91)和西班牙裔人群(91%,88-94)的准确率高于非洲人(84%,81-87)和亚洲人(82%,78-86)。男性(89%,87-91)和女性儿童(87%,85-89)的准确率相似,2 岁以下儿童(86%,84-88)和 2 岁及以上儿童(例如,2 岁至<5 岁儿童为 89%[87-91])的准确率也相似。
这项遗传筛查技术可以在全球人群中支持护理点的早期风险分层,这有可能通过预防性护理来加速诊断并降低死亡率和发病率。
儿童国家医院和阿布扎比政府。