常用的基因组芯片可能会因为对自闭症谱系障碍发现的变异覆盖不完美而丢失信息。

Commonly used genomic arrays may lose information due to imperfect coverage of discovered variants for autism spectrum disorder.

机构信息

Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.

Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.

出版信息

J Neurodev Disord. 2024 Sep 12;16(1):54. doi: 10.1186/s11689-024-09571-8.

DOI:10.1186/s11689-024-09571-8

PMID:39266988

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11397030/

Abstract

BACKGROUND

Common genetic variation has been shown to account for a large proportion of ASD heritability. Polygenic scores generated for autism spectrum disorder (ASD-PGS) using the most recent discovery data, however, explain less variance than expected, despite reporting significant associations with ASD and other ASD-related traits. Here, we investigate the extent to which information loss on the target study genome-wide microarray weakens the predictive power of the ASD-PGS.

METHODS

We studied genotype data from three cohorts of individuals with high familial liability for ASD: The Early Autism Risk Longitudinal Investigation (EARLI), Markers of Autism Risk in Babies-Learning Early Signs (MARBLES), and the Infant Brain Imaging Study (IBIS), and one population-based sample, Study to Explore Early Development Phase I (SEED I). Individuals were genotyped on different microarrays ranging from 1 to 5 million sites. Coverage of the top 88 genome-wide suggestive variants implicated in the discovery was evaluated in all four studies before quality control (QC), after QC, and after imputation. We then created a novel method to assess coverage on the resulting ASD-PGS by correlating a PGS informed by a comprehensive list of variants to a PGS informed with only the available variants.

RESULTS

Prior to imputations, None of the four cohorts directly or indirectly covered all 88 variants among the measured genotype data. After imputation, the two cohorts genotyped on 5-million arrays reached full coverage. Analysis of our novel metric showed generally high genome-wide coverage across all four studies, but a greater number of SNPs informing the ASD-PGS did not result in improved coverage according to our metric.

LIMITATIONS

The studies we analyzed contained modest sample sizes. Our analyses included microarrays with more than 1-million sites, so smaller arrays such as Global Diversity and the PsychArray were not included. Our PGS metric for ASD is only generalizable to samples of European ancestries, though the coverage metric can be computed for traits that have sufficiently large-sized discovery findings in other ancestries.

CONCLUSIONS

We show that commonly used genotyping microarrays have incomplete coverage for common ASD variants, and imputation cannot always recover lost information. Our novel metric provides an intuitive approach to reporting information loss in PGS and an alternative to reporting the total number of SNPs included in the PGS. While applied only to ASD here, this metric can easily be used with other traits.

摘要

背景

已证实常见遗传变异可在很大程度上解释 ASD 的遗传率。然而，使用最新发现的数据生成的自闭症谱系障碍多基因评分 (ASD-PGS) 解释的方差比预期的要小，尽管其与 ASD 及其他与 ASD 相关的特征存在显著关联。在此，我们研究了目标研究全基因组微阵列上的信息丢失在多大程度上削弱了 ASD-PGS 的预测能力。

方法

我们研究了三个具有高 ASD 家族易感性的个体队列的基因型数据：早期自闭症风险纵向研究 (EARLI)、婴儿自闭症风险标志物-学习早期迹象 (MARBLES) 和婴儿大脑成像研究 (IBIS)，以及一个基于人群的样本——探索早期发育阶段 I 研究 (SEED I)。个体在从 100 万到 500 万个位点不等的不同微阵列上进行了基因分型。在进行质量控制 (QC) 之前、之后以及在进行 imputation 之后，我们评估了在四项研究中对发现中涉及的前 88 个全基因组提示性变异的最高覆盖率。然后，我们创建了一种新方法，通过将由综合变异列表提供的 PGS 与仅由可用变异提供的 PGS 进行相关，来评估对生成的 ASD-PGS 的覆盖率。

结果

在 imputation 之前，四个队列中没有一个直接或间接覆盖了测量的基因型数据中所有 88 个变异。在 imputation 之后，两个使用 500 万微阵列的队列达到了完全覆盖。根据我们的新指标对全基因组的分析显示，四个研究均具有较高的覆盖率，但根据我们的指标，提供 ASD-PGS 的 SNP 数量的增加并未导致覆盖率的提高。

局限性

我们分析的研究包含的样本量适中。我们的分析包括了超过 100 万个位点的微阵列，因此没有包括较小的阵列，如 Global Diversity 和 PsychArray。我们的 ASD-PGS 指标仅适用于欧洲血统的样本，尽管对于在其他血统中具有足够大的发现发现的特征，可以计算覆盖度指标。

结论

我们表明，常用的基因分型微阵列对常见的 ASD 变体的覆盖度不完整，并且 imputation 并不总是能恢复丢失的信息。我们的新指标为报告 PGS 中的信息丢失提供了一种直观的方法，并且是报告 PGS 中包含的 SNP 总数的替代方法。虽然此处仅应用于 ASD，但该指标可轻松用于其他特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1131/11397030/a7f53a3ef26e/11689_2024_9571_Fig1_HTML.jpg

相似文献

Commonly used genomic arrays may lose information due to imperfect coverage of discovered variants for autism spectrum disorder.常用的基因组芯片可能会因为对自闭症谱系障碍发现的变异覆盖不完美而丢失信息。

J Neurodev Disord. 2024 Sep 12;16(1):54. doi: 10.1186/s11689-024-09571-8.

The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》

Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施：系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。

Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.

Methylphenidate for children and adolescents with autism spectrum disorder.用于治疗自闭症谱系障碍儿童和青少年的哌醋甲酯

Cochrane Database Syst Rev. 2017 Nov 21;11(11):CD011144. doi: 10.1002/14651858.CD011144.pub2.

Can a Liquid Biopsy Detect Circulating Tumor DNA With Low-passage Whole-genome Sequencing in Patients With a Sarcoma? A Pilot Evaluation.液体活检能否通过低深度全基因组测序检测肉瘤患者的循环肿瘤DNA？一项初步评估。

Clin Orthop Relat Res. 2025 Jan 1;483(1):39-48. doi: 10.1097/CORR.0000000000003161. Epub 2024 Jun 21.

A New Measure of Quantified Social Health Is Associated With Levels of Discomfort, Capability, and Mental and General Health Among Patients Seeking Musculoskeletal Specialty Care.一种新的量化社会健康指标与寻求肌肉骨骼专科护理的患者的不适程度、能力以及心理和总体健康水平相关。

Clin Orthop Relat Res. 2025 Apr 1;483(4):647-663. doi: 10.1097/CORR.0000000000003394. Epub 2025 Feb 5.

Pharmacological intervention for irritability, aggression, and self-injury in autism spectrum disorder (ASD).自闭症谱系障碍（ASD）中易怒、攻击行为和自我伤害的药物干预。

Cochrane Database Syst Rev. 2023 Oct 9;10(10):CD011769. doi: 10.1002/14651858.CD011769.pub2.

Memantine for autism spectrum disorder.美金刚治疗自闭症谱系障碍。

Cochrane Database Syst Rev. 2022 Aug 25;8(8):CD013845. doi: 10.1002/14651858.CD013845.pub2.

Behavioural and cognitive behavioural therapy for obsessive compulsive disorder (OCD) in individuals with autism spectrum disorder (ASD).针对自闭症谱系障碍（ASD）个体的强迫症（OCD）的行为和认知行为疗法。

Cochrane Database Syst Rev. 2021 Sep 3;9(9):CD013173. doi: 10.1002/14651858.CD013173.pub2.

本文引用的文献

A comprehensive evaluation of polygenic score and genotype imputation performances of human SNP arrays in diverse populations.多基因评分与人类 SNP 芯片在不同人群中的基因型推断性能的综合评估。

Sci Rep. 2022 Oct 20;12(1):17556. doi: 10.1038/s41598-022-22215-y.

Subcortical Brain Development in Autism and Fragile X Syndrome: Evidence for Dynamic, Age- and Disorder-Specific Trajectories in Infancy.自闭症和脆性 X 综合征的皮质下脑发育：婴儿期动态、年龄和疾病特异性轨迹的证据。

Am J Psychiatry. 2022 Aug;179(8):562-572. doi: 10.1176/appi.ajp.21090896. Epub 2022 Mar 25.

How rare and common risk variation jointly affect liability for autism spectrum disorder.罕见和常见风险变异如何共同影响自闭症谱系障碍的责任。

Mol Autism. 2021 Oct 6;12(1):66. doi: 10.1186/s13229-021-00466-2.

A fast and robust Bayesian nonparametric method for prediction of complex traits using summary statistics.一种基于贝叶斯非参数方法的快速稳健的统计量预测复杂性状的方法。

PLoS Genet. 2021 Jul 26;17(7):e1009697. doi: 10.1371/journal.pgen.1009697. eCollection 2021 Jul.

A Comparison of Ten Polygenic Score Methods for Psychiatric Disorders Applied Across Multiple Cohorts.多种队列研究中精神障碍十种多基因风险评分方法的比较

Biol Psychiatry. 2021 Nov 1;90(9):611-620. doi: 10.1016/j.biopsych.2021.04.018. Epub 2021 May 4.

The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation.多基因风险评分目录作为一个开放的数据库，用于可重复性和系统评估。

Nat Genet. 2021 Apr;53(4):420-425. doi: 10.1038/s41588-021-00783-5.

LDpred2: better, faster, stronger.LDpred2：更优、更快、更强。

Bioinformatics. 2021 Apr 1;36(22-23):5424-5431. doi: 10.1093/bioinformatics/btaa1029.

Risk in Relatives, Heritability, SNP-Based Heritability, and Genetic Correlations in Psychiatric Disorders: A Review.精神障碍亲属风险、遗传性、基于 SNP 的遗传性以及遗传相关性：综述。

Biol Psychiatry. 2021 Jan 1;89(1):11-19. doi: 10.1016/j.biopsych.2020.05.034. Epub 2020 Jun 10.

Polygenic risk scores: from research tools to clinical instruments.多基因风险评分：从研究工具到临床工具。

Genome Med. 2020 May 18;12(1):44. doi: 10.1186/s13073-020-00742-5.

Association of Genetic Risks With Autism Spectrum Disorder and Early Neurodevelopmental Delays Among Children Without Intellectual Disability.遗传风险与非智力残疾儿童自闭症谱系障碍和早期神经发育迟缓的关联。

JAMA Netw Open. 2020 Feb 5;3(2):e1921644. doi: 10.1001/jamanetworkopen.2019.21644.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

常用的基因组芯片可能会因为对自闭症谱系障碍发现的变异覆盖不完美而丢失信息。

Commonly used genomic arrays may lose information due to imperfect coverage of discovered variants for autism spectrum disorder.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

LIMITATIONS

CONCLUSIONS

背景

方法

结果

局限性

结论

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献