Baker Brennan H, Zhang Shaoyi, Simon Jeremy M, McLarnan Sarah M, Chung Wendy K, Pearson Brandon L
Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY, United States.
Master of Public Health Program, Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, United States.
Front Neurosci. 2023 Aug 3;17:1106573. doi: 10.3389/fnins.2023.1106573. eCollection 2023.
mutations contribute to a large proportion of sporadic psychiatric and developmental disorders, yet the potential role of environmental carcinogens as drivers of causal mutations in neurodevelopmental disorders is poorly studied.
To explore environmental mutation vulnerability of disease-associated gene sets, we analyzed publicly available whole genome sequencing datasets of mutations in human induced pluripotent stem cell clonal lines exposed to 12 classes of environmental carcinogens, and human lung cancers from individuals living in highly polluted regions. We compared observed rates of exposure-induced mutations in disease-related gene sets with the expected rates of mutations based on control genes randomly sampled from the genome using exact binomial tests. To explore the role of sequence characteristics in mutation vulnerability, we modeled the effects of sequence length, gene expression, and percent GC content on mutation rates of entire genes and gene coding sequences using multivariate Quasi-Poisson regressions.
We demonstrate that several mutagens, including radiation and polycyclic aromatic hydrocarbons, disproportionately mutate genes related to neurodevelopmental disorders including autism spectrum disorders, schizophrenia, and attention deficit hyperactivity disorder. Other disease genes including amyotrophic lateral sclerosis, Alzheimer's disease, congenital heart disease, orofacial clefts, and coronary artery disease were generally not mutated more than expected. Longer sequence length was more strongly associated with elevated mutations in entire genes compared with mutations in coding sequences. Increased expression was associated with decreased coding sequence mutation rate, but not with the mutability of entire genes. Increased GC content was associated with increased coding sequence mutation rates but decreased mutation rates in entire genes.
Our findings support the possibility that neurodevelopmental disorder genetic etiology is partially driven by a contribution of environment-induced germ line and somatic mutations.
突变在很大一部分散发性精神疾病和发育障碍中起作用,然而环境致癌物作为神经发育障碍中因果突变驱动因素的潜在作用却鲜有研究。
为了探究疾病相关基因集的环境突变易感性,我们分析了公开可用的全基因组测序数据集,这些数据集包括暴露于12类环境致癌物的人类诱导多能干细胞克隆系中的突变,以及生活在高污染地区个体的肺癌。我们使用精确二项式检验,将疾病相关基因集中观察到的暴露诱导突变率与基于从基因组中随机抽样的对照基因的预期突变率进行比较。为了探究序列特征在突变易感性中的作用,我们使用多变量拟泊松回归对序列长度、基因表达和GC含量百分比对整个基因和基因编码序列突变率的影响进行建模。
我们证明,包括辐射和多环芳烃在内的几种诱变剂对与神经发育障碍相关的基因(包括自闭症谱系障碍、精神分裂症和注意力缺陷多动障碍)造成的突变比例过高。其他疾病相关基因,包括肌萎缩侧索硬化症、阿尔茨海默病、先天性心脏病、口腔颌面部裂隙和冠状动脉疾病,通常突变情况没有超过预期。与编码序列中的突变相比,更长序列长度与整个基因中更高的突变率更密切相关。表达增加与编码序列突变率降低相关,但与整个基因的突变性无关。GC含量增加与编码序列突变率增加相关,但与整个基因的突变率降低相关。
我们的研究结果支持这样一种可能性,即神经发育障碍的遗传病因部分是由环境诱导的生殖系和体细胞突变所驱动的。