Boonin Patcharin, Klumsathian Sommon, Iemwimangsa Nareenart, Sensorn Insee, Charoenyingwatana Angkana, Chantratita Wasun, Chareonsirisuthigul Takol
Department of Pathology, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand.
Center for Medical Genomics, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok 10400, Thailand.
Biology (Basel). 2025 Mar 17;14(3):301. doi: 10.3390/biology14030301.
This trio-based whole-genome sequencing (WGS) study enhances the accuracy of variant detection by leveraging parental genotypes, which facilitates the identification of de novo mutations and population-specific variants. Nonetheless, the comprehensive genetic variation data of the Thai population remain limited, posing challenges to advancing personalized medicine and population-based screening strategies. We establish the genetic variation information of a healthy Thai population by analyzing the sequences of 40 trios, yielding 120 whole genomes (excluding offspring). The resulting dataset encompasses 20.2 million variants, including 1.1 million novel and 19.1 million known variants. Within this dataset, we identify 169 pathogenic variants, of which 56 are classified as rare and 87 are absent from the ClinVar database as of version 2023. These pathogenic variants, particularly the rare and de novo mutations, will likely be of significant interest for genetic association studies. Notably, one pathogenic variant linked to a de novo mutation is found in the gene, which is associated with craniofacial microsomia. With its innovative methodology and comprehensive dataset, our trio-based whole-genome sequencing study provides an invaluable representation of the genetic variations in the Thai population. These data provide a critical foundation for further analyses of the pathogenic variants related to human disease phenotypes in genetic association studies.
这项基于三人组的全基因组测序(WGS)研究通过利用亲本基因型提高了变异检测的准确性,这有助于识别新生突变和特定人群的变异。尽管如此,泰国人群的全面遗传变异数据仍然有限,这给推进个性化医疗和基于人群的筛查策略带来了挑战。我们通过分析40个三人组的序列建立了健康泰国人群的遗传变异信息,得到了120个全基因组(不包括后代)。所得数据集包含2020万个变异,其中包括110万个新变异和1910万个已知变异。在这个数据集中,我们识别出169个致病变异,其中56个被归类为罕见变异,截至2023年版本,ClinVar数据库中没有87个。这些致病变异,特别是罕见变异和新生突变,可能对遗传关联研究具有重要意义。值得注意的是,在与颅面短小畸形相关的基因中发现了一个与新生突变相关的致病变异。凭借其创新的方法和全面的数据集,我们基于三人组的全基因组测序研究提供了泰国人群遗传变异的宝贵代表。这些数据为在遗传关联研究中进一步分析与人类疾病表型相关的致病变异提供了关键基础。