Zhang Sijia, Song Qiao, Zhang Peng, Wang Xiaona, Guo Rong, Li Yanyan, Liu Shuai, Yan Xiaoyu, Zhang Jingjing, Niu Yiwei, Shi Yirong, Song Tingrui, Xu Tao, He Shunmin
Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; Department of Scientific Research, Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research & Affiliated Cancer Hospital of Nanjing Medical University, Nanjing, China.
Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.
Cell Genom. 2024 Dec 11;4(12):100699. doi: 10.1016/j.xgen.2024.100699. Epub 2024 Nov 27.
Variable number tandem repeat (VNTR) is a pervasive and highly mutable genetic feature that varies in both length and repeat sequence. Despite the well-studied copy-number variants, the functional impacts of repeat motif polymorphisms remain unknown. Here, we present the largest genome-wide VNTR polymorphism map to date, with over 2.5 million VNTR length polymorphisms (VNTR-LPs) and over 11 million VNTR motif polymorphisms (VNTR-MPs) detected in 8,222 high-coverage genomes. Leveraging the large-scale NyuWa cohort, we identified 2,982,456 (31.8%) NyuWa-specific VNTR-MPs, of which 95.3% were rare. Moreover, we found 1,937 out of 38,685 VNTRs that were associated with gene expression through VNTR-MPs in lymphoblastoid cell lines. Specifically, we clarified that the expansion of a likely causal motif could upregulate gene expression by improving the binding concentration of PU.1. We also explored the potential impacts of VNTR polymorphisms on phenotypic differentiation and disease susceptibility. This study expands our knowledge of VNTR-MPs and their functional implications.
可变数目串联重复序列(VNTR)是一种普遍存在且高度可变的遗传特征,其长度和重复序列均有所不同。尽管对拷贝数变异已有深入研究,但重复基序多态性的功能影响仍不明确。在此,我们展示了迄今为止最大的全基因组VNTR多态性图谱,在8222个高覆盖度基因组中检测到超过250万个VNTR长度多态性(VNTR-LP)和超过1100万个VNTR基序多态性(VNTR-MP)。利用大规模的女娲队列,我们鉴定出2982456个(31.8%)女娲特异性VNTR-MP,其中95.3%为罕见型。此外,我们在38685个VNTR中发现1937个通过VNTR-MP与淋巴母细胞系中的基因表达相关。具体而言,我们阐明了一个可能的因果基序的扩增可通过提高PU.1的结合浓度来上调基因表达。我们还探讨了VNTR多态性对表型分化和疾病易感性的潜在影响。本研究扩展了我们对VNTR-MP及其功能意义的认识。