Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations.

Affiliations

State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, School of Life Science, Fudan University, Shanghai, 200438, China.

Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.

Publication information

Mol Genet Genomics. 2024 Jul 7;299(1):65. doi: 10.1007/s00438-024-02158-x.

Abstract

BACKGROUND

A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations.

RESULTS

Our analysis revealed high levels of genetic variation in CMRGs: 68.73% exhibited copy number variations and 65.20% contained short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring more copy number variants and short variants than samples from other continents. Notably, 15.79% to 33.96% of short variants were detectable only through long-read sequencing. Although the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution.

CONCLUSION

Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.
