• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用来自全球 19 个群体的 41 个样本的长读测序数据,揭示 370 个具有挑战性的医学相关基因中的新型遗传变异。

Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations.

机构信息

State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, School of Life Science, Fudan University, Shanghai, 200438, China.

Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.

出版信息

Mol Genet Genomics. 2024 Jul 7;299(1):65. doi: 10.1007/s00438-024-02158-x.

DOI:10.1007/s00438-024-02158-x
PMID:38972030
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11955097/
Abstract

BACKGROUND

A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations.

RESULTS

Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution.

CONCLUSION

Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.

摘要

背景

大量具有挑战性的医学相关基因(CMRGs)位于人类基因组的复杂或高度重复区域,这阻碍了使用下一代测序技术对遗传变异进行全面表征。在这项研究中,我们采用了长读测序技术,该技术广泛用于研究复杂基因组区域,以表征 41 名来自 19 个全球人群的个体中 370 个 CMRG 中的遗传改变,包括短变异(单核苷酸变异和短插入和缺失)和拷贝数变异。

结果

我们的分析显示 CMRG 中存在高水平的遗传变异,其中 68.73%表现出拷贝数变异,65.20%含有可能破坏个体蛋白功能的短变异。这些变异会影响药物基因组学、遗传疾病易感性和其他临床结果。我们观察到 CMRG 变异在人群之间存在显著差异,与来自其他大陆的样本相比,非洲裔个体携带的拷贝数变异和短变异数量最多。值得注意的是,15.79%至 33.96%的短变异只能通过长读测序来检测。虽然 T2T-CHM13 参考基因组显著提高了 CMRG 区域的组装,从而有助于在这些区域中检测变异,但某些区域仍缺乏分辨率。

结论

我们的研究结果为未来的临床和药物遗传学研究提供了重要参考,突出了在参考基因组中全面代表全球遗传多样性和改进变异调用技术的必要性,以充分解析医学相关基因。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/12c7f9e5a94a/nihms-2063759-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/ab7ef6f7667b/nihms-2063759-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/11953cda9a0c/nihms-2063759-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/bb9a7b384563/nihms-2063759-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/12c7f9e5a94a/nihms-2063759-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/ab7ef6f7667b/nihms-2063759-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/11953cda9a0c/nihms-2063759-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/bb9a7b384563/nihms-2063759-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0c1a/11955097/12c7f9e5a94a/nihms-2063759-f0004.jpg

相似文献

1
Unveiling novel genetic variants in 370 challenging medically relevant genes using the long read sequencing data of 41 samples from 19 global populations.利用来自全球 19 个群体的 41 个样本的长读测序数据,揭示 370 个具有挑战性的医学相关基因中的新型遗传变异。
Mol Genet Genomics. 2024 Jul 7;299(1):65. doi: 10.1007/s00438-024-02158-x.
2
Deciphering new insights into copy number variations as drivers of genomic diversity and adaptation in farm animal species.解析家畜物种中作为基因组多样性和适应性驱动因素的拷贝数变异的新见解。
Gene. 2025 Mar 5;939:149159. doi: 10.1016/j.gene.2024.149159. Epub 2024 Dec 11.
3
HiFi long-read genomes for difficult-to-detect, clinically relevant variants.用于检测难以发现的临床相关变异的高保真长读长基因组。
Am J Hum Genet. 2025 Feb 6;112(2):450-456. doi: 10.1016/j.ajhg.2024.12.013. Epub 2025 Jan 13.
4
Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index.通过将已知遗传变异纳入 minimap2 索引来提高全基因组测序数据中 SNV 的识别能力。
BMC Bioinformatics. 2024 Jul 13;25(1):238. doi: 10.1186/s12859-024-05862-y.
5
Innovative approach for high-throughput exploiting sex-specific markers in Japanese parrotfish Oplegnathus fasciatus.创新性方法用于高通量开发褐菖鲉 Oplegnathus fasciatus 的性别特异性标记。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae045.
6
Systematic analysis of paralogous regions in 41,755 exomes uncovers clinically relevant variation.对 41755 个外显子组中的直系同源区域进行系统分析揭示了具有临床意义的变异。
Nat Commun. 2023 Oct 27;14(1):6845. doi: 10.1038/s41467-023-42531-9.
7
Short and long-read genome sequencing methodologies for somatic variant detection; genomic analysis of a patient with diffuse large B-cell lymphoma.短读长读基因组测序方法用于体细胞变异检测;弥漫性大 B 细胞淋巴瘤患者的基因组分析。
Sci Rep. 2021 Mar 19;11(1):6408. doi: 10.1038/s41598-021-85354-8.
8
Performance Characteristics of a Targeted Sequencing Platform for Simultaneous Detection of Single Nucleotide Variants, Insertions/Deletions, Copy Number Alterations, and Gene Fusions in Cancer Genome.用于同时检测癌症基因组中单核苷酸变异、插入/缺失、拷贝数改变和基因融合的靶向测序平台的性能特征
Arch Pathol Lab Med. 2020 Dec 1;144(12):1535-1546. doi: 10.5858/arpa.2019-0162-OA.
9
Copy number variations in the genome of the Qatari population.卡塔尔人群基因组中的拷贝数变异
BMC Genomics. 2015 Oct 22;16:834. doi: 10.1186/s12864-015-1991-5.
10
Comparison of mitochondrial DNA variants detection using short- and long-read sequencing.短读长读测序检测线粒体 DNA 变异的比较。
J Hum Genet. 2019 Nov;64(11):1107-1116. doi: 10.1038/s10038-019-0654-9. Epub 2019 Aug 13.

本文引用的文献

1
Long-read sequencing of 945 Han individuals identifies structural variants associated with phenotypic diversity and disease susceptibility.对945名汉族个体进行的长读长测序确定了与表型多样性和疾病易感性相关的结构变异。
Nat Commun. 2025 Feb 10;16(1):1494. doi: 10.1038/s41467-025-56661-9.
2
Symphonizing pileup and full-alignment for deep learning-based long-read variant calling.基于深度学习的长读变异调用的交响乐堆积和全对齐。
Nat Comput Sci. 2022 Dec;2(12):797-803. doi: 10.1038/s43588-022-00387-x. Epub 2022 Dec 19.
3
Lipoprotein(a) beyond the kringle IV repeat polymorphism: The complexity of genetic variation in the LPA gene.
载脂蛋白(a)除了kringle IV 重复多态性:LPA 基因遗传变异的复杂性。
Atherosclerosis. 2022 May;349:17-35. doi: 10.1016/j.atherosclerosis.2022.04.003.
4
Pharmacogenomics: the low-hanging fruit in the personalized medicine tree.药物基因组学:个性化医疗之树上低垂的果实。
Hum Genet. 2022 Jun;141(6):1109-1111. doi: 10.1007/s00439-022-02456-7.
5
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
6
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
7
Segmental duplications and their variation in a complete human genome.人类全基因组中的串联重复序列及其变异。
Science. 2022 Apr;376(6588):eabj6965. doi: 10.1126/science.abj6965. Epub 2022 Apr 1.
8
Complete genomic and epigenetic maps of human centromeres.人类着丝粒的完整基因组和表观基因组图谱。
Science. 2022 Apr;376(6588):eabl4178. doi: 10.1126/science.abl4178. Epub 2022 Apr 1.
9
Curated variation benchmarks for challenging medically relevant autosomal genes.针对具有挑战性的医学相关常染色体基因的精选变异基准。
Nat Biotechnol. 2022 May;40(5):672-680. doi: 10.1038/s41587-021-01158-1. Epub 2022 Feb 7.
10
Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads.使用 PEPPER-Margin-DeepVariant 进行单体型感知变异调用可实现纳米孔长读段的高精度。
Nat Methods. 2021 Nov;18(11):1322-1332. doi: 10.1038/s41592-021-01299-w. Epub 2021 Nov 1.