• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对 90 个汉族个体的全基因组深度测序。

Deep whole-genome sequencing of 90 Han Chinese genomes.

机构信息

BGI-Shenzhen, Build 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China.

BGI Genomics, BGI-Shenzhen, Building NO. 7, BGI Park, No. 21 Hongan 3rd Street, Yantian District, Shenzhen, 518083, China.

出版信息

Gigascience. 2017 Sep 1;6(9):1-7. doi: 10.1093/gigascience/gix067.

DOI:10.1093/gigascience/gix067
PMID:28938720
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5603764/
Abstract

Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects.

摘要

下一代测序技术为人类遗传信息提供了高分辨率的洞察力。然而,由于测序成本高昂,之前的研究主要集中在低覆盖率数据上。尽管 1000 基因组计划和单倍型参考联盟都为推断提供了强大的参考面板,但低频和新变体仍然难以在低覆盖率数据的基础上准确发现和调用。深度测序为解决低频和新变体的问题提供了最佳解决方案。虽然外显子组测序也是外显子区域的可行选择,但它不能涵盖非编码区域,有时会导致重要的因果变体缺失。对于汉族人群,大多数变体都是基于 1000 基因组计划的低覆盖率数据发现的。然而,对于任何人群来说,高覆盖率的全基因组测序数据都是有限的,大量低频、人群特异性变体仍然未被描述。我们对来自 1000 基因组计划样本的 90 个无亲缘关系的中国人进行了深度约为×80 的全基因组测序,其中包括 45 个北方汉族人和 45 个南方汉族人。这些 90 个人中有 83 个已经被 1000 基因组计划测序过。我们从这 90 个样本中鉴定出了 12568804 个单核苷酸多态性、2074210 个短插入缺失和 26142 个结构变异。与 1000 基因组计划中的汉族数据相比,我们发现了 7000629 个低频新变体(定义为次要等位基因频率<5%),包括 5813503 个单核苷酸多态性、1169199 个插入缺失和 17927 个结构变异。使用深度测序数据,我们构建了一个大大扩展的汉族基因组遗传变异谱。与 1000 基因组计划相比,这些汉族深度测序数据增强了对大量低频新变体的描述。这将是促进中国遗传学研究和医学发展的宝贵资源。此外,它将为 1000 基因组计划以及其他人类基因组计划提供有价值的补充。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/72d695dfa842/gix067fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/db91e81463f7/gix067fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/0b590d2368c2/gix067fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/e83334bbcb59/gix067fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/72d695dfa842/gix067fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/db91e81463f7/gix067fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/0b590d2368c2/gix067fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/e83334bbcb59/gix067fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/5603764/72d695dfa842/gix067fig4.jpg

相似文献

1
Deep whole-genome sequencing of 90 Han Chinese genomes.对 90 个汉族个体的全基因组深度测序。
Gigascience. 2017 Sep 1;6(9):1-7. doi: 10.1093/gigascience/gix067.
2
Whole Genome Analyses of Chinese Population and De Novo Assembly of A Northern Han Genome.中国人全基因组分析与北方汉族个体从头组装基因组。
Genomics Proteomics Bioinformatics. 2019 Jun;17(3):229-247. doi: 10.1016/j.gpb.2019.07.002. Epub 2019 Sep 5.
3
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
4
NyuWa Genome resource: A deep whole-genome sequencing-based variation profile and reference panel for the Chinese population.女娲基因组资源:中国人种的深度全基因组测序变异图谱和参考面板。
Cell Rep. 2021 Nov 16;37(7):110017. doi: 10.1016/j.celrep.2021.110017.
5
Rare variants discovery by extensive whole-genome sequencing of the Han Chinese population in Taiwan: Applications to cardiovascular medicine.通过对台湾汉族人群进行广泛的全基因组测序发现罕见变异:在心血管医学中的应用。
J Adv Res. 2020 Dec 7;30:147-158. doi: 10.1016/j.jare.2020.12.003. eCollection 2021 May.
6
Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection.利用深度测序和高效检测进行全人群逆转座子插入多态性的抽样研究。
Gigascience. 2017 Sep 1;6(9):1-11. doi: 10.1093/gigascience/gix066.
7
Misassembly of long reads undermines de novo-assembled ethnicity-specific genomes: validation in a Chinese Han population.长读段的组装错误会破坏从头组装的特定族群基因组:在中国汉族人群中的验证。
Hum Genet. 2019 Jul;138(7):757-769. doi: 10.1007/s00439-019-02032-6. Epub 2019 Jun 5.
8
KoVariome: Korean National Standard Reference Variome database of whole genomes with comprehensive SNV, indel, CNV, and SV analyses.KoVariome:韩国全基因组标准参考变异组数据库,包含全面的单核苷酸变异、插入缺失、拷贝数变异和结构变异分析。
Sci Rep. 2018 Apr 4;8(1):5677. doi: 10.1038/s41598-018-23837-x.
9
Deep sequencing of Danish Holstein dairy cattle for variant detection and insight into potential loss-of-function variants in protein coding genes.对丹麦荷斯坦奶牛进行深度测序,以检测变异并深入了解蛋白质编码基因中潜在的功能丧失变异。
BMC Genomics. 2015 Dec 9;16:1043. doi: 10.1186/s12864-015-2249-y.
10
Genotype imputation for Han Chinese population using Haplotype Reference Consortium as reference.基于 Haplotype Reference Consortium 进行中国汉族人群基因型推断。
Hum Genet. 2018 Jul;137(6-7):431-436. doi: 10.1007/s00439-018-1894-z. Epub 2018 May 31.

引用本文的文献

1
Long-read sequencing of 945 Han individuals identifies structural variants associated with phenotypic diversity and disease susceptibility.对945名汉族个体进行的长读长测序确定了与表型多样性和疾病易感性相关的结构变异。
Nat Commun. 2025 Feb 10;16(1):1494. doi: 10.1038/s41467-025-56661-9.
2
An effective strategy for assembling the sex-limited chromosome.组装限性染色体的有效策略。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae015.
3
Polymorphic USP8 allele promotes Parkinson's disease by inducing the accumulation of α-synuclein through deubiquitination.

本文引用的文献

1
A reference panel of 64,976 haplotypes for genotype imputation.用于基因型插补的64976个单倍型参考面板。
Nat Genet. 2016 Oct;48(10):1279-83. doi: 10.1038/ng.3643. Epub 2016 Aug 22.
2
An integrated map of structural variation in 2,504 human genomes.2504个人类基因组结构变异的整合图谱。
Nature. 2015 Oct 1;526(7571):75-81. doi: 10.1038/nature15394.
3
A global reference for human genetic variation.人类遗传变异的全球参考。
多态性 USP8 等位基因通过去泛素化诱导α-突触核蛋白积累促进帕金森病。
Cell Mol Life Sci. 2023 Nov 19;80(12):363. doi: 10.1007/s00018-023-05006-0.
4
Etiological roles of core promoter variation in triple-negative breast cancer.核心启动子变异在三阴性乳腺癌中的病因学作用
Genes Dis. 2022 Feb 24;10(1):228-238. doi: 10.1016/j.gendis.2022.01.003. eCollection 2023 Jan.
5
Multi-ancestry genome-wide association analyses identify novel genetic mechanisms in rheumatoid arthritis.多民族全基因组关联分析确定类风湿关节炎的新遗传机制。
Nat Genet. 2022 Nov;54(11):1640-1651. doi: 10.1038/s41588-022-01213-w. Epub 2022 Nov 4.
6
Core promoter in TNBC is highly mutated with rich ethnic signature.三阴性乳腺癌的核心启动子高度突变,具有丰富的种族特征。
Brief Funct Genomics. 2023 Jan 20;22(1):9-19. doi: 10.1093/bfgp/elac035.
7
Pangenomic analysis of Chinese gastric cancer.泛基因组分析中国胃癌。
Nat Commun. 2022 Sep 15;13(1):5412. doi: 10.1038/s41467-022-33073-7.
8
Population-scale genotyping of structural variation in the era of long-read sequencing.长读长测序时代结构变异的群体规模基因分型
Comput Struct Biotechnol J. 2022 May 27;20:2639-2647. doi: 10.1016/j.csbj.2022.05.047. eCollection 2022.
9
Robust and accurate estimation of paralog-specific copy number for duplicated genes using whole-genome sequencing.利用全基因组测序对重复基因进行稳健、准确的直系同源基因拷贝数估计。
Nat Commun. 2022 Jun 9;13(1):3221. doi: 10.1038/s41467-022-30930-3.
10
Prioritising positively selected variants in whole-genome sequencing data using FineMAV.使用 FineMAV 对全基因组测序数据中的正选择变异进行优先级排序。
BMC Bioinformatics. 2021 Dec 18;22(1):604. doi: 10.1186/s12859-021-04506-9.
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
4
The UK10K project identifies rare variants in health and disease.英国万人基因组计划识别健康与疾病中的罕见变异。
Nature. 2015 Oct 1;526(7571):82-90. doi: 10.1038/nature14962. Epub 2015 Sep 14.
5
Assessing structural variation in a personal genome-towards a human reference diploid genome.评估个人基因组中的结构变异——迈向人类参考二倍体基因组
BMC Genomics. 2015 Apr 11;16(1):286. doi: 10.1186/s12864-015-1479-3.
6
Rare variant association studies: considerations, challenges and opportunities.罕见变异关联研究:考量、挑战与机遇
Genome Med. 2015 Feb 23;7(1):16. doi: 10.1186/s13073-015-0138-2. eCollection 2015.
7
Variant calling in low-coverage whole genome sequencing of a Native American population sample.对美洲原住民人群样本进行低覆盖度全基因组测序中的变异调用。
BMC Genomics. 2014 Jan 30;15:85. doi: 10.1186/1471-2164-15-85.
8
SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.SOAPdenovo2:一种经验丰富的、内存效率高的短读长从头组装器。
Gigascience. 2012 Dec 27;1(1):18. doi: 10.1186/2047-217X-1-18.
9
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
10
Structural variation in two human genomes mapped at single-nucleotide resolution by whole genome de novo assembly.通过全基因组从头组装,以单核苷酸分辨率绘制的两个人类基因组的结构变异。
Nat Biotechnol. 2011 Jul 24;29(8):723-30. doi: 10.1038/nbt.1904.