• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向京族越南人参考基因组:利用长读长测序和光学图谱构建从头基因组组装

Toward a Kinh Vietnamese Reference Genome: Constructing a De Novo Genome Assembly Using Long-Read Sequencing and Optical Mapping.

作者信息

Dung Le Thi, Lam Le Tung, Trang Nguyen Hong, Anh Nguyen Vu Hung, Nam Nguyen Ngoc, Nhung Doan Thi, Linh Tran Huyen, Giang Le Ngoc, Ha Hoang, Huy Nguyen Quang, Hai Truong Nam

机构信息

Institute of Biology, Vietnam Academy of Science and Technology (VAST), Hanoi 10072, Vietnam.

Department of Life Sciences, University of Science and Technology of Hanoi (USTH), Vietnam Academy of Science and Technology (VAST), Hanoi 10072, Vietnam.

出版信息

Genes (Basel). 2025 Apr 29;16(5):536. doi: 10.3390/genes16050536.

DOI:10.3390/genes16050536
PMID:40428358
Abstract

Population-specific reference genomes are essential for improving the accuracy and reliability of genomic analyses across diverse human populations. Although Vietnam ranks as the 16th most populous country in the world, with more than 86% of its population identifying as Kinh, studies specifically focusing on the Kinh Vietnamese reference genome remain scarce. Therefore, constructing a Kinh Vietnamese reference genome is valuable in the genetic research of Vietnamese. In this study, we combined PacBio long-read sequencing and Bionano optical mapping data to generate a de novo assembly of a Kinh Vietnamese genome (VHG), which was subsequently polished using multiple Kinh Vietnamese short-read whole-genome sequences (WGSs). The final assembly, named VHG1.2, comprised 3.22 gigabase pairs of high-quality sequence data, demonstrating high accuracy (QV: 48), completeness (BUSCO: 92%), and continuity (295 super scaffolds, super scaffold N50: 50 Kbp). Using multiple bioinformatic tools for variant calling, we observed significant variants when the population-specific reference VHG1.2 was used compared to the standard reference genome hg38. Overall, our genome assembly demonstrates the advantages of a long-read hybrid sequencing approach for de novo assembly and highlights the benefit of using population-specific reference genomes in population genomic analysis.

摘要

特定人群的参考基因组对于提高不同人类群体基因组分析的准确性和可靠性至关重要。尽管越南是世界上人口第16多的国家,超过86%的人口为京族,但专门针对京族越南人参考基因组的研究仍然很少。因此,构建京族越南人参考基因组对越南的基因研究具有重要价值。在本研究中,我们结合了PacBio长读长测序和Bionano光学图谱数据,对京族越南人基因组(VHG)进行了从头组装,随后使用多个京族越南人短读长全基因组序列(WGS)进行了优化。最终组装的名为VHG1.2的基因组包含32.2亿碱基对的高质量序列数据,显示出高准确性(QV:48)、完整性(BUSCO:92%)和连续性(295个超级支架,超级支架N50:50 Kbp)。使用多种生物信息学工具进行变异检测时,与标准参考基因组hg38相比,我们发现使用特定人群参考基因组VHG1.2时存在显著变异。总体而言,我们的基因组组装展示了长读长混合测序方法在从头组装中的优势,并突出了在群体基因组分析中使用特定人群参考基因组的好处。

相似文献

1
Toward a Kinh Vietnamese Reference Genome: Constructing a De Novo Genome Assembly Using Long-Read Sequencing and Optical Mapping.迈向京族越南人参考基因组:利用长读长测序和光学图谱构建从头基因组组装
Genes (Basel). 2025 Apr 29;16(5):536. doi: 10.3390/genes16050536.
2
A Comparison of Structural Variant Calling from Short-Read and Nanopore-Based Whole-Genome Sequencing Using Optical Genome Mapping as a Benchmark.基于光学基因组图谱作为基准的短读长和纳米孔全基因组测序的结构变异调用比较。
Genes (Basel). 2024 Jul 16;15(7):925. doi: 10.3390/genes15070925.
3
A hybrid approach for de novo human genome sequence assembly and phasing.一种用于从头进行人类基因组序列组装和定相的混合方法。
Nat Methods. 2016 Jul;13(7):587-90. doi: 10.1038/nmeth.3865. Epub 2016 May 9.
4
A Vietnamese human genetic variation database.越南人类遗传变异数据库。
Hum Mutat. 2019 Oct;40(10):1664-1675. doi: 10.1002/humu.23835. Epub 2019 Jul 3.
5
Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus).灰鼠狐猴(Microcebus murinus)的混合从头基因组组装和着丝粒特征分析。
BMC Biol. 2017 Nov 16;15(1):110. doi: 10.1186/s12915-017-0439-6.
6
Chromosome-scale assembly comparison of the Korean Reference Genome KOREF from PromethION and PacBio with Hi-C mapping information.利用 Hi-C 图谱信息对 PromethION 和 PacBio 测序的韩国参考基因组 KOREF 进行染色体水平组装比较。
Gigascience. 2019 Dec 1;8(12). doi: 10.1093/gigascience/giz125.
7
De novo genome assembly of a Han Chinese male and genome-wide detection of structural variants using Oxford Nanopore sequencing.利用牛津纳米孔测序对一名汉族男性进行从头基因组组装和全基因组结构变异检测。
Mol Genet Genomics. 2020 Jul;295(4):871-876. doi: 10.1007/s00438-020-01672-y. Epub 2020 Apr 9.
8
Genome-wide association and polygenic risk score estimation of type 2 diabetes mellitus in Kinh Vietnamese-A pilot study.越南京族人群 2 型糖尿病的全基因组关联和多基因风险评分估计:一项初步研究。
J Cell Mol Med. 2024 Jul;28(13):e18526. doi: 10.1111/jcmm.18526.
9
Misassembly of long reads undermines de novo-assembled ethnicity-specific genomes: validation in a Chinese Han population.长读段的组装错误会破坏从头组装的特定族群基因组:在中国汉族人群中的验证。
Hum Genet. 2019 Jul;138(7):757-769. doi: 10.1007/s00439-019-02032-6. Epub 2019 Jun 5.
10
Genetic variation and the de novo assembly of human genomes.人类基因组的遗传变异与从头组装
Nat Rev Genet. 2015 Nov;16(11):627-40. doi: 10.1038/nrg3933. Epub 2015 Oct 7.

本文引用的文献

1
Haplotype-resolved Chinese male genome assembly based on high-fidelity sequencing.基于高保真测序的单倍型解析中国男性基因组组装
Fundam Res. 2022 Mar 2;2(6):946-953. doi: 10.1016/j.fmre.2022.02.005. eCollection 2022 Nov.
2
Long-read sequencing and optical mapping generates near T2T assemblies that resolves a centromeric translocation.长读测序和光学作图生成接近 T2T 的组装,解决了着丝粒易位问题。
Sci Rep. 2024 Apr 18;14(1):9000. doi: 10.1038/s41598-024-59683-3.
3
Rapid genomic sequencing for genetic disease diagnosis and therapy in intensive care units: a review.
重症监护病房中用于遗传疾病诊断和治疗的快速基因组测序:综述
NPJ Genom Med. 2024 Feb 27;9(1):17. doi: 10.1038/s41525-024-00404-0.
4
Origin matters: Using a local reference genome improves measures in population genomics.起源很重要:使用本地参考基因组可提高群体基因组学中的度量指标。
Mol Ecol Resour. 2023 Oct;23(7):1706-1723. doi: 10.1111/1755-0998.13838. Epub 2023 Jul 25.
5
The complete and fully-phased diploid genome of a male Han Chinese.一位男性汉族个体的完整、全面二倍体基因组。
Cell Res. 2023 Oct;33(10):745-761. doi: 10.1038/s41422-023-00849-5. Epub 2023 Jul 14.
6
Improving variant calling using population data and deep learning.利用群体数据和深度学习提高变异calling 的准确性。
BMC Bioinformatics. 2023 May 12;24(1):197. doi: 10.1186/s12859-023-05294-0.
7
Cue: a deep-learning framework for structural variant discovery and genotyping.线索:一种用于结构变异发现和基因分型的深度学习框架。
Nat Methods. 2023 Apr;20(4):559-568. doi: 10.1038/s41592-023-01799-x. Epub 2023 Mar 23.
8
The first gapless, reference-quality, fully annotated genome from a Southern Han Chinese individual.首个无缺口、参考质量、完整注释的中国南方汉族个体基因组。
G3 (Bethesda). 2023 Mar 9;13(3). doi: 10.1093/g3journal/jkac321.
9
Comparison of calling pipelines for whole genome sequencing: an empirical study demonstrating the importance of mapping and alignment.比较全基因组测序的调用管道:一项实证研究表明映射和比对的重要性。
Sci Rep. 2022 Dec 13;12(1):21502. doi: 10.1038/s41598-022-26181-3.
10
Analyzing the Korean reference genome with meta-imputation increased the imputation accuracy and spectrum of rare variants in the Korean population.使用元填充分析韩国参考基因组可提高韩国人群中罕见变异的填充准确性和范围。
Front Genet. 2022 Nov 24;13:1008646. doi: 10.3389/fgene.2022.1008646. eCollection 2022.