• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

长读长测序时代结构变异的群体规模基因分型

Population-scale genotyping of structural variation in the era of long-read sequencing.

作者信息

Quan Cheng, Lu Hao, Lu Yiming, Zhou Gangqiao

机构信息

Department of Genetics & Integrative Omics, State Key Laboratory of Proteomics, National Center for Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 100850, PR China.

Hebei University, Baoding, Hebei Province 071002, PR China.

出版信息

Comput Struct Biotechnol J. 2022 May 27;20:2639-2647. doi: 10.1016/j.csbj.2022.05.047. eCollection 2022.

DOI:10.1016/j.csbj.2022.05.047
PMID:35685364
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9163579/
Abstract

Population-scale studies of structural variation (SV) are growing rapidly worldwide with the development of long-read sequencing technology, yielding a considerable number of novel SVs and complete gap-closed genome assemblies. Herein, we highlight recent studies using a hybrid sequencing strategy and present the challenges toward large-scale genotyping for SVs due to the reference bias. Genotyping SVs at a population scale remains challenging, which severely impacts genotype-based population genetic studies or genome-wide association studies of complex diseases. We summarize academic efforts to improve genotype quality through linear or graph representations of reference and alternative alleles. Graph-based genotypers capable of integrating diverse genetic information are effectively applied to large and diverse cohorts, contributing to unbiased downstream analysis. Meanwhile, there is still an urgent need in this field for efficient tools to construct complex graphs and perform sequence-to-graph alignments.

摘要

随着长读长测序技术的发展,全球范围内针对结构变异(SV)的群体规模研究正在迅速增加,产生了大量新的SV以及完整的缺口封闭基因组组装。在此,我们重点介绍了近期使用混合测序策略的研究,并指出了由于参考偏差导致的大规模SV基因分型所面临的挑战。在群体规模上对SV进行基因分型仍然具有挑战性,这严重影响了基于基因型的群体遗传学研究或复杂疾病的全基因组关联研究。我们总结了通过参考等位基因和替代等位基因的线性或图形表示来提高基因型质量的学术努力。能够整合多种遗传信息的基于图形的基因分型器被有效地应用于大型多样的队列研究,有助于进行无偏的下游分析。与此同时,该领域仍然迫切需要高效的工具来构建复杂图形并进行序列到图形的比对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fcc/9163579/c69a999b10f1/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fcc/9163579/c69a999b10f1/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4fcc/9163579/c69a999b10f1/gr1.jpg

相似文献

1
Population-scale genotyping of structural variation in the era of long-read sequencing.长读长测序时代结构变异的群体规模基因分型
Comput Struct Biotechnol J. 2022 May 27;20:2639-2647. doi: 10.1016/j.csbj.2022.05.047. eCollection 2022.
2
SVJedi-graph: improving the genotyping of close and overlapping structural variants with long reads using a variation graph.SVJedi-graph:使用变异图提高长读长对紧密和重叠结构变异的基因分型。
Bioinformatics. 2023 Jun 30;39(39 Suppl 1):i270-i278. doi: 10.1093/bioinformatics/btad237.
3
SVJedi: genotyping structural variations with long reads.使用长读长进行基因分型结构变异。
Bioinformatics. 2020 Nov 1;36(17):4568-4575. doi: 10.1093/bioinformatics/btaa527.
4
Paragraph: a graph-based structural variant genotyper for short-read sequence data.段落:基于图的短读序列数据结构变异基因分型器。
Genome Biol. 2019 Dec 19;20(1):291. doi: 10.1186/s13059-019-1909-7.
5
Genotyping structural variants in pangenome graphs using the vg toolkit.使用vg工具包对泛基因组图谱中的结构变异进行基因分型。
Genome Biol. 2020 Feb 12;21(1):35. doi: 10.1186/s13059-020-1941-7.
6
NPSV: A simulation-driven approach to genotyping structural variants in whole-genome sequencing data.NPSV:一种基于模拟的全基因组测序数据分析中结构变异基因分型方法。
Gigascience. 2021 Jul 1;10(7). doi: 10.1093/gigascience/giab046.
7
Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。
Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.
8
The promise and challenges of characterizing genome-wide structural variants: A case study in a critically endangered parrot.全基因组结构变异特征分析的前景与挑战:以一种极度濒危鹦鹉为例的研究
Mol Ecol Resour. 2023 Mar 14. doi: 10.1111/1755-0998.13783.
9
PanSVR: Pan-Genome Augmented Short Read Realignment for Sensitive Detection of Structural Variations.PanSVR:用于结构变异灵敏检测的全基因组增强短读长重比对
Front Genet. 2021 Aug 19;12:731515. doi: 10.3389/fgene.2021.731515. eCollection 2021.
10
MoMI-G: modular multi-scale integrated genome graph browser.MoMI-G:模块化多尺度综合基因组图谱浏览器。
BMC Bioinformatics. 2019 Nov 5;20(1):548. doi: 10.1186/s12859-019-3145-2.

引用本文的文献

1
Long-Read Whole-Genome Sequencing as a Tool for Variant Detection in Inherited Retinal Dystrophies.长读长全基因组测序作为遗传性视网膜营养不良中变异检测的工具。
Int J Mol Sci. 2025 Apr 18;26(8):3825. doi: 10.3390/ijms26083825.
2
CCS-Consensuser: A Haplotype-Aware Consensus Generator for PacBio Amplicon Sequences.CCS共识生成器:一种用于PacBio扩增子序列的单倍型感知共识生成器。
Mol Ecol Resour. 2025 Oct;25(7):e14113. doi: 10.1111/1755-0998.14113. Epub 2025 Apr 4.
3
SVLearn: a dual-reference machine learning approach enables accurate cross-species genotyping of structural variants.

本文引用的文献

1
The motif composition of variable number tandem repeats impacts gene expression.可变数串联重复的基序组成影响基因表达。
Genome Res. 2023 Apr;33(4):511-524. doi: 10.1101/gr.276768.122. Epub 2023 Apr 10.
2
Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2.使用 Cuttlefish 2 实现可扩展、超快速和低内存消耗的紧凑 de Bruijn 图构建。
Genome Biol. 2022 Sep 8;23(1):190. doi: 10.1186/s13059-022-02743-6.
3
High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios.
SVLearn:一种双参考机器学习方法可实现结构变异的准确跨物种基因分型。
Nat Commun. 2025 Mar 11;16(1):2406. doi: 10.1038/s41467-025-57756-z.
4
Genotype and phenotype data standardization, utilization and integration in the big data era for agricultural sciences.基因型和表型数据在农业科学大数据时代的标准化、利用和整合。
Database (Oxford). 2023 Dec 11;2023. doi: 10.1093/database/baad088.
5
PhenoSV: interpretable phenotype-aware model for the prioritization of genes affected by structural variants.PhenoSV:一种可解释的表型感知模型,用于优先考虑受结构变异影响的基因。
Nat Commun. 2023 Nov 28;14(1):7805. doi: 10.1038/s41467-023-43651-y.
6
The genomics and evolution of inter-sexual mimicry and female-limited polymorphisms in damselflies.雌雄二态性模拟和雌性限性多态性在蜻蜓中的基因组学和进化。
Nat Ecol Evol. 2024 Jan;8(1):83-97. doi: 10.1038/s41559-023-02243-1. Epub 2023 Nov 6.
7
Human Pangenomics: Promises and Challenges of a Distributed Genomic Reference.人类泛基因组学:分布式基因组参考的前景与挑战
Life (Basel). 2023 Jun 9;13(6):1360. doi: 10.3390/life13061360.
8
Chimera: The spoiler in multiple displacement amplification.嵌合体:多重置换扩增中的干扰因素
Comput Struct Biotechnol J. 2023 Feb 23;21:1688-1696. doi: 10.1016/j.csbj.2023.02.034. eCollection 2023.
9
Recent advances and current challenges in population genomics of structural variation in animals and plants.动植物结构变异群体基因组学的最新进展与当前挑战
Front Genet. 2022 Nov 29;13:1060898. doi: 10.3389/fgene.2022.1060898. eCollection 2022.
对扩展的 1000 基因组项目队列进行高覆盖率全基因组测序,包括 602 个三核苷酸重复序列。
Cell. 2022 Sep 1;185(18):3426-3440.e19. doi: 10.1016/j.cell.2022.08.004.
4
Minos: variant adjudication and joint genotyping of cohorts of bacterial genomes.米诺斯:细菌基因组队列的变异裁决和联合基因分型。
Genome Biol. 2022 Jul 5;23(1):147. doi: 10.1186/s13059-022-02714-x.
5
The Human Pangenome Project: a global resource to map genomic diversity.人类泛基因组计划:绘制基因组多样性图谱的全球资源。
Nature. 2022 Apr;604(7906):437-446. doi: 10.1038/s41586-022-04601-8. Epub 2022 Apr 20.
6
Searching thousands of genomes to classify somatic and novel structural variants using STIX.利用 STIX 搜索数千个基因组以对体细胞和新型结构变体进行分类。
Nat Methods. 2022 Apr;19(4):445-448. doi: 10.1038/s41592-022-01423-4. Epub 2022 Apr 8.
7
A complete reference genome improves analysis of human genetic variation.完整的参考基因组提高了人类遗传变异分析的能力。
Science. 2022 Apr;376(6588):eabl3533. doi: 10.1126/science.abl3533. Epub 2022 Apr 1.
8
The complete sequence of a human genome.人类基因组的完整序列。
Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.
9
Towards accurate and reliable resolution of structural variants for clinical diagnosis.致力于实现结构变异的准确可靠解析,以用于临床诊断。
Genome Biol. 2022 Mar 3;23(1):68. doi: 10.1186/s13059-022-02636-8.
10
Pangenomics enables genotyping of known structural variants in 5202 diverse genomes.泛基因组学能够对 5202 个不同基因组中的已知结构变异进行基因分型。
Science. 2021 Dec 17;374(6574):abg8871. doi: 10.1126/science.abg8871.