• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。

Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.

机构信息

Animal Genomics, ETH Zürich, Zürich, Switzerland.

出版信息

Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.

DOI:10.1186/s13059-020-02105-0
PMID:32718320
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7385871/
Abstract

BACKGROUND

The current bovine genomic reference sequence was assembled from a Hereford cow. The resulting linear assembly lacks diversity because it does not contain allelic variation, a drawback of linear references that causes reference allele bias. High nucleotide diversity and the separation of individuals by hundreds of breeds make cattle ideally suited to investigate the optimal composition of variation-aware references.

RESULTS

We augment the bovine linear reference sequence (ARS-UCD1.2) with variants filtered for allele frequency in dairy (Brown Swiss, Holstein) and dual-purpose (Fleckvieh, Original Braunvieh) cattle breeds to construct either breed-specific or pan-genome reference graphs using the vg toolkit. We find that read mapping is more accurate to variation-aware than linear references if pre-selected variants are used to construct the genome graphs. Graphs that contain random variants do not improve read mapping over the linear reference sequence. Breed-specific augmented and pan-genome graphs enable almost similar mapping accuracy improvements over the linear reference. We construct a whole-genome graph that contains the Hereford-based reference sequence and 14 million alleles that have alternate allele frequency greater than 0.03 in the Brown Swiss cattle breed. Our novel variation-aware reference facilitates accurate read mapping and unbiased sequence variant genotyping for SNPs and Indels.

CONCLUSIONS

We develop the first variation-aware reference graph for an agricultural animal ( https://doi.org/10.5281/zenodo.3759712 ). Our novel reference structure improves sequence read mapping and variant genotyping over the linear reference. Our work is a first step towards the transition from linear to variation-aware reference structures in species with high genetic diversity and many sub-populations.

摘要

背景

目前的牛基因组参考序列是由一头赫里福德牛组装而成的。由此产生的线性组装缺乏多样性,因为它不包含等位基因变异,这是线性参考的一个缺点,会导致参考等位基因偏倚。高核苷酸多样性和个体之间的数百个品种的分离使得牛非常适合研究具有变异意识的参考序列的最佳组成。

结果

我们使用 vg 工具包,用在乳用(棕色瑞士牛、荷斯坦牛)和兼用(弗莱维赫牛、原始勃艮第牛)牛品种中过滤等位基因频率的变体来扩充牛的线性参考序列(ARS-UCD1.2),构建特定品种或泛基因组参考图谱。我们发现,如果使用预先选择的变体来构建基因组图谱,那么读映射比线性参考序列更准确地反映变异。包含随机变体的图谱并不能提高线性参考序列的读映射准确性。特定品种的扩充和泛基因组图谱几乎可以提高线性参考序列的映射准确性。我们构建了一个包含赫里福德牛参考序列和 1400 万个等位基因的全基因组图谱,这些等位基因在棕色瑞士牛品种中的等位基因频率大于 0.03。我们的新变异感知参考有助于 SNP 和 Indel 的准确读映射和无偏序列变异基因分型。

结论

我们为农业动物开发了第一个具有变异意识的参考图谱(https://doi.org/10.5281/zenodo.3759712)。我们的新参考结构提高了序列读映射和变体基因分型的准确性,优于线性参考。我们的工作是朝着在具有高度遗传多样性和许多亚群体的物种中从线性参考结构向具有变异意识的参考结构过渡的第一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/a2bf2b07d9b0/13059_2020_2105_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/bf214671b2d5/13059_2020_2105_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/df004aa39228/13059_2020_2105_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/d4084a822ef5/13059_2020_2105_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/3d64bba5b2c3/13059_2020_2105_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/8f5b2d5a43a6/13059_2020_2105_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/64376048fb71/13059_2020_2105_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/a2bf2b07d9b0/13059_2020_2105_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/bf214671b2d5/13059_2020_2105_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/df004aa39228/13059_2020_2105_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/d4084a822ef5/13059_2020_2105_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/3d64bba5b2c3/13059_2020_2105_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/8f5b2d5a43a6/13059_2020_2105_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/64376048fb71/13059_2020_2105_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fff6/7385871/a2bf2b07d9b0/13059_2020_2105_Fig7_HTML.jpg

相似文献

1
Bovine breed-specific augmented reference graphs facilitate accurate sequence read mapping and unbiased variant discovery.牛种特异性增强参考图谱有助于准确的序列读取映射和无偏的变异发现。
Genome Biol. 2020 Jul 27;21(1):184. doi: 10.1186/s13059-020-02105-0.
2
Investigating the impact of reference assembly choice on genomic analyses in a cattle breed.研究参考组装选择对牛种基因组分析的影响。
BMC Genomics. 2021 May 19;22(1):363. doi: 10.1186/s12864-021-07554-w.
3
Accurate sequence variant genotyping in cattle using variation-aware genome graphs.利用变异感知基因组图谱对牛进行精确的序列变异基因分型。
Genet Sel Evol. 2019 May 15;51(1):21. doi: 10.1186/s12711-019-0462-x.
4
Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle.利用单一或多品种参考群体对牛全基因组序列进行填充的策略。
BMC Genomics. 2014 Aug 27;15(1):728. doi: 10.1186/1471-2164-15-728.
5
Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery.对黑安格斯牛和荷斯坦牛进行全基因组重测序以发现 SNP 和 CNV。
BMC Genomics. 2011 Nov 15;12:559. doi: 10.1186/1471-2164-12-559.
6
Meta-analysis of sequence-based association studies across three cattle breeds reveals 25 QTL for fat and protein percentages in milk at nucleotide resolution.基于序列的关联研究在三个牛品种中的荟萃分析,在核苷酸分辨率下揭示了牛奶中脂肪和蛋白质百分比的 25 个 QTL。
BMC Genomics. 2017 Nov 9;18(1):853. doi: 10.1186/s12864-017-4263-8.
7
Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy.将全基因组测序工作分散到多个品种上对基因填充准确性的影响。
BMC Genet. 2014 Oct 3;15:105. doi: 10.1186/s12863-014-0105-8.
8
A cattle graph genome incorporating global breed diversity.一个纳入全球品种多样性的牛基因组图谱。
Nat Commun. 2022 Feb 17;13(1):910. doi: 10.1038/s41467-022-28605-0.
9
Fast alignment of reads to a variation graph with application to SNP detection.快速将读取内容与变异图谱对齐,应用于 SNP 检测。
J Integr Bioinform. 2021 Nov 16;18(4):20210032. doi: 10.1515/jib-2021-0032.
10
The size and composition of haplotype reference panels impact the accuracy of imputation from low-pass sequencing in cattle.单体型参考面板的大小和组成会影响牛低深度测序数据的准确性。
Genet Sel Evol. 2023 May 11;55(1):33. doi: 10.1186/s12711-023-00809-y.

引用本文的文献

1
Global Pangenome Analysis Highlights the Critical Role of Structural Variants in Cattle Improvement and Identifies a Unique Event as a Novel Enhancer in IGFBP7+ Cells.全球泛基因组分析凸显结构变异在牛改良中的关键作用,并鉴定出一个独特事件作为IGFBP7 +细胞中的新型增强子。
Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf205.
2
Pangenome graph mitigates heterozygosity overestimation from mapping bias: a case study in Chinese indigenous pigs.泛基因组图谱减轻了因映射偏差导致的杂合度高估:以中国本土猪为例的研究
BMC Biol. 2025 Mar 26;23(1):89. doi: 10.1186/s12915-025-02194-y.
3
A gentle introduction to pangenomics.

本文引用的文献

1
Long-read sequencing of 3,622 Icelanders provides insight into the role of structural variants in human diseases and other traits.对 3622 名冰岛人的长读测序为深入了解结构变异在人类疾病和其他特征中的作用提供了线索。
Nat Genet. 2021 Jun;53(6):779-786. doi: 10.1038/s41588-021-00865-4. Epub 2021 May 10.
2
The design and construction of reference pangenome graphs with minigraph.使用 Minigraph 设计和构建参考泛基因组图谱。
Genome Biol. 2020 Oct 16;21(1):265. doi: 10.1186/s13059-020-02168-z.
3
Telomere-to-telomere assembly of a complete human X chromosome.
泛基因组学简介。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae588.
4
DNA methylation analysis to differentiate reference, breed, and parent-of-origin effects in the bovine pangenome era.在牛泛基因组时代,通过 DNA 甲基化分析来区分参考、品种和亲本来源效应。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae061.
5
Uncovering structural variants in Creole cattle from Guadeloupe and their impact on environmental adaptation through whole genome sequencing.揭示瓜德罗普岛克里奥尔牛的结构变异及其对环境适应的影响,方法是进行全基因组测序。
PLoS One. 2024 Aug 26;19(8):e0309411. doi: 10.1371/journal.pone.0309411. eCollection 2024.
6
An overview of recent technological developments in bovine genomics.牛基因组学近期技术发展概述。
Vet Anim Sci. 2024 Jul 23;25:100382. doi: 10.1016/j.vas.2024.100382. eCollection 2024 Sep.
7
Beef Cattle Genome Project: Advances in Genome Sequencing, Assembly, and Functional Genes Discovery.牛基因组计划:基因组测序、组装和功能基因发现的进展。
Int J Mol Sci. 2024 Jun 28;25(13):7147. doi: 10.3390/ijms25137147.
8
Measuring, visualizing, and diagnosing reference bias with biastools.使用 biastools 测量、可视化和诊断参考偏倚。
Genome Biol. 2024 Apr 19;25(1):101. doi: 10.1186/s13059-024-03240-8.
9
When Livestock Genomes Meet Third-Generation Sequencing Technology: From Opportunities to Applications.当家畜基因组遇到第三代测序技术:从机遇到应用。
Genes (Basel). 2024 Feb 15;15(2):245. doi: 10.3390/genes15020245.
10
Molecular genetic characterization and meat-use functional gene identification in Jianshui yellow-brown ducks through combined resequencing and transcriptome analysis.基于重测序与转录组联合分析的建水黄麻鸭分子遗传特征及肉用功能基因鉴定
Front Vet Sci. 2023 Dec 12;10:1269904. doi: 10.3389/fvets.2023.1269904. eCollection 2023.
端粒到端粒组装完整的人类 X 染色体。
Nature. 2020 Sep;585(7823):79-84. doi: 10.1038/s41586-020-2547-7. Epub 2020 Jul 14.
4
Personalized and graph genomes reveal missing signal in epigenomic data.个性化和图形基因组揭示了表观基因组数据中的缺失信号。
Genome Biol. 2020 May 25;21(1):124. doi: 10.1186/s13059-020-02038-8.
5
Assessing graph-based read mappers against a baseline approach highlights strengths and weaknesses of current methods.评估基于图的读映射器相对于基线方法,可以突出当前方法的优缺点。
BMC Genomics. 2020 Apr 6;21(1):282. doi: 10.1186/s12864-020-6685-y.
6
Continuous chromosome-scale haplotypes assembled from a single interspecies F1 hybrid of yak and cattle.连续的染色体尺度单体型由牦牛和牛的一个种间 F1 杂种组装而成。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa029.
7
De novo assembly of the cattle reference genome with single-molecule sequencing.利用单分子测序技术从头组装牛参考基因组。
Gigascience. 2020 Mar 1;9(3). doi: 10.1093/gigascience/giaa021.
8
Genotyping structural variants in pangenome graphs using the vg toolkit.使用vg工具包对泛基因组图谱中的结构变异进行基因分型。
Genome Biol. 2020 Feb 12;21(1):35. doi: 10.1186/s13059-020-1941-7.
9
Assessing genomic diversity and signatures of selection in Original Braunvieh cattle using whole-genome sequencing data.利用全基因组测序数据评估原布劳恩维勒牛的基因组多样性和选择特征。
BMC Genomics. 2020 Jan 8;21(1):27. doi: 10.1186/s12864-020-6446-y.
10
Paragraph: a graph-based structural variant genotyper for short-read sequence data.段落:基于图的短读序列数据结构变异基因分型器。
Genome Biol. 2019 Dec 19;20(1):291. doi: 10.1186/s13059-019-1909-7.