• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用全基因组 SNPs 和 InDels 对中国本土绵羊品种进行群体结构和品种鉴定。

Population structure and breed identification of Chinese indigenous sheep breeds using whole genome SNPs and InDels.

机构信息

Shandong Provincial Key Laboratory for Livestock Germplasm Innovation & Utilization, College of Animal Science, Shandong Agricultural University, Tai'an, 271018, China.

CAS Key Laboratory of Adaptation and Evolution of Plateau Biota, Qinghai Key Laboratory of Animal Ecological Genomics, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining, 810001, Qinghai, China.

出版信息

Genet Sel Evol. 2024 Sep 3;56(1):60. doi: 10.1186/s12711-024-00927-1.

DOI:10.1186/s12711-024-00927-1
PMID:39227836
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370120/
Abstract

BACKGROUND

Accurate breed identification is essential for the conservation and sustainable use of indigenous farm animal genetic resources. In this study, we evaluated the phylogenetic relationships and genomic breed compositions of 13 sheep breeds using SNP and InDel data from whole genome sequencing. The breeds included 11 Chinese indigenous and 2 foreign commercial breeds. We compared different strategies for breed identification with respect to different marker types, i.e. SNPs, InDels, and a combination of SNPs and InDels (named SIs), different breed-informative marker detection methods, and different machine learning classification methods.

RESULTS

Using WGS-based SNPs and InDels, we revealed the phylogenetic relationships between 11 Chinese indigenous and two foreign sheep breeds and quantified their purities through estimated genomic breed compositions. We found that the optimal strategy for identifying these breeds was the combination of DFI_union for breed-informative marker detection, which integrated the methods of Delta, Pairwise Wright's FST, and Informativeness for Assignment (namely DFI) by merging the breed-informative markers derived from the three methods, and KSR for breed assignment, which integrated the methods of K-Nearest Neighbor, Support Vector Machine, and Random Forest (namely KSR) by intersecting their results. Using SI markers improved the identification accuracy compared to using SNPs or InDels alone. We achieved accuracies over 97.5% when using at least the 1000 most breed-informative (MBI) SI markers and even 100% when using 5000 SI markers.

CONCLUSIONS

Our results provide not only an important foundation for conservation of these Chinese local sheep breeds, but also general approaches for breed identification of indigenous farm animal breeds.

摘要

背景

准确的品种鉴定对于保护和可持续利用本土农场动物遗传资源至关重要。在这项研究中,我们使用全基因组测序的 SNP 和 InDel 数据评估了 13 个绵羊品种的系统发育关系和基因组品种组成。这些品种包括 11 个中国本土品种和 2 个国外商业品种。我们比较了不同的标记类型(SNP、InDel 和 SNP 和 InDel 的组合,命名为 SIs)、不同的品种信息标记检测方法和不同的机器学习分类方法在品种鉴定方面的不同策略。

结果

使用基于 WGS 的 SNP 和 InDel,我们揭示了 11 个中国本土品种和两个国外绵羊品种之间的系统发育关系,并通过估计的基因组品种组成量化了它们的纯度。我们发现,识别这些品种的最佳策略是 DFI_union 与 KSR 相结合的策略,用于品种信息标记检测的 DFI_union 集成了 Delta、Pairwise Wright's FST 和 Informativeness for Assignment(即 DFI)三种方法的品种信息标记,用于品种分配的 KSR 则集成了 K-Nearest Neighbor、Support Vector Machine 和 Random Forest(即 KSR)三种方法的结果。与单独使用 SNP 或 InDel 相比,使用 SI 标记可提高识别准确性。当使用至少 1000 个最具品种信息性(MBI)SI 标记时,我们实现了超过 97.5%的准确率,甚至使用 5000 个 SI 标记时也可实现 100%的准确率。

结论

我们的研究结果不仅为保护这些中国本土绵羊品种提供了重要基础,也为本土农场动物品种的品种鉴定提供了一般方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/cc833e1a9b6a/12711_2024_927_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/3577b0cd7957/12711_2024_927_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/2f5bcdf352e9/12711_2024_927_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/39a2b361b4aa/12711_2024_927_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/65ed0602998a/12711_2024_927_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/cc833e1a9b6a/12711_2024_927_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/3577b0cd7957/12711_2024_927_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/2f5bcdf352e9/12711_2024_927_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/39a2b361b4aa/12711_2024_927_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/65ed0602998a/12711_2024_927_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1a25/11370120/cc833e1a9b6a/12711_2024_927_Fig5_HTML.jpg

相似文献

1
Population structure and breed identification of Chinese indigenous sheep breeds using whole genome SNPs and InDels.利用全基因组 SNPs 和 InDels 对中国本土绵羊品种进行群体结构和品种鉴定。
Genet Sel Evol. 2024 Sep 3;56(1):60. doi: 10.1186/s12711-024-00927-1.
2
Breed identification using breed-informative SNPs and machine learning based on whole genome sequence data and SNP chip data.利用品种信息性单核苷酸多态性(SNP)以及基于全基因组序列数据和SNP芯片数据的机器学习进行品种鉴定。
J Anim Sci Biotechnol. 2023 Jun 1;14(1):85. doi: 10.1186/s40104-023-00880-x.
3
Population structure and breed composition prediction in a multi-breed sheep population using genome-wide single nucleotide polymorphism genotypes.利用全基因组单核苷酸多态性基因型预测多品种绵羊群体的群体结构和品种组成。
Animal. 2020 Mar;14(3):464-474. doi: 10.1017/S1751731119002398. Epub 2019 Oct 15.
4
Classification accuracy of machine learning algorithms for Chinese local cattle breeds using genomic markers.机器学习算法利用基因组标记对中国本地牛品种进行分类准确性研究。
Yi Chuan. 2024 Jul;46(7):530-539. doi: 10.16288/j.yczz.24-059.
5
High imputation accuracy from informative low-to-medium density single nucleotide polymorphism genotypes is achievable in sheep1.在绵羊中,信息量较低且中等密度的单核苷酸多态性基因型也能实现高的插补准确性。1
J Anim Sci. 2019 Apr 3;97(4):1550-1567. doi: 10.1093/jas/skz043.
6
Genomic diversity and population structure of three autochthonous Greek sheep breeds assessed with genome-wide DNA arrays.利用全基因组 DNA 芯片评估三个本土希腊绵羊品种的基因组多样性和种群结构。
Mol Genet Genomics. 2018 Jun;293(3):753-768. doi: 10.1007/s00438-018-1421-x. Epub 2018 Jan 25.
7
Screening Discriminating SNPs for Chinese Indigenous Pig Breeds Identification Using a Random Forests Algorithm.利用随机森林算法筛选用于中国本土猪种鉴定的区分 SNP。
Genes (Basel). 2022 Nov 25;13(12):2207. doi: 10.3390/genes13122207.
8
Whole genome characterization of autochthonous Bos taurus brachyceros and introduced Bos indicus indicus cattle breeds in Cameroon regarding their adaptive phenotypic traits and pathogen resistance.喀麦隆本土短角牛和引入的印度瘤牛牛种的全基因组特征,涉及它们的适应性表型特征和对病原体的抗性。
BMC Genet. 2020 Jun 22;21(1):64. doi: 10.1186/s12863-020-00869-9.
9
The use of a genomic relationship matrix for breed assignment of cattle breeds: comparison and combination with a machine learning method.利用基因组关系矩阵对牛品种进行品种归属:与机器学习方法的比较和结合。
J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad172.
10
Preselection statistics and Random Forest classification identify population informative single nucleotide polymorphisms in cosmopolitan and autochthonous cattle breeds.预选统计和随机森林分类确定了世界性和土着牛品种中具有群体信息的单核苷酸多态性。
Animal. 2018 Jan;12(1):12-19. doi: 10.1017/S1751731117001355. Epub 2017 Jun 23.

引用本文的文献

1
A deep learning strategy for accurate identification of purebred and hybrid pigs across SNP chips.一种基于SNP芯片准确识别纯种猪和杂交猪的深度学习策略。
J Anim Sci Biotechnol. 2025 Aug 14;16(1):116. doi: 10.1186/s40104-025-01249-y.
2
rPIMS: a ShinyR package for the precision identification and modelling of livestock breeds using genomic data and machine learning approaches.rPIMS:一个用于利用基因组数据和机器学习方法对家畜品种进行精准识别和建模的ShinyR软件包。
Bioinform Adv. 2025 Apr 7;5(1):vbaf077. doi: 10.1093/bioadv/vbaf077. eCollection 2025.
3
Population Genetic Structure of Three-Spined Sticklebacks in the St. Lawrence: A Gradient of Change.

本文引用的文献

1
Whole-genome resequencing of Chinese indigenous sheep provides insight into the genetic basis underlying climate adaptation.对中国本土绵羊的全基因组重测序为深入了解气候适应的遗传基础提供了线索。
Genet Sel Evol. 2024 Apr 2;56(1):26. doi: 10.1186/s12711-024-00880-z.
2
Evaluating the use of statistical and machine learning methods for estimating breed composition of purebred and crossbred animals in thirteen cattle breeds using genomic information.利用基因组信息评估统计和机器学习方法在13个牛品种中估计纯种和杂交动物品种组成的应用。
Front Genet. 2023 May 15;14:1120312. doi: 10.3389/fgene.2023.1120312. eCollection 2023.
3
圣劳伦斯河三刺鱼的种群遗传结构:变化梯度
Ecol Evol. 2025 Apr 23;15(4):e71153. doi: 10.1002/ece3.71153. eCollection 2025 Apr.
4
Genetic and Epigenetic Adaptation Mechanisms of Sheep Under Multi-Environmental Stress Environment.多环境应激条件下绵羊的遗传与表观遗传适应机制
Int J Mol Sci. 2025 Apr 1;26(7):3261. doi: 10.3390/ijms26073261.
5
Single nucleotide polymorphism-based analysis of linkage disequilibrium and runs of homozygosity patterns of indigenous sheep in the southern Taklamakan desert.基于单核苷酸多态性的塔克拉玛干沙漠南部本土绵羊连锁不平衡和纯合子连续片段模式分析
BMC Genomics. 2025 Mar 18;26(1):267. doi: 10.1186/s12864-025-11445-9.
6
Advances in Whole Genome Sequencing: Methods, Tools, and Applications in Population Genomics.全基因组测序进展:群体基因组学中的方法、工具及应用
Int J Mol Sci. 2025 Jan 4;26(1):372. doi: 10.3390/ijms26010372.
7
Identification of Taihang-chicken-specific genetic markers using genome-wide SNPs and machine learning: BREED-SPECIFIC SNPS OF TAIHANG CHICKEN.利用全基因组单核苷酸多态性和机器学习鉴定太行鸡特异性遗传标记:太行鸡的品种特异性单核苷酸多态性
Poult Sci. 2025 Jan;104(1):104585. doi: 10.1016/j.psj.2024.104585. Epub 2024 Nov 22.
Breed identification using breed-informative SNPs and machine learning based on whole genome sequence data and SNP chip data.
利用品种信息性单核苷酸多态性(SNP)以及基于全基因组序列数据和SNP芯片数据的机器学习进行品种鉴定。
J Anim Sci Biotechnol. 2023 Jun 1;14(1):85. doi: 10.1186/s40104-023-00880-x.
4
A web tool for the global identification of pig breeds.一个用于全球猪品种识别的网络工具。
Genet Sel Evol. 2023 Mar 21;55(1):18. doi: 10.1186/s12711-023-00788-0.
5
Screening Discriminating SNPs for Chinese Indigenous Pig Breeds Identification Using a Random Forests Algorithm.利用随机森林算法筛选用于中国本土猪种鉴定的区分 SNP。
Genes (Basel). 2022 Nov 25;13(12):2207. doi: 10.3390/genes13122207.
6
An improved ovine reference genome assembly to facilitate in-depth functional annotation of the sheep genome.一个改良的绵羊参考基因组组装,以促进绵羊基因组的深入功能注释。
Gigascience. 2022 Feb 4;11. doi: 10.1093/gigascience/giab096.
7
A low-density SNP genotyping panel for the accurate prediction of cattle breeds.一种用于准确预测牛种的低密度 SNP 基因分型面板。
J Anim Sci. 2020 Nov 1;98(11). doi: 10.1093/jas/skaa337.
8
Population structure and breed composition prediction in a multi-breed sheep population using genome-wide single nucleotide polymorphism genotypes.利用全基因组单核苷酸多态性基因型预测多品种绵羊群体的群体结构和品种组成。
Animal. 2020 Mar;14(3):464-474. doi: 10.1017/S1751731119002398. Epub 2019 Oct 15.
9
fastp: an ultra-fast all-in-one FASTQ preprocessor.fastp:一个超快速的一体化 FASTQ 预处理程序。
Bioinformatics. 2018 Sep 1;34(17):i884-i890. doi: 10.1093/bioinformatics/bty560.
10
Comparing SNP panels and statistical methods for estimating genomic breed composition of individual animals in ten cattle breeds.比较用于估计十个牛品种个体动物基因组品种组成的单核苷酸多态性(SNP)面板和统计方法。
BMC Genet. 2018 Aug 9;19(1):56. doi: 10.1186/s12863-018-0654-3.