• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AIM-SNPtag:一种用于开发具有遗传背景信息的 SNP 面板的计算高效方法。

AIM-SNPtag: A computationally efficient approach for developing ancestry-informative SNP panels.

机构信息

CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.

CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.

出版信息

Forensic Sci Int Genet. 2019 Jan;38:245-253. doi: 10.1016/j.fsigen.2018.10.015. Epub 2018 Nov 2.

DOI:10.1016/j.fsigen.2018.10.015
PMID:30476751
Abstract

Inferring an individual's ancestry or group membership using a small set of highly informative genetic markers is very useful in forensic and medical genetics. However, given the huge amount of SNP data available from a diverse of populations, it is challenging to develop informative panels by exhaustively searching for all possible SNP combinations. In this study, we formulate it as an algorithm problem of selecting an optimal set of SNPs that maximizes the inference accuracy while minimizes the set size. Built on this conception, we develop a computational approach that is capable of constructing ancestry informative panels from multi-population genome-wide SNP data efficiently. We evaluated the performance of the method by comparing the panel size and membership inference accuracy of the constructed SNP panels to panels selected through empirical procedures in previous studies. For the membership inference of population groups including Asian, European, African, East Asian and Southeast Asian, a 36-SNP panel developed by our approach has an overall accuracy of 99.07%, and a 21-SNP subset of the panel has an overall accuracy of 95.36%. In comparison, an existing panel requires 74 SNPs to achieve an accuracy of 94.14% on the same set of population groups. We further apply the method to four subpopulations within Europe (Finnish, British, Spanish and Italian); a 175-SNP panel can discriminate individuals of those European subpopulations with an accuracy of 99.36%, of which a 68-SNP subset can achieve an accuracy of 95.07%. We expect our method to be a useful tool for constructing ancestry informative markers in forensic genetics.

摘要

利用一小部分信息量丰富的遗传标记推断个体的祖先或群体归属,在法医学和医学遗传学中非常有用。然而,考虑到来自不同人群的 SNP 数据数量巨大,通过穷尽搜索所有可能的 SNP 组合来开发信息量丰富的面板是具有挑战性的。在这项研究中,我们将其表述为选择一组最优 SNP 的算法问题,这些 SNP 最大限度地提高了推断准确性,同时最小化了集合大小。在此概念的基础上,我们开发了一种能够从多人群全基因组 SNP 数据中有效构建祖先信息面板的计算方法。我们通过将构建的 SNP 面板与之前研究中通过经验程序选择的面板的面板大小和成员推断准确性进行比较,来评估该方法的性能。对于包括亚洲、欧洲、非洲、东亚和东南亚在内的人群群体的成员推断,我们的方法开发的 36-SNP 面板的总体准确性为 99.07%,而面板的 21-SNP 子集的总体准确性为 95.36%。相比之下,现有的面板需要 74 个 SNP 才能在相同的人群群体上达到 94.14%的准确性。我们进一步将该方法应用于欧洲的四个亚群(芬兰人、英国人、西班牙人和意大利人);一个 175-SNP 面板可以以 99.36%的准确性区分这些欧洲亚群的个体,其中一个 68-SNP 子集可以达到 95.07%的准确性。我们希望我们的方法成为法医遗传学中构建祖先信息标记的有用工具。

相似文献

1
AIM-SNPtag: A computationally efficient approach for developing ancestry-informative SNP panels.AIM-SNPtag:一种用于开发具有遗传背景信息的 SNP 面板的计算高效方法。
Forensic Sci Int Genet. 2019 Jan;38:245-253. doi: 10.1016/j.fsigen.2018.10.015. Epub 2018 Nov 2.
2
Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region.Pacifiplex:一个以澳大利亚和太平洋地区为中心的具有祖先信息的单核苷酸多态性(SNP)面板。
Forensic Sci Int Genet. 2016 Jan;20:71-80. doi: 10.1016/j.fsigen.2015.10.003. Epub 2015 Oct 20.
3
Minimal SNP overlap among multiple panels of ancestry informative markers argues for more international collaboration.多个祖先信息标记面板之间的单核苷酸多态性(SNP)重叠最少,这表明需要更多的国际合作。
Forensic Sci Int Genet. 2016 Jul;23:25-32. doi: 10.1016/j.fsigen.2016.01.013. Epub 2016 Jan 22.
4
An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.基于全外显子组测序数据的西班牙裔个体祖籍信息标记面板设计用于个体祖籍估计。
BMC Genomics. 2019 Dec 30;20(Suppl 12):1007. doi: 10.1186/s12864-019-6333-6.
5
Ancestry informative SNP panels for discriminating the major East Asian populations: Han Chinese, Japanese and Korean.用于区分东亚主要人群(汉族、日本人和韩国人)的祖先信息性单核苷酸多态性(SNP)面板。
Ann Hum Genet. 2019 Sep;83(5):348-354. doi: 10.1111/ahg.12320. Epub 2019 Apr 26.
6
PIMA: A population informative multiplex for the Americas.皮马印第安人多重信息标记:美洲人群特有的遗传标记
Forensic Sci Int Genet. 2020 Jan;44:102200. doi: 10.1016/j.fsigen.2019.102200. Epub 2019 Nov 5.
7
Global analysis of population stratification using a smart panel of 27 continental ancestry-informative SNPs.利用一个包含 27 个大陆祖先信息单核苷酸多态性(SNP)的智能面板进行全球人群结构分层的分析。
Forensic Sci Int Genet. 2018 Jul;35:e10-e12. doi: 10.1016/j.fsigen.2018.05.006. Epub 2018 May 18.
8
A single-tube 27-plex SNP assay for estimating individual ancestry and admixture from three continents.一种用于从三大洲估计个体祖先和混合情况的单管27重SNP检测方法。
Int J Legal Med. 2016 Jan;130(1):27-37. doi: 10.1007/s00414-015-1183-5. Epub 2015 Apr 2.
9
Inter-laboratory evaluation of the EUROFORGEN Global ancestry-informative SNP panel by massively parallel sequencing using the Ion PGM™.使用Ion PGM™通过大规模平行测序对EUROFORGEN全球祖先信息SNP面板进行实验室间评估。
Forensic Sci Int Genet. 2016 Jul;23:178-189. doi: 10.1016/j.fsigen.2016.04.008. Epub 2016 Apr 21.
10
Evaluating a subset of ancestry informative SNPs for discriminating among Southwest Asian and circum-Mediterranean populations.评估一组祖先信息性单核苷酸多态性以区分西南亚和地中海周边人群。
Forensic Sci Int Genet. 2016 Jul;23:153-158. doi: 10.1016/j.fsigen.2016.04.010. Epub 2016 Apr 27.

引用本文的文献

1
A high-throughput screening method for selecting feature SNPs to evaluate breed diversity and infer ancestry.一种用于选择特征单核苷酸多态性以评估品种多样性和推断祖先的高通量筛选方法。
Genome Res. 2025 Aug 1;35(8):1875-1886. doi: 10.1101/gr.280176.124.
2
snpAIMeR: R package for evaluating ancestry informative marker contributions in non-model population diagnostics.snpAIMeR:用于评估非模型群体诊断中遗传标记贡献的 R 包。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae377.
3
Ancestry informative DIP loci for dissecting genetic structure and ancestry proportions of Qinghai Tibetan and Tibet Tibetan groups.
用于剖析青海藏族和西藏藏族群体遗传结构和祖源比例的信息性 DIP 基因座。
Mol Biol Rep. 2020 Feb;47(2):1079-1087. doi: 10.1007/s11033-019-05202-x. Epub 2019 Dec 2.
4
GRAF-pop: A Fast Distance-Based Method To Infer Subject Ancestry from Multiple Genotype Datasets Without Principal Components Analysis.GRAF-pop:一种无需主成分分析即可基于距离推断个体祖先的快速方法,适用于多种基因型数据集。
G3 (Bethesda). 2019 Aug 8;9(8):2447-2461. doi: 10.1534/g3.118.200925.