• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

资源受限下的人类祖先识别——一条染色体能告诉我们关于人类生物地理祖先的哪些信息?

Human ancestry indentification under resource constraints -- what can one chromosome tell us about human biogeographical ancestry?

作者信息

Toma Tanjin T, Dawson Jeremy M, Adjeroh Donald A

机构信息

Lane Department of Computer Science and Electrical Engineering, West Virginia University, Morgantown, WV, USA.

出版信息

BMC Med Genomics. 2018 Nov 20;11(Suppl 5):0. doi: 10.1186/s12920-018-0412-4.

DOI:10.1186/s12920-018-0412-4
PMID:30453954
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6245491/
Abstract

BACKGROUND

While continental level ancestry is relatively simple using genomic information, distinguishing between individuals from closely associated sub-populations (e.g., from the same continent) is still a difficult challenge.

METHODS

We study the problem of predicting human biogeographical ancestry from genomic data under resource constraints. In particular, we focus on the case where the analysis is constrained to using single nucleotide polymorphisms (SNPs) from just one chromosome. We propose methods to construct such ancestry informative SNP panels using correlation-based and outlier-based methods.

RESULTS

We accessed the performance of the proposed SNP panels derived from just one chromosome, using data from the 1000 Genome Project, Phase 3. For continental-level ancestry classification, we achieved an overall classification rate of 96.75% using 206 single nucleotide polymorphisms (SNPs). For sub-population level ancestry prediction, we achieved an average pairwise binary classification rates as follows: subpopulations in Europe: 76.6% (58 SNPs); Africa: 87.02% (87 SNPs); East Asia: 73.30% (68 SNPs); South Asia: 81.14% (75 SNPs); America: 85.85% (68 SNPs).

CONCLUSION

Our results demonstrate that one single chromosome (in particular, Chromosome 1), if carefully analyzed, could hold enough information for accurate prediction of human biogeographical ancestry. This has significant implications in terms of the computational resources required for analysis of ancestry, and in the applications of such analyses, such as in studies of genetic diseases, forensics, and soft biometrics.

摘要

背景

虽然利用基因组信息推断大陆水平的祖先相对简单,但区分来自密切相关亚群体(例如来自同一大陆)的个体仍然是一项艰巨的挑战。

方法

我们研究了在资源受限的情况下,从基因组数据预测人类生物地理祖先的问题。具体而言,我们专注于分析仅限于使用来自一条染色体的单核苷酸多态性(SNP)的情况。我们提出了使用基于相关性和基于异常值的方法来构建此类祖先信息SNP面板的方法。

结果

我们使用来自千人基因组计划第三阶段的数据,评估了仅从一条染色体衍生的所提出的SNP面板的性能。对于大陆水平的祖先分类,我们使用206个单核苷酸多态性(SNP)实现了96.75%的总体分类率。对于亚群体水平的祖先预测,我们实现了如下平均成对二元分类率:欧洲亚群体:76.6%(58个SNP);非洲:87.02%(87个SNP);东亚:73.30%(68个SNP);南亚:81.14%(75个SNP);美洲:85.85%(68个SNP)。

结论

我们的结果表明,如果仔细分析,一条单一的染色体(特别是1号染色体)可以包含足够的信息来准确预测人类生物地理祖先。这在祖先分析所需的计算资源以及此类分析的应用方面具有重要意义,例如在遗传疾病研究、法医学和软生物特征识别中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/8e2661f0fb5c/12920_2018_412_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/83ab16a8dd1b/12920_2018_412_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/a862682d7f73/12920_2018_412_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/8e2661f0fb5c/12920_2018_412_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/83ab16a8dd1b/12920_2018_412_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/a862682d7f73/12920_2018_412_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bf68/6245491/8e2661f0fb5c/12920_2018_412_Fig3_HTML.jpg

相似文献

1
Human ancestry indentification under resource constraints -- what can one chromosome tell us about human biogeographical ancestry?资源受限下的人类祖先识别——一条染色体能告诉我们关于人类生物地理祖先的哪些信息?
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):0. doi: 10.1186/s12920-018-0412-4.
2
ETHNOPRED: a novel machine learning method for accurate continental and sub-continental ancestry identification and population stratification correction.ETHNOPRED:一种用于准确进行大陆和次大陆祖先鉴定和群体分层校正的新型机器学习方法。
BMC Bioinformatics. 2013 Feb 22;14:61. doi: 10.1186/1471-2105-14-61.
3
A single nucleotide polymorphism panel for individual identification and ancestry assignment in Caucasians and four East and Southeast Asian populations using a machine learning classifier.使用机器学习分类器的单核苷酸多态性面板用于白种人和四个东亚及东南亚人群的个体识别和血统归属。
Forensic Sci Med Pathol. 2019 Mar;15(1):67-74. doi: 10.1007/s12024-018-0071-y. Epub 2019 Jan 16.
4
An ancestry informative marker set for determining continental origin: validation and extension using human genome diversity panels.用于确定大陆起源的祖先信息标记集:使用人类基因组多样性面板进行验证和扩展
BMC Genet. 2009 Jul 24;10:39. doi: 10.1186/1471-2156-10-39.
5
Development of a SNP panel for predicting biogeographical ancestry and phenotype using massively parallel sequencing.利用大规模平行测序开发用于预测生物地理起源和表型的 SNP 面板。
Electrophoresis. 2018 Nov;39(21):2743-2751. doi: 10.1002/elps.201800243. Epub 2018 Aug 17.
6
Simple and cost-effective 14-loci SNP assay designed for differentiation of European, East Asian and African samples.设计用于区分欧洲、东亚和非洲样本的简单且经济高效的14位点单核苷酸多态性检测方法。
Forensic Sci Int Genet. 2015 Jan;14:42-9. doi: 10.1016/j.fsigen.2014.09.009. Epub 2014 Sep 28.
7
Systematic analyses of AISNPs screening and classification algorithms based on genome-wide data for forensic biogeographic ancestry inference.基于全基因组数据的法医生物地理祖籍推断的 AISNPs 筛选和分类算法的系统分析。
Forensic Sci Int. 2024 Apr;357:111975. doi: 10.1016/j.forsciint.2024.111975. Epub 2024 Mar 2.
8
Ancestry informative SNP panels for discriminating the major East Asian populations: Han Chinese, Japanese and Korean.用于区分东亚主要人群(汉族、日本人和韩国人)的祖先信息性单核苷酸多态性(SNP)面板。
Ann Hum Genet. 2019 Sep;83(5):348-354. doi: 10.1111/ahg.12320. Epub 2019 Apr 26.
9
Next generation sequencing of a set of ancestry-informative SNPs: ancestry assignment of three continental populations and estimating ancestry composition for Mongolians.一组与祖先相关的 SNPs 的下一代测序:三个大陆人群的祖先归属以及蒙古人的祖先成分估计。
Mol Genet Genomics. 2020 Jul;295(4):1027-1038. doi: 10.1007/s00438-020-01660-2. Epub 2020 Mar 23.
10
Genetic composition of Brazilian population samples based on a set of twenty-eight ancestry informative SNPs.基于一组 28 个遗传多态性的单核苷酸多态性(SNP)对巴西人群样本的遗传组成进行分析。
Am J Hum Biol. 2010 Mar-Apr;22(2):187-92. doi: 10.1002/ajhb.20976.

引用本文的文献

1
Correction to: Human ancestry indentification under resource constraints -- what can one chromosome tell us about human biogeographical ancestry?对《资源受限情况下的人类祖先识别——一条染色体能告诉我们关于人类生物地理祖先的哪些信息?》的勘误
BMC Med Genomics. 2019 Jan 9;12(1):2. doi: 10.1186/s12920-018-0464-5.

本文引用的文献

1
Novel probabilistic models of spatial genetic ancestry with applications to stratification correction in genome-wide association studies.用于全基因组关联研究分层校正的空间遗传血统新型概率模型。
Bioinformatics. 2017 Mar 15;33(6):879-885. doi: 10.1093/bioinformatics/btw720.
2
Prediction of breast cancer risk based on common genetic variants in women of East Asian ancestry.基于东亚血统女性常见基因变异预测乳腺癌风险。
Breast Cancer Res. 2016 Dec 8;18(1):124. doi: 10.1186/s13058-016-0786-1.
3
AKT: ancestry and kinship toolkit.AKT:血统和亲属关系工具包。
Bioinformatics. 2017 Jan 1;33(1):142-144. doi: 10.1093/bioinformatics/btw576. Epub 2016 Sep 14.
4
Integrating, summarizing and visualizing GWAS-hits and human diversity with DANCE (Disease-ANCEstry networks).利用 DANCE(疾病-ANCEstry 网络)整合、总结和可视化 GWAS 命中和人类多样性。
Bioinformatics. 2016 Apr 15;32(8):1247-9. doi: 10.1093/bioinformatics/btv708. Epub 2015 Dec 15.
5
ancGWAS: a post genome-wide association study method for interaction, pathway and ancestry analysis in homogeneous and admixed populations.ancGWAS:一种用于同质和混合人群中相互作用、通路及祖先分析的全基因组关联研究后分析方法。
Bioinformatics. 2016 Feb 15;32(4):549-56. doi: 10.1093/bioinformatics/btv619. Epub 2015 Oct 27.
6
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
7
Reconstructing Past Admixture Processes from Local Genomic Ancestry Using Wavelet Transformation.利用小波变换从局部基因组祖先重建过去的混合过程。
Genetics. 2015 Jun;200(2):469-81. doi: 10.1534/genetics.115.176842. Epub 2015 Apr 7.
8
Evaluating the accuracy of AIM panels at quantifying genome ancestry.评估AIM面板在量化基因组血统方面的准确性。
BMC Genomics. 2014 Jun 30;15(1):543. doi: 10.1186/1471-2164-15-543.
9
Progress toward an efficient panel of SNPs for ancestry inference.用于血统推断的高效单核苷酸多态性(SNP)面板的进展。
Forensic Sci Int Genet. 2014 May;10:23-32. doi: 10.1016/j.fsigen.2014.01.002. Epub 2014 Jan 15.
10
A 50-SNP assay for biogeographic ancestry and phenotype prediction in the U.S. population.50-SNP 基因检测在美国人群中的生物地理起源和表型预测
Forensic Sci Int Genet. 2014 Jan;8(1):101-8. doi: 10.1016/j.fsigen.2013.07.010. Epub 2013 Aug 17.