• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于全基因组数据的法医生物地理祖籍推断的 AISNPs 筛选和分类算法的系统分析。

Systematic analyses of AISNPs screening and classification algorithms based on genome-wide data for forensic biogeographic ancestry inference.

机构信息

Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China.

Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, Guangdong, China; Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, Guangdong, China.

出版信息

Forensic Sci Int. 2024 Apr;357:111975. doi: 10.1016/j.forsciint.2024.111975. Epub 2024 Mar 2.

DOI:10.1016/j.forsciint.2024.111975
PMID:38547686
Abstract

Identifying the biogeographic ancestral origin of biological sample left at a crime scene can provide important evidence for judicial case, as well as clue for narrowing down suspect. Ancestry informative single nucleotide polymorphism (AISNP) has become one of the most important genetic markers in recent years for screening ancestry information loci and analyzing the population genetic background and structure due to their high number and wide distributions in the human genome. In this study, based on data from 26 populations in the 1000 Genomes Project Phase 3, a Random Forest classification model was constructed with one-vs-rest classification strategy for embedded feature selection in order to obtain a panel with a small number of efficient AISNPs. The research aim was to clarify differentiations of population genetic structures among continents and subregions of East Asia. ADMIXTURE results showed that based on the 58 AISNPs selected by the machine learning algorithm, the 26 populations involved in the study could be categorized into six intercontinental ancestry components: North East Asia, South East Asia, Africa, Europe, South Asia, and America. The 24 continental-specific AISNPs and 34 East Asian-specific AISNPs were finally obtained, and used to construct the ancestry prediction model using XGBoost algorithm, resulting in the Matthews correlation coefficients of 0.94 and 0.89, and accuracies of 0.94 and 0.92, respectively. The machine learning models that we constructed using population-specific AISNPs were able to accurately predict the ancestral origins of continental and intra-East Asian populations. To summarize, screening a set of high-perform AISNPs to infer biogeographical ancestral information using embedded feature selection has potential application in creating a layered inference system that accurately differentiates from intercontinental populations to local subpopulations.

摘要

鉴定犯罪现场遗留生物样本的生物地理祖先起源,可以为司法案件提供重要证据,也可以为缩小嫌疑人范围提供线索。由于其在人类基因组中的数量多且分布广泛,因此,单核苷酸多态性(SNP)成为近年来筛选祖先信息位点、分析群体遗传背景和结构的最重要遗传标记之一。本研究基于 1000 基因组计划第三阶段 26 个人群的数据,采用一对一分类策略的随机森林分类模型进行嵌入式特征选择,以获得一个具有少数高效 SNP 的面板。本研究旨在阐明不同大洲和东亚亚区的种群遗传结构差异。ADMIXTURE 结果表明,基于机器学习算法选择的 58 个 SNP,可以将研究中涉及的 26 个人群分为六个洲际祖先成分:东北亚、东南亚、非洲、欧洲、南亚和美洲。最终获得了 24 个大陆特异性 SNP 和 34 个东亚特异性 SNP,并使用 XGBoost 算法构建了祖先预测模型,结果分别为 Matthews 相关系数 0.94 和 0.89,准确率为 0.94 和 0.92。使用人群特异性 SNP 构建的机器学习模型能够准确预测大陆和东亚内部人群的祖先起源。总之,筛选一组高性能 SNP 进行嵌入式特征选择,以推断生物地理祖先信息,在创建一个能够从洲际人群到本地亚群进行准确区分的分层推断系统方面具有潜在的应用价值。

相似文献

1
Systematic analyses of AISNPs screening and classification algorithms based on genome-wide data for forensic biogeographic ancestry inference.基于全基因组数据的法医生物地理祖籍推断的 AISNPs 筛选和分类算法的系统分析。
Forensic Sci Int. 2024 Apr;357:111975. doi: 10.1016/j.forsciint.2024.111975. Epub 2024 Mar 2.
2
Systematic selection of ancestry informative SNPs for differentiating Han, Japanese, Dai, and Kinh populations.系统选择区分汉族、日本、傣族和京族人群的祖先信息 SNP。
Electrophoresis. 2023 Sep;44(17-18):1405-1413. doi: 10.1002/elps.202200292. Epub 2023 Jun 16.
3
An efficient ancestry informative SNPs panel for further discriminating East Asian populations.一个高效的用于进一步区分东亚人群的亲缘关系信息 SNP 面板。
Electrophoresis. 2022 Sep;43(16-17):1774-1783. doi: 10.1002/elps.202100349. Epub 2022 Jul 9.
4
Massively parallel sequencing of 165 ancestry-informative SNPs and forensic biogeographical ancestry inference in three southern Chinese Sinitic/Tai-Kadai populations.对 165 个具有族群遗传信息的 SNP 进行大规模平行测序,并对中国南方三个汉藏语系/台语族群进行法医学生物地理族群推断。
Forensic Sci Int Genet. 2021 May;52:102475. doi: 10.1016/j.fsigen.2021.102475. Epub 2021 Feb 2.
5
Biogeographic origin prediction of three continental populations through 42 ancestry informative SNPs.通过 42 个祖先信息 SNP 预测三个大陆人群的生物地理起源。
Electrophoresis. 2020 Feb;41(3-4):235-245. doi: 10.1002/elps.201900241. Epub 2019 Nov 29.
6
A panel of 74 AISNPs: Improved ancestry inference within Eastern Asia.一组74个亚洲内部特异性单核苷酸多态性:改善东亚地区的血统推断
Forensic Sci Int Genet. 2016 Jul;23:101-110. doi: 10.1016/j.fsigen.2016.04.002. Epub 2016 Apr 4.
7
Ancestry Prediction Comparisons of Different AISNPs for Five Continental Populations and Population Structure Dissection of the Xinjiang Hui Group via a Self-Developed Panel.基于自主研发 panel 对五个大陆人群的不同 AISNPs 进行祖籍预测比较及对新疆回族人群结构进行剖析。
Genes (Basel). 2020 May 4;11(5):505. doi: 10.3390/genes11050505.
8
The ancestry inference of Chinese populations using 74-plex SNPs system.使用74重单核苷酸多态性(SNP)系统对中国人群进行祖先推断。
Yi Chuan. 2020 Mar 20;42(3):296-308. doi: 10.16288/j.yczz.19-252.
9
A panel of 130 autosomal single-nucleotide polymorphisms for ancestry assignment in five Asian populations and in Caucasians.一组用于五个亚洲人群和高加索人群血统鉴定的130个常染色体单核苷酸多态性。
Forensic Sci Med Pathol. 2017 Jun;13(2):177-187. doi: 10.1007/s12024-017-9863-8. Epub 2017 Apr 24.
10
A single nucleotide polymorphism panel for individual identification and ancestry assignment in Caucasians and four East and Southeast Asian populations using a machine learning classifier.使用机器学习分类器的单核苷酸多态性面板用于白种人和四个东亚及东南亚人群的个体识别和血统归属。
Forensic Sci Med Pathol. 2019 Mar;15(1):67-74. doi: 10.1007/s12024-018-0071-y. Epub 2019 Jan 16.