• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MI-MAAP:多祖混合人群的标记信息量。

MI-MAAP: marker informativeness for multi-ancestry admixed populations.

机构信息

Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, 3333 Burnet Avenue, MLC 7037, Cincinnati, OH, 45229-3026, USA.

Department of Electrical Engineering and Computer Science, University of Cincinnati, Cincinnati, OH, 45221, USA.

出版信息

BMC Bioinformatics. 2020 Apr 3;21(1):131. doi: 10.1186/s12859-020-3462-5.

DOI:10.1186/s12859-020-3462-5
PMID:32245404
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7119171/
Abstract

BACKGROUND

Admixed populations arise when two or more previously isolated populations interbreed. A powerful approach to addressing the genetic complexity in admixed populations is to infer ancestry. Ancestry inference including the proportion of an individual's genome coming from each population and its ancestral origin along the chromosome of an admixed population requires the use of ancestry informative markers (AIMs) from reference ancestral populations. AIMs exhibit substantial differences in allele frequency between ancestral populations. Given the huge amount of human genetic variation data available from diverse populations, a computationally feasible and cost-effective approach is becoming increasingly important to extract or filter AIMs with the maximum information content for ancestry inference, admixture mapping, forensic applications, and detecting genomic regions that have been under recent selection.

RESULTS

To address this gap, we present MI-MAAP, an easy-to-use web-based bioinformatics tool designed to prioritize informative markers for multi-ancestry admixed populations by utilizing feature selection methods and multiple genomics resources including 1000 Genomes Project and Human Genome Diversity Project. Specifically, this tool implements a novel allele frequency-based feature selection algorithm, Lancaster Estimator of Independence (LEI), as well as other genotype-based methods such as Principal Component Analysis (PCA), Support Vector Machine (SVM), and Random Forest (RF). We demonstrated that MI-MAAP is a useful tool in prioritizing informative markers and accurately classifying ancestral populations. LEI is an efficient feature selection strategy to retrieve ancestry informative variants with different allele frequency/selection pressure among (or between) ancestries without requiring computationally expensive individual-level genotype data.

CONCLUSIONS

MI-MAAP has a user-friendly interface which provides researchers an easy and fast way to filter and identify AIMs. MI-MAAP can be accessed at https://research.cchmc.org/mershalab/MI-MAAP/login/.

摘要

背景

当两个或更多以前隔离的群体杂交时,就会出现混合群体。解决混合群体遗传复杂性的一种有力方法是推断其祖先。推断祖先包括个体基因组中来自每个群体的比例及其在混合群体中的染色体祖先起源,这需要使用来自参考祖先群体的祖先信息标记 (AIMs)。AIMs 在等位基因频率方面在祖先群体之间存在显著差异。鉴于来自不同群体的大量人类遗传变异数据可用,提取或过滤具有最大祖先推断、混合映射、法医应用和检测最近受到选择的基因组区域信息量的 AIMs 的计算上可行且具有成本效益的方法变得越来越重要。

结果

为了解决这一差距,我们提出了 MI-MAAP,这是一种易于使用的基于网络的生物信息学工具,旨在通过利用特征选择方法和多种基因组学资源(包括 1000 基因组计划和人类基因组多样性计划),优先为多祖先混合群体选择信息标记。具体来说,该工具实现了一种新颖的基于等位基因频率的特征选择算法,即兰开斯特独立性估计器 (LEI),以及其他基于基因型的方法,如主成分分析 (PCA)、支持向量机 (SVM) 和随机森林 (RF)。我们证明了 MI-MAAP 是一种有用的工具,可以优先选择信息标记并准确分类祖先群体。LEI 是一种有效的特征选择策略,可在不要求计算昂贵的个体水平基因型数据的情况下,从(或在)祖先之间检索具有不同等位基因频率/选择压力的祖先信息变体。

结论

MI-MAAP 具有用户友好的界面,为研究人员提供了一种简单快捷的方法来筛选和识别 AIMs。MI-MAAP 可在 https://research.cchmc.org/mershalab/MI-MAAP/login/ 访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/85ea0b1e513a/12859_2020_3462_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/336f6c70c574/12859_2020_3462_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/fba5c1617053/12859_2020_3462_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/5819a5769bf4/12859_2020_3462_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/daf31203f8b4/12859_2020_3462_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/9e47df2afe46/12859_2020_3462_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/85ea0b1e513a/12859_2020_3462_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/336f6c70c574/12859_2020_3462_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/fba5c1617053/12859_2020_3462_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/5819a5769bf4/12859_2020_3462_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/daf31203f8b4/12859_2020_3462_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/9e47df2afe46/12859_2020_3462_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/088d/7119171/85ea0b1e513a/12859_2020_3462_Fig6_HTML.jpg

相似文献

1
MI-MAAP: marker informativeness for multi-ancestry admixed populations.MI-MAAP:多祖混合人群的标记信息量。
BMC Bioinformatics. 2020 Apr 3;21(1):131. doi: 10.1186/s12859-020-3462-5.
2
LEI: A Novel Allele Frequency-Based Feature Selection Method for Multi-ancestry Admixed Populations.LEI:一种基于新型等位基因频率的多血统混合人群特征选择方法。
Sci Rep. 2019 Jul 31;9(1):11103. doi: 10.1038/s41598-019-47012-y.
3
An ancestry informative marker panel design for individual ancestry estimation of Hispanic population using whole exome sequencing data.基于全外显子组测序数据的西班牙裔个体祖籍信息标记面板设计用于个体祖籍估计。
BMC Genomics. 2019 Dec 30;20(Suppl 12):1007. doi: 10.1186/s12864-019-6333-6.
4
Comparison of measures of marker informativeness for ancestry and admixture mapping.比较用于祖先和混合映射的标记信息量测度。
BMC Genomics. 2011 Dec 20;12:622. doi: 10.1186/1471-2164-12-622.
5
AncestrySNPminer: a bioinformatics tool to retrieve and develop ancestry informative SNP panels.祖先 SNPminer:一个检索和开发具有祖先信息的 SNP 面板的生物信息学工具。
Genomics. 2012 Jul;100(1):57-63. doi: 10.1016/j.ygeno.2012.05.003. Epub 2012 May 11.
6
Ancestral informative marker selection and population structure visualization using sparse Laplacian eigenfunctions.利用稀疏拉普拉斯特征函数进行祖先信息标记选择和群体结构可视化。
PLoS One. 2010 Nov 4;5(11):e13734. doi: 10.1371/journal.pone.0013734.
7
PAMAM: Power analysis in multiancestry admixture mapping.PAMAM:多祖裔混合分析中的功效分析。
Genet Epidemiol. 2019 Oct;43(7):831-843. doi: 10.1002/gepi.22216. Epub 2019 Jun 26.
8
Straightforward inference of ancestry and admixture proportions through ancestry-informative insertion deletion multiplexing.通过基于祖先信息的插入缺失多重PCR 直接推断祖先和混合比例。
PLoS One. 2012;7(1):e29684. doi: 10.1371/journal.pone.0029684. Epub 2012 Jan 17.
9
Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.利用多个群体的等位基因频率从DNA序列数据中快速推断个体祖先。
BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.
10
: Statistical Power and Sample Size Estimation for Mapping Genetic Loci in Admixed Populations.: 混合人群中遗传基因座定位的统计功效和样本量估计。
Genetics. 2017 Nov;207(3):873-882. doi: 10.1534/genetics.117.300312. Epub 2017 Sep 26.

引用本文的文献

1
Gene variants, oxidative stress and inflammation in Colombian populations.哥伦比亚人群中的基因变异、氧化应激与炎症
Biomedica. 2025 May 30;45(2):244-266. doi: 10.7705/biomedica.7220.
2
A Pipeline and Recommendations for Population and Individual Diagnostic SNP Selection in Non-Model Species.非模式物种群体和个体诊断性单核苷酸多态性选择的流程及建议
Mol Ecol Resour. 2025 Apr;25(3):e14048. doi: 10.1111/1755-0998.14048. Epub 2024 Nov 29.
3
Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.

本文引用的文献

1
LEI: A Novel Allele Frequency-Based Feature Selection Method for Multi-ancestry Admixed Populations.LEI:一种基于新型等位基因频率的多血统混合人群特征选择方法。
Sci Rep. 2019 Jul 31;9(1):11103. doi: 10.1038/s41598-019-47012-y.
2
GENEASE: real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.GENEASE:用于多组学和疾病本体探索、分析和可视化的实时生物信息学工具。
Bioinformatics. 2018 Sep 15;34(18):3160-3168. doi: 10.1093/bioinformatics/bty182.
3
The ExAC browser: displaying reference data information from over 60 000 exomes.
充分利用 SNP 阵列:提取潜在基因组结构的工具的系统评价。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac043.
ExAC浏览器:展示来自6万多个外显子组的参考数据信息。
Nucleic Acids Res. 2017 Jan 4;45(D1):D840-D845. doi: 10.1093/nar/gkw971. Epub 2016 Nov 28.
4
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
5
The Genotype-Tissue Expression (GTEx) Project.基因型-组织表达(GTEx)项目
Biopreserv Biobank. 2015 Oct;13(5):307-8. doi: 10.1089/bio.2015.29031.hmm.
6
Mapping asthma-associated variants in admixed populations.在混合人群中定位哮喘相关变异
Front Genet. 2015 Sep 29;6:292. doi: 10.3389/fgene.2015.00292. eCollection 2015.
7
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
8
The Genotype-Tissue Expression (GTEx) project.基因型-组织表达 (GTEx) 项目。
Nat Genet. 2013 Jun;45(6):580-5. doi: 10.1038/ng.2653.
9
Annotation of functional variation in personal genomes using RegulomeDB.利用 RegulomeDB 注释个人基因组中的功能变异。
Genome Res. 2012 Sep;22(9):1790-7. doi: 10.1101/gr.137323.112.
10
AncestrySNPminer: a bioinformatics tool to retrieve and develop ancestry informative SNP panels.祖先 SNPminer:一个检索和开发具有祖先信息的 SNP 面板的生物信息学工具。
Genomics. 2012 Jul;100(1):57-63. doi: 10.1016/j.ygeno.2012.05.003. Epub 2012 May 11.