• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

拉普拉斯特征函数可学习群体结构。

Laplacian eigenfunctions learn population structure.

机构信息

Department of Radiology, The University of Chicago, Chicago, Illinois, United States of America.

出版信息

PLoS One. 2009 Dec 1;4(12):e7928. doi: 10.1371/journal.pone.0007928.

DOI:10.1371/journal.pone.0007928
PMID:19956572
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2779848/
Abstract

Principal components analysis has been used for decades to summarize genetic variation across geographic regions and to infer population migration history. More recently, with the advent of genome-wide association studies of complex traits, it has become a commonly-used tool for detection and correction of confounding due to population structure. However, principal components are generally sensitive to outliers. Recently there has also been concern about its interpretation. Motivated from geometric learning, we describe a method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structure. In simulations and real data on a ring species of birds, Laplacian eigenfunctions reveal more meaningful and less noisy structure of the underlying population, compared with principal components. The proposed approach is simple and computationally fast. It is expected to become a promising and basic method for population genetics and disease association studies.

摘要

主成分分析已被广泛应用于数十年,用于概括地理区域的遗传变异,并推断人口迁移历史。最近,随着复杂性状的全基因组关联研究的出现,它已成为一种常用的工具,用于检测和校正由于人口结构引起的混杂。然而,主成分通常对异常值很敏感。最近也有人对其解释表示担忧。受几何学习的启发,我们描述了一种基于谱图理论的方法。将每个研究对象视为一个节点,其边缘与相邻节点的权重适当定义,就可以形成一个加权图。我们建议使用关联图拉普拉斯算子的谱,即拉普拉斯特征函数,来推断人口结构。在鸟类环种的模拟和真实数据中,与主成分相比,拉普拉斯特征函数揭示了更有意义且噪声更小的底层人口结构。所提出的方法简单且计算速度快。预计它将成为群体遗传学和疾病关联研究的一种有前途的基本方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/4d35c0c59cfc/pone.0007928.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/64816c081aaa/pone.0007928.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/3e2d85de4609/pone.0007928.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/af5c984f68a3/pone.0007928.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/9f10f61c6db9/pone.0007928.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/4d35c0c59cfc/pone.0007928.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/64816c081aaa/pone.0007928.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/3e2d85de4609/pone.0007928.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/af5c984f68a3/pone.0007928.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/9f10f61c6db9/pone.0007928.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5854/2779848/4d35c0c59cfc/pone.0007928.g005.jpg

相似文献

1
Laplacian eigenfunctions learn population structure.拉普拉斯特征函数可学习群体结构。
PLoS One. 2009 Dec 1;4(12):e7928. doi: 10.1371/journal.pone.0007928.
2
Graphic analysis of population structure on genome-wide rheumatoid arthritis data.基于全基因组类风湿性关节炎数据的群体结构图形分析。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S110. doi: 10.1186/1753-6561-3-s7-s110.
3
Ancestral informative marker selection and population structure visualization using sparse Laplacian eigenfunctions.利用稀疏拉普拉斯特征函数进行祖先信息标记选择和群体结构可视化。
PLoS One. 2010 Nov 4;5(11):e13734. doi: 10.1371/journal.pone.0013734.
4
Discovering genetic ancestry using spectral graph theory.利用谱图理论探寻遗传渊源。
Genet Epidemiol. 2010 Jan;34(1):51-9. doi: 10.1002/gepi.20434.
5
GRAF-pop: A Fast Distance-Based Method To Infer Subject Ancestry from Multiple Genotype Datasets Without Principal Components Analysis.GRAF-pop:一种无需主成分分析即可基于距离推断个体祖先的快速方法,适用于多种基因型数据集。
G3 (Bethesda). 2019 Aug 8;9(8):2447-2461. doi: 10.1534/g3.118.200925.
6
PCA via joint graph Laplacian and sparse constraint: Identification of differentially expressed genes and sample clustering on gene expression data.基于联合图拉普拉斯和稀疏约束的主成分分析:在基因表达数据上进行差异表达基因的识别和样本聚类。
BMC Bioinformatics. 2019 Dec 30;20(Suppl 22):716. doi: 10.1186/s12859-019-3229-z.
7
A spectral graph convolution for signed directed graphs via magnetic Laplacian.基于磁拉普拉斯的有向符号图的谱图卷积。
Neural Netw. 2023 Jul;164:562-574. doi: 10.1016/j.neunet.2023.05.009. Epub 2023 May 12.
8
A graph-Laplacian-based feature extraction algorithm for neural spike sorting.一种基于图拉普拉斯算子的神经尖峰分类特征提取算法。
Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:3142-5. doi: 10.1109/IEMBS.2009.5332571.
9
Learning eigenfunctions links spectral embedding and kernel PCA.学习特征函数将谱嵌入与核主成分分析联系起来。
Neural Comput. 2004 Oct;16(10):2197-219. doi: 10.1162/0899766041732396.
10
Pattern vectors from algebraic graph theory.来自代数图论的模式向量。
IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1112-24. doi: 10.1109/TPAMI.2005.145.

引用本文的文献

1
Robust genome-wide ancestry inference for heterogeneous datasets: illustrated using the 1,000 genome project with 3D facial images.利用 1000 基因组计划与 3D 面部图像进行异构数据集的全基因组祖先推断:实例研究。
Sci Rep. 2020 Jul 16;10(1):11850. doi: 10.1038/s41598-020-68259-w.
2
Association Analysis in Rice: From Application to Utilization.水稻中的关联分析:从应用到利用
Front Plant Sci. 2016 Aug 17;7:1202. doi: 10.3389/fpls.2016.01202. eCollection 2016.
3
Retrospective Binary-Trait Association Test Elucidates Genetic Architecture of Crohn Disease.

本文引用的文献

1
Graphic analysis of population structure on genome-wide rheumatoid arthritis data.基于全基因组类风湿性关节炎数据的群体结构图形分析。
BMC Proc. 2009 Dec 15;3 Suppl 7(Suppl 7):S110. doi: 10.1186/1753-6561-3-s7-s110.
2
Discovering genetic ancestry using spectral graph theory.利用谱图理论探寻遗传渊源。
Genet Epidemiol. 2010 Jan;34(1):51-9. doi: 10.1002/gepi.20434.
3
The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research.人口参考样本(POPRES):用于人口、疾病和药物遗传学研究的资源。
回顾性二元性状关联测试揭示克罗恩病的遗传结构。
Am J Hum Genet. 2016 Feb 4;98(2):243-55. doi: 10.1016/j.ajhg.2015.12.012. Epub 2016 Jan 28.
4
Detecting individual ancestry in the human genome.检测人类基因组中的个体血统。
Investig Genet. 2015 May 1;6:7. doi: 10.1186/s13323-015-0019-x. eCollection 2015.
5
Challenges in analysis and interpretation of microsatellite data for population genetic studies.群体遗传学研究中分析和解释微卫星数据所面临的挑战。
Ecol Evol. 2014 Nov;4(22):4399-428. doi: 10.1002/ece3.1305. Epub 2014 Oct 30.
6
Semi-supervised spectral clustering with application to detect population stratification.半监督谱聚类及其在群体分层检测中的应用
Front Genet. 2013 Oct 25;4:215. doi: 10.3389/fgene.2013.00215. eCollection 2013.
7
Genome-wide association study heterogeneous cohort homogenization via subject weight knock-down.通过受试者体重降低实现全基因组关联研究异质队列同质化
PLoS One. 2012;7(10):e48653. doi: 10.1371/journal.pone.0048653. Epub 2012 Oct 29.
8
Correcting for Population Stratification in Genomewide Association Studies.在全基因组关联研究中校正群体分层
J Am Stat Assoc. 2011 Sep 1;106(495):997-1008. doi: 10.1198/jasa.2011.tm10294.
9
Manifold learning for human population structure studies.用于人类群体结构研究的流形学习。
PLoS One. 2012;7(1):e29901. doi: 10.1371/journal.pone.0029901. Epub 2012 Jan 17.
10
Population structure and genetic diversity in a rice core collection (Oryza sativa L.) investigated with SSR markers.利用 SSR 标记研究水稻核心种质资源的群体结构和遗传多样性。
PLoS One. 2011;6(12):e27565. doi: 10.1371/journal.pone.0027565. Epub 2011 Dec 2.
Am J Hum Genet. 2008 Sep;83(3):347-58. doi: 10.1016/j.ajhg.2008.08.005. Epub 2008 Aug 28.
4
Principal component analysis of genetic data.遗传数据的主成分分析
Nat Genet. 2008 May;40(5):491-2. doi: 10.1038/ng0508-491.
5
Interpreting principal component analyses of spatial population genetic variation.解读空间群体遗传变异的主成分分析
Nat Genet. 2008 May;40(5):646-9. doi: 10.1038/ng.139. Epub 2008 Apr 20.
6
Principal components analysis corrects for stratification in genome-wide association studies.主成分分析可校正全基因组关联研究中的分层现象。
Nat Genet. 2006 Aug;38(8):904-9. doi: 10.1038/ng1847. Epub 2006 Jul 23.
7
Speciation by distance in a ring species.环物种中的距离成种
Science. 2005 Jan 21;307(5708):414-6. doi: 10.1126/science.1105201.
8
The effects of human population structure on large genetic association studies.人类种群结构对大型基因关联研究的影响。
Nat Genet. 2004 May;36(5):512-7. doi: 10.1038/ng1337. Epub 2004 Mar 28.
9
Assessing the impact of population stratification on genetic association studies.评估群体分层对基因关联研究的影响。
Nat Genet. 2004 Apr;36(4):388-93. doi: 10.1038/ng1333. Epub 2004 Mar 28.
10
Qualitative semi-parametric test for genetic associations in case-control designs under structured populations.结构化人群中病例对照设计下基因关联的定性半参数检验。
Ann Hum Genet. 2003 May;67(Pt 3):250-64. doi: 10.1046/j.1469-1809.2003.00036.x.