• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Computation of ancestry scores with mixed families and unrelated individuals.使用混合家庭和无关个体计算血统分数。
Biometrics. 2018 Mar;74(1):155-164. doi: 10.1111/biom.12708. Epub 2017 Apr 27.
2
Eigenvalue significance testing for genetic association.基因关联的特征值显著性检验
Biometrics. 2018 Jun;74(2):439-447. doi: 10.1111/biom.12767. Epub 2017 Aug 29.
3
A fast indirect method to compute functions of genomic relationships concerning genotyped and ungenotyped individuals, for diversity management.一种快速的间接方法,用于计算与已基因型和未基因型个体有关的基因组关系的函数,用于多样性管理。
Genet Sel Evol. 2017 Dec 1;49(1):87. doi: 10.1186/s12711-017-0363-9.
4
sim1000G: a user-friendly genetic variant simulator in R for unrelated individuals and family-based designs.sim1000G:一个用于无关个体和基于家系设计的 R 语言中易于使用的遗传变异模拟器。
BMC Bioinformatics. 2019 Jan 15;20(1):26. doi: 10.1186/s12859-019-2611-1.
5
Fast and robust ancestry prediction using principal component analysis.利用主成分分析进行快速稳健的祖源预测。
Bioinformatics. 2020 Jun 1;36(11):3439-3446. doi: 10.1093/bioinformatics/btaa152.
6
Evaluation of methods accounting for population structure with pedigree data and continuous outcomes.基于家系数据和连续结局评估考虑群体结构的方法。
Genet Epidemiol. 2011 Sep;35(6):427-36. doi: 10.1002/gepi.20590. Epub 2011 May 26.
7
Large-scale genomic prediction using singular value decomposition of the genotype matrix.基于基因型矩阵奇异值分解的大规模基因组预测。
Genet Sel Evol. 2018 Feb 28;50(1):6. doi: 10.1186/s12711-018-0373-2.
8
Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.在存在亲缘关系的情况下,对群体结构进行稳健推断,以进行血统预测和分层校正。
Genet Epidemiol. 2015 May;39(4):276-93. doi: 10.1002/gepi.21896. Epub 2015 Mar 23.
9
Detecting familial aggregation.检测家族聚集性。
Methods Mol Biol. 2012;850:119-50. doi: 10.1007/978-1-61779-555-8_8.
10
Comparison of three methods to estimate genetic ancestry and control for stratification in genetic association studies among admixed populations.三种估算遗传血统及控制混合人群基因关联研究中分层因素方法的比较。
Hum Genet. 2005 Dec;118(3-4):424-33. doi: 10.1007/s00439-005-0067-z. Epub 2005 Oct 6.

引用本文的文献

1
HAP-SAMPLE2: data-based resampling for association studies with admixture.HAP-SAMPLE2:用于混合样本关联研究的基于数据的重采样
Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf333.
2
Variants of IL6, IL10, FCN2, RNASE3, IL12B and IL17B loci are associated with Schistosoma mansoni worm burden in the Albert Nile region of Uganda.IL6、IL10、FCN2、RNASE3、IL12B 和 IL17B 基因座的变异与乌干达阿尔伯特尼罗地区曼氏血吸虫虫荷有关。
PLoS Negl Trop Dis. 2023 Nov 30;17(11):e0011796. doi: 10.1371/journal.pntd.0011796. eCollection 2023 Nov.
3
Identifying suicidal subtypes and dynamic indicators of increasing and decreasing suicide risk in active duty military personnel: Study protocol.识别现役军人自杀亚型以及自杀风险增减的动态指标:研究方案。
Contemp Clin Trials Commun. 2021 Feb 16;21:100752. doi: 10.1016/j.conctc.2021.100752. eCollection 2021 Mar.
4
A Review and Tutorial of Machine Learning Methods for Microbiome Host Trait Prediction.微生物组宿主性状预测的机器学习方法综述与教程
Front Genet. 2019 Jun 25;10:579. doi: 10.3389/fgene.2019.00579. eCollection 2019.
5
A survey of high dimension low sample size asymptotics.高维小样本渐近性研究
Aust N Z J Stat. 2018 Mar;60(1):4-19. doi: 10.1111/anzs.12212. Epub 2018 Mar 14.
6
Eigenvalue significance testing for genetic association.基因关联的特征值显著性检验
Biometrics. 2018 Jun;74(2):439-447. doi: 10.1111/biom.12767. Epub 2017 Aug 29.

本文引用的文献

1
The Scree Test For The Number Of Factors.因子数量的碎石检验
Multivariate Behav Res. 1966 Apr 1;1(2):245-76. doi: 10.1207/s15327906mbr0102_10.
2
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
3
Genome-wide association meta-analysis identifies five modifier loci of lung disease severity in cystic fibrosis.全基因组关联荟萃分析确定了囊性纤维化肺病严重程度的五个修饰位点。
Nat Commun. 2015 Sep 29;6:8382. doi: 10.1038/ncomms9382.
4
Testing for genetic associations in arbitrarily structured populations.在任意结构群体中进行基因关联检测。
Nat Genet. 2015 May;47(5):550-4. doi: 10.1038/ng.3244. Epub 2015 Mar 30.
5
Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness.在存在亲缘关系的情况下,对群体结构进行稳健推断,以进行血统预测和分层校正。
Genet Epidemiol. 2015 May;39(4):276-93. doi: 10.1002/gepi.21896. Epub 2015 Mar 23.
6
A genome-wide association study identifies new susceptibility loci for esophageal adenocarcinoma and Barrett's esophagus.全基因组关联研究鉴定出食管腺癌和 Barrett 食管的新易感性位点。
Nat Genet. 2013 Dec;45(12):1487-93. doi: 10.1038/ng.2796. Epub 2013 Oct 13.
7
Genome-wide association and linkage identify modifier loci of lung disease severity in cystic fibrosis at 11p13 and 20q13.2.全基因组关联和连锁分析确定了囊性纤维化肺疾病严重程度的修饰基因座,位于 11p13 和 20q13.2。
Nat Genet. 2011 Jun;43(6):539-46. doi: 10.1038/ng.838. Epub 2011 May 22.
8
CONVERGENCE AND PREDICTION OF PRINCIPAL COMPONENT SCORES IN HIGH-DIMENSIONAL SETTINGS.高维环境下主成分得分的收敛性与预测
Ann Stat. 2010 Jan 1;38(6):3605-3629. doi: 10.1214/10-AOS821.
9
Robust relationship inference in genome-wide association studies.全基因组关联研究中的稳健关系推断。
Bioinformatics. 2010 Nov 15;26(22):2867-73. doi: 10.1093/bioinformatics/btq559. Epub 2010 Oct 5.
10
SWISS MADE: Standardized WithIn Class Sum of Squares to evaluate methodologies and dataset elements.SWISS MADE:在类内平方和标准化,用于评估方法和数据集元素。
PLoS One. 2010 Mar 26;5(3):e9905. doi: 10.1371/journal.pone.0009905.

使用混合家庭和无关个体计算血统分数。

Computation of ancestry scores with mixed families and unrelated individuals.

作者信息

Zhou Yi-Hui, Marron James S, Wright Fred A

机构信息

Department of Biological Sciences, Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, U.S.A.

Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, U.S.A.

出版信息

Biometrics. 2018 Mar;74(1):155-164. doi: 10.1111/biom.12708. Epub 2017 Apr 27.

DOI:10.1111/biom.12708
PMID:28452052
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6069602/
Abstract

The issue of robustness to family relationships in computing genotype ancestry scores such as eigenvector projections has received increased attention in genetic association, and is particularly challenging when sets of both unrelated individuals and closely related family members are included. The current standard is to compute loadings (left singular vectors) using unrelated individuals and to compute projected scores for remaining family members. However, projected ancestry scores from this approach suffer from shrinkage toward zero. We consider two main novel strategies: (i) matrix substitution based on decomposition of a target family-orthogonalized covariance matrix, and (ii) using family-averaged data to obtain loadings. We illustrate the performance via simulations, including resampling from 1000 Genomes Project data, and analysis of a cystic fibrosis dataset. The matrix substitution approach has similar performance to the current standard, but is simple and uses only a genotype covariance matrix, while the family-average method shows superior performance. Our approaches are accompanied by novel ancillary approaches that provide considerable insight, including individual-specific eigenvalue scree plots.

摘要

在计算基因型祖先分数(如特征向量投影)时,对家族关系的稳健性问题在基因关联研究中受到了越来越多的关注,当纳入无关个体和密切相关的家庭成员时,这一问题尤其具有挑战性。当前的标准做法是使用无关个体计算载荷(左奇异向量),并为其余家庭成员计算投影分数。然而,这种方法得到的投影祖先分数会出现向零收缩的情况。我们考虑了两种主要的新策略:(i)基于目标家族正交化协方差矩阵分解的矩阵替换,以及(ii)使用家族平均数据来获得载荷。我们通过模拟来说明性能,包括从千人基因组计划数据中重采样,以及对囊性纤维化数据集的分析。矩阵替换方法的性能与当前标准相似,但简单且仅使用基因型协方差矩阵,而家族平均方法表现出更优的性能。我们的方法还伴随着一些新颖的辅助方法,这些方法提供了相当多的见解,包括个体特异性特征值碎石图。