• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

下一代统计遗传学:高维数据中的建模、惩罚与优化

Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.

作者信息

Lange Kenneth, Papp Jeanette C, Sinsheimer Janet S, Sobel Eric M

机构信息

Depts of Biomathematics, Human Genetics, and Statistics, UCLA.

Dept of Human Genetics, UCLA.

出版信息

Annu Rev Stat Appl. 2014 Jan 1;1(1):279-300. doi: 10.1146/annurev-statistics-022513-115638.

DOI:10.1146/annurev-statistics-022513-115638
PMID:24955378
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4062304/
Abstract

Statistical genetics is undergoing the same transition to big data that all branches of applied statistics are experiencing. With the advent of inexpensive DNA sequencing, the transition is only accelerating. This brief review highlights some modern techniques with recent successes in statistical genetics. These include: (a) lasso penalized regression and association mapping, (b) ethnic admixture estimation, (c) matrix completion for genotype and sequence data, (d) the fused lasso and copy number variation, (e) haplotyping, (f) estimation of relatedness, (g) variance components models, and (h) rare variant testing. For more than a century, genetics has been both a driver and beneficiary of statistical theory and practice. This symbiotic relationship will persist for the foreseeable future.

摘要

统计遗传学正在经历与应用统计学所有分支相同的向大数据的转变。随着廉价DNA测序技术的出现,这种转变正在加速。本简要综述重点介绍了一些在统计遗传学中取得近期成功的现代技术。这些技术包括:(a)套索惩罚回归与关联图谱分析,(b)族群混合估计,(c)基因型和序列数据的矩阵补全,(d)融合套索与拷贝数变异,(e)单倍型分型,(f)亲缘关系估计,(g)方差成分模型,以及(h)罕见变异检测。一个多世纪以来,遗传学一直是统计理论与实践的推动者和受益者。这种共生关系在可预见的未来将持续存在。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/140e/4062304/258ed30c30f7/nihms576361f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/140e/4062304/258ed30c30f7/nihms576361f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/140e/4062304/258ed30c30f7/nihms576361f1.jpg

相似文献

1
Next Generation Statistical Genetics: Modeling, Penalization, and Optimization in High-Dimensional Data.下一代统计遗传学:高维数据中的建模、惩罚与优化
Annu Rev Stat Appl. 2014 Jan 1;1(1):279-300. doi: 10.1146/annurev-statistics-022513-115638.
2
Ridge Penalization in High-Dimensional Testing With Applications to Imaging Genetics.高维检验中的岭惩罚及其在影像遗传学中的应用
Front Neurosci. 2022 Mar 24;16:836100. doi: 10.3389/fnins.2022.836100. eCollection 2022.
3
RECONSTRUCTING DNA COPY NUMBER BY PENALIZED ESTIMATION AND IMPUTATION.通过惩罚估计和插补重建DNA拷贝数
Ann Appl Stat. 2010 Dec 1;4(4):1749-1773. doi: 10.1214/10-AOAS357.
4
A Brief Survey of Modern Optimization for Statisticians.统计学家的现代优化方法简要概述。
Int Stat Rev. 2014 Apr 1;82(1):46-70. doi: 10.1111/insr.12022.
5
A penalized regression approach for DNA copy number study using the sequencing data.一种使用测序数据进行DNA拷贝数研究的惩罚回归方法。
Stat Appl Genet Mol Biol. 2019 May 30;18(4):sagmb-2018-0001. doi: 10.1515/sagmb-2018-0001.
6
Multi-centre evaluation of a comprehensive preimplantation genetic test through haplotyping-by-sequencing.通过测序单倍型分析对综合植入前遗传学检测的多中心评估。
Hum Reprod. 2019 Aug 1;34(8):1608-1619. doi: 10.1093/humrep/dez106.
7
Detection of common copy number variation with application to population clustering from next generation sequencing data.利用下一代测序数据检测常见拷贝数变异并应用于群体聚类
Annu Int Conf IEEE Eng Med Biol Soc. 2012;2012:1246-9. doi: 10.1109/EMBC.2012.6346163.
8
Approaches to Regularized Regression - A Comparison between Gradient Boosting and the Lasso.正则化回归方法——梯度提升与套索法的比较
Methods Inf Med. 2016 Oct 17;55(5):422-430. doi: 10.3414/ME16-01-0033. Epub 2016 Sep 14.
9
High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型:作为模型构建过程一部分的惩罚项选择
Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.
10
Testing genetic association with rare and common variants in family data.在家族数据中检测与罕见和常见变异的基因关联性。
Genet Epidemiol. 2014 Sep;38 Suppl 1(0 1):S37-43. doi: 10.1002/gepi.21823.

引用本文的文献

1
TARGETING UNDERREPRESENTED POPULATIONS IN PRECISION MEDICINE: A FEDERATED TRANSFER LEARNING APPROACH.精准医学中针对代表性不足人群:一种联邦迁移学习方法。
Ann Appl Stat. 2023 Dec;17(4):2970-2992. doi: 10.1214/23-AOAS1747. Epub 2023 Oct 30.
2
Inflammatory Biomarkers in Newly Diagnosed Patients With Parkinson Disease and Related Neurodegenerative Disorders.炎症生物标志物在新发帕金森病及相关神经退行性疾病患者中的研究进展。
Neurol Neuroimmunol Neuroinflamm. 2023 May 31;10(4). doi: 10.1212/NXI.0000000000200132. Print 2023 Jul.
3
Sparse Regression in Cancer Genomics: Comparing Variable Selection and Predictions in Real World Data.

本文引用的文献

1
: Coordinate Descent With Nonconvex Penalties.带非凸惩罚项的坐标下降法
J Am Stat Assoc. 2011;106(495):1125-1138. doi: 10.1198/jasa.2011.tm09738.
2
PUMA: a unified framework for penalized multiple regression analysis of GWAS data.PUMA:用于 GWAS 数据分析的惩罚性多重回归分析的统一框架。
PLoS Comput Biol. 2013;9(6):e1003101. doi: 10.1371/journal.pcbi.1003101. Epub 2013 Jun 27.
3
Mendel: the Swiss army knife of genetic analysis programs.门德尔:遗传分析程序的瑞士军刀。
癌症基因组学中的稀疏回归:在真实世界数据中比较变量选择和预测
Cancer Inform. 2021 Nov 27;20:11769351211056298. doi: 10.1177/11769351211056298. eCollection 2021.
4
Pan-Genomic and Polymorphic Driven Prediction of Antibiotic Resistance in .泛基因组和多态性驱动的抗生素耐药性预测
Front Microbiol. 2019 Jul 4;10:1446. doi: 10.3389/fmicb.2019.01446. eCollection 2019.
5
Genetic Architecture of Primary Open-Angle Glaucoma in Individuals of African Descent: The African Descent and Glaucoma Evaluation Study III.非裔原发性开角型青光眼的遗传结构:非裔与青光眼评估研究 III。
Ophthalmology. 2019 Jan;126(1):38-48. doi: 10.1016/j.ophtha.2018.10.031. Epub 2018 Oct 21.
6
Dopaminergic and Opioid Pathways Associated with Impulse Control Disorders in Parkinson's Disease.与帕金森病冲动控制障碍相关的多巴胺能和阿片类通路
Front Neurol. 2018 Feb 28;9:109. doi: 10.3389/fneur.2018.00109. eCollection 2018.
7
Iterative hard thresholding for model selection in genome-wide association studies.全基因组关联研究中用于模型选择的迭代硬阈值法
Genet Epidemiol. 2017 Dec;41(8):756-768. doi: 10.1002/gepi.22068. Epub 2017 Sep 6.
8
Genome-wide QTL and eQTL analyses using Mendel.使用孟德尔进行全基因组QTL和eQTL分析。
BMC Proc. 2016 Oct 18;10(Suppl 7):239-244. doi: 10.1186/s12919-016-0037-6. eCollection 2016.
9
Fast Genome-Wide QTL Association Mapping on Pedigree and Population Data.基于系谱和群体数据的快速全基因组QTL关联图谱分析
Genet Epidemiol. 2017 Apr;41(3):174-186. doi: 10.1002/gepi.21988. Epub 2016 Dec 12.
10
Transforming big data into computational models for personalized medicine and health care.将大数据转化为用于个性化医疗和保健的计算模型。
Dialogues Clin Neurosci. 2016 Sep;18(3):339-343. doi: 10.31887/DCNS.2016.18.3/ssoroushmehr.
Bioinformatics. 2013 Jun 15;29(12):1568-70. doi: 10.1093/bioinformatics/btt187. Epub 2013 Apr 22.
4
MetaSeq: privacy preserving meta-analysis of sequencing-based association studies.MetaSeq:基于测序的关联研究的隐私保护元分析
Pac Symp Biocomput. 2013:356-67.
5
A kernel of truth: statistical advances in polygenic variance component models for complex human pedigrees.一个真理的核心:复杂人类家系多基因方差分量模型的统计进展。
Adv Genet. 2013;81:1-31. doi: 10.1016/B978-0-12-407677-8.00001-4.
6
Extent of linkage disequilibrium in the domestic cat, Felis silvestris catus, and its breeds.家猫(Felis silvestris catus)及其品种的连锁不平衡程度。
PLoS One. 2013;8(1):e53537. doi: 10.1371/journal.pone.0053537. Epub 2013 Jan 7.
7
Population genomic and genome-wide association studies of agroclimatic traits in sorghum.高粱农艺性状的群体基因组学和全基因组关联研究。
Proc Natl Acad Sci U S A. 2013 Jan 8;110(2):453-8. doi: 10.1073/pnas.1215985110. Epub 2012 Dec 24.
8
Genotype imputation via matrix completion.基于矩阵补全的基因型推断。
Genome Res. 2013 Mar;23(3):509-18. doi: 10.1101/gr.145821.112. Epub 2012 Dec 10.
9
Robust and powerful tests for rare variants using Fisher's method to combine evidence of association from two or more complementary tests.利用 Fisher 法对罕见变异进行稳健而强大的检验,以整合来自两个或多个互补检验的关联证据。
Genet Epidemiol. 2013 Jan;37(1):110-21. doi: 10.1002/gepi.21689. Epub 2012 Oct 2.
10
Mendel-GPU: haplotyping and genotype imputation on graphics processing units.孟德尔 GPU:图形处理单元上的单倍型分型和基因型推断。
Bioinformatics. 2012 Nov 15;28(22):2979-80. doi: 10.1093/bioinformatics/bts536. Epub 2012 Sep 5.