lme4qtl: linear mixed models with flexible covariance structure for genetic studies of related individuals.

Affiliations

Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, United States of America.

Unitat de Genòmica de Malalties Complexes, Institut d'Investigació Biomèdica Sant Pau (IIB-Sant Pau), Barcelona, Spain.

Publication information

BMC Bioinformatics. 2018 Feb 27;19(1):68. doi: 10.1186/s12859-018-2057-x.

DOI: 10.1186/s12859-018-2057-x

PMID: 29486711

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5830078/

Abstract

BACKGROUND

Quantitative trait locus (QTL) mapping in genetic data often involves the analysis of correlated observations, and these correlations must be accounted for to avoid false association signals. This is commonly done by modeling the correlations as random effects in linear mixed models (LMMs). The R package lme4 is a well-established tool that implements the major LMM features using sparse matrix methods; however, it is not fully adapted to QTL mapping in association and linkage studies. In particular, the base version of lme4 lacks two LMM features: the definition of random effects by custom covariance matrices, and parameter constraints, which are essential in advanced QTL models. Beyond linkage studies of related individuals, these features are of high interest for association studies in which multiple covariance matrices need to be modeled, a scenario that many genome-wide association study (GWAS) software tools do not cover.
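
For orientation, an LMM whose random effects are defined by custom covariance matrices can be written in the standard variance-component form below. This is a generic textbook formulation, not notation taken from the abstract; the symbols are assumptions: y is the trait vector, X and \beta are the fixed-effect design matrix and coefficients, Z_k maps observations to the levels of the k-th random effect, A_k is a user-supplied covariance matrix (for example, a kinship-based relationship matrix), and \sigma_k^2 and \sigma_e^2 are the variance parameters to which constraints may apply.

```latex
y = X\beta + \sum_{k=1}^{K} Z_k u_k + e, \qquad
u_k \sim \mathcal{N}(0, \sigma_k^2 A_k), \qquad
e \sim \mathcal{N}(0, \sigma_e^2 I)

\operatorname{Var}(y) = \sum_{k=1}^{K} \sigma_k^2 \, Z_k A_k Z_k^{\top} + \sigma_e^2 I
```

With A_k = I the model reduces to the independent random effects that base lme4 already supports; the case of interest here is a non-identity A_k.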

RESULTS

To address these limitations, we developed a new R package, lme4qtl, as an extension of lme4. First, lme4qtl contributes new models for genetic studies within a single tool integrated with lme4 and its companion packages. Second, lme4qtl offers a flexible framework for scenarios with multiple levels of relatedness and is efficient when covariance matrices are sparse. We showed the value of our package using real family-based data from the Genetic Analysis of Idiopathic Thrombophilia 2 (GAIT2) project.
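
Why sparse covariance matrices help can be illustrated with a small sketch. It is not lme4qtl's implementation (which builds on lme4's sparse matrix machinery in R); it is a pure-Python illustration under stated assumptions: unrelated families make the trait covariance block-diagonal, so the Gaussian log-likelihood is a sum of small per-family terms. The relationship matrix for a nuclear family, the residual values, the variance components, and all function names are hypothetical and chosen only for illustration.

```python
import math

def cholesky(m):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(m)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(m[i][i] - s)
            else:
                low[i][j] = (m[i][j] - s) / low[j][j]
    return low

def gaussian_loglik(resid, cov):
    """Log-density of N(0, cov) evaluated at the residual vector."""
    n = len(resid)
    low = cholesky(cov)
    # Forward substitution solves low * z = resid, so z.z = resid' cov^-1 resid.
    z = []
    for i in range(n):
        s = sum(low[i][k] * z[k] for k in range(i))
        z.append((resid[i] - s) / low[i][i])
    logdet = 2.0 * sum(math.log(low[i][i]) for i in range(n))
    quad = sum(v * v for v in z)
    return -0.5 * (n * math.log(2.0 * math.pi) + logdet + quad)

def family_cov(rel, var_g, var_e):
    """Covariance var_g * rel + var_e * I for one family."""
    n = len(rel)
    return [[var_g * rel[i][j] + (var_e if i == j else 0.0) for j in range(n)]
            for i in range(n)]

def block_diag(blocks):
    """Assemble a dense block-diagonal matrix from square blocks."""
    n = sum(len(b) for b in blocks)
    out = [[0.0] * n for _ in range(n)]
    offset = 0
    for b in blocks:
        for i, row in enumerate(b):
            for j, v in enumerate(row):
                out[offset + i][offset + j] = v
        offset += len(b)
    return out

# Hypothetical relationship matrix (twice the kinship) for two unrelated
# parents followed by their two children.
NUCLEAR = [
    [1.0, 0.0, 0.5, 0.5],
    [0.0, 1.0, 0.5, 0.5],
    [0.5, 0.5, 1.0, 0.5],
    [0.5, 0.5, 0.5, 1.0],
]

# Made-up residuals (trait minus fixed effects) for two families.
resid_by_family = [[0.3, -1.1, 0.4, 0.9], [-0.2, 0.5, -0.7, 0.1]]
var_g, var_e = 0.6, 0.4  # illustrative genetic and residual variances

blocks = [family_cov(NUCLEAR, var_g, var_e) for _ in resid_by_family]

# Block-wise evaluation: one small factorization per family.
ll_blocks = sum(gaussian_loglik(r, c) for r, c in zip(resid_by_family, blocks))

# Dense evaluation: one factorization of the full covariance matrix.
all_resid = [x for r in resid_by_family for x in r]
ll_dense = gaussian_loglik(all_resid, block_diag(blocks))

print(ll_blocks, ll_dense)
```

Both evaluations give the same log-likelihood, but the block-wise one factorizes only small matrices, so its cost grows roughly linearly with the number of families rather than cubically with the total sample size. Sparse matrix methods exploit the same structure without requiring the blocks to be identified by hand.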

CONCLUSIONS

Our software lme4qtl enables QTL mapping models with a versatile structure of random effects and efficient computation for sparse covariances. lme4qtl is available at https://github.com/variani/lme4qtl.
