Stinchcombe John R, Kelly John K
Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, M5S3B2, Canada.
Koffler Scientific Reserve at Joker's Hill, University of Toronto, King, ON, L7B1K5, Canada.
New Phytol. 2025 Sep;247(5):1994-2002. doi: 10.1111/nph.70287. Epub 2025 Jun 5.
The level and pattern of gene expression is increasingly recognized as a principal determinant of plant phenotypes and thus of fitness. The estimation of natural selection on the transcriptome is an emerging research discipline. We here review recent progress and consider the challenges posed by the high dimensionality of the transcriptome for the multiple regression methods routinely used to characterize selection in field experiments. We consider several different methods, including classical multivariate statistical approaches, regularized regression, latent factor models, and machine learning, that address the fact that the number of traits potentially affecting fitness (each expressed gene) can greatly exceed the number of plants that researchers can reasonably monitor in a field study. While such studies are currently few, extant data are sufficient to illustrate several of these approaches. With additional methodological development coupled with applications to a broader range of species, we believe prospects are favorable for directly characterizing selection on gene expression within natural plant populations.
基因表达的水平和模式越来越被认为是植物表型乃至适合度的主要决定因素。对转录组自然选择的估计是一个新兴的研究领域。我们在此回顾近期的进展,并考虑转录组的高维度给常用于表征田间实验中选择的多元回归方法带来的挑战。我们考虑了几种不同的方法,包括经典多元统计方法、正则化回归、潜在因子模型和机器学习,这些方法应对了这样一个事实,即潜在影响适合度的性状数量(每个表达的基因)可能大大超过研究人员在田间研究中能够合理监测的植物数量。虽然目前此类研究较少,但现有数据足以说明其中的几种方法。随着方法的进一步发展以及应用于更广泛的物种,我们相信直接表征自然植物种群中基因表达的选择前景良好。