Kostich Mitchell S
USEPA/ORD/NERL/EMMD, 26 West M. L. King Drive, Cincinnati, OH 45268, USA.
Chemosphere. 2017 Dec;188:49-59. doi: 10.1016/j.chemosphere.2017.08.136. Epub 2017 Aug 28.
Use of 'omics technologies in environmental science is expanding. However, application is mostly restricted to characterizing molecular steps leading from toxicant interaction with molecular receptors to apical endpoints in laboratory species. Use in environmental decision-making is limited, due to difficulty in elucidating mechanisms in sufficient detail to make quantitative outcome predictions in any single species or in extending predictions to aquatic communities. Here we introduce a mechanism-agnostic statistical approach, supplementing mechanistic investigation by allowing probabilistic outcome prediction even when understanding of molecular pathways is limited, and facilitating extrapolation from results in laboratory test species to predictions about aquatic communities. We use concepts familiar to environmental managers, supplemented with techniques employed for clinical interpretation of 'omics-based biomedical tests. We describe the framework in step-wise fashion, beginning with single test replicates of a single RNA variant, then extending to multi-gene RNA profiling, collections of test replicates, and integration of complementary data. In order to simplify the presentation, we focus on using RNA profiling for distinguishing presence versus absence of chemical hazards, but the principles discussed can be extended to other types of 'omics measurements, multi-class problems, and regression. We include a supplemental file demonstrating many of the concepts using the open source R statistical package.
“组学”技术在环境科学中的应用正在不断扩展。然而,其应用大多局限于描述从毒物与分子受体相互作用到实验室物种顶端终点的分子步骤。由于难以详细阐明机制以对任何单一物种进行定量结果预测,或将预测扩展到水生群落,因此在环境决策中的应用有限。在此,我们引入一种与机制无关的统计方法,即使在对分子途径的理解有限时,也能通过允许概率性结果预测来补充机制研究,并促进从实验室测试物种的结果推断到对水生群落的预测。我们运用环境管理者熟悉的概念,并辅以用于基于组学的生物医学测试临床解释的技术。我们以逐步的方式描述该框架,首先从单个RNA变体的单测试重复开始,然后扩展到多基因RNA谱分析、测试重复的集合以及补充数据的整合。为了简化表述,我们专注于使用RNA谱分析来区分化学危害的存在与否,但所讨论的原理可扩展到其他类型的组学测量、多类问题和回归。我们包含一个补充文件,使用开源R统计软件包展示了许多概念。