Archer Kellie J, Hou Jiayi, Zhou Qing, Ferber Kyle, Layne John G, Gentry Amanda E
Department of Biostatistics, Virginia Commonwealth University, Richmond, VA, USA.
Clinical and Translational Research Institute, University of California San Diego, San Diego, CA.
Cancer Inform. 2014 Dec 10;13:187-95. doi: 10.4137/CIN.S20806. eCollection 2014.
High-throughput genomic assays are performed using tissue samples with the goal of classifying the samples as normal < pre-malignant < malignant or by stage of cancer using a small set of molecular features. In such cases, molecular features monotonically associated with the ordinal response may be important to disease development; that is, an increase in the phenotypic level (stage of cancer) may be mechanistically linked through a monotonic association with gene expression or methylation levels. Though traditional ordinal response modeling methods exist, they assume independence among the predictor variables and require the number of samples (n) to exceed the number of covariates (P) included in the model. In this paper, we describe our ordinalgmifs R package, available from the Comprehensive R Archive Network, which can fit a variety of ordinal response models when the number of predictors (P) exceeds the sample size (n). R code illustrating usage is also provided.
高通量基因组分析使用组织样本进行,目的是利用一小部分分子特征将样本分类为正常<癌前<恶性,或根据癌症阶段进行分类。在这种情况下,与有序反应单调相关的分子特征可能对疾病发展很重要;也就是说,表型水平(癌症阶段)的增加可能通过与基因表达或甲基化水平的单调关联在机制上联系起来。尽管存在传统的有序反应建模方法,但它们假定预测变量之间相互独立,并且要求样本数量(n)超过模型中包含的协变量数量(P)。在本文中,我们描述了我们的ordinalgmifs R包,可从综合R存档网络获得,当预测变量数量(P)超过样本量(n)时,该包可以拟合各种有序反应模型。还提供了说明用法的R代码。