Division of Clinical Research, The First Hospital of Jilin University, 1 Xinmin Street, Changchun, Jilin, 130021, People's Republic of China.
Department of Internal Medicine, College of Medicine, University of Kentucky, 800 Rose St, Lexington, KY, 40536, USA.
Hum Genomics. 2021 Apr 20;15(1):23. doi: 10.1186/s40246-021-00323-6.
Psoriasis is an immune-mediated, inflammatory disorder of the skin with chronic inflammation and hyper-proliferation of the epidermis. Since psoriasis has genetic components and the diseased tissue of psoriasis is very easily accessible, it is natural to use high-throughput technologies to characterize psoriasis and thus seek targeted therapies. Transcriptional profiles change correspondingly after an intervention. Unlike cross-sectional gene expression data, longitudinal gene expression data can capture the dynamic changes and thus facilitate causal inference.
Using the iCluster method as a building block, an ensemble method was proposed and applied to a longitudinal gene expression dataset for psoriasis, with the objective of identifying key lncRNAs that can discriminate the responders from the non-responders to two immune treatments of psoriasis.
Using support vector machine models, the leave-one-out predictive accuracy of the 20-lncRNA signature identified by this ensemble was estimated as 80%, which outperforms several competing methods. Furthermore, pathway enrichment analysis was performed on the target mRNAs of the identified lncRNAs. Of the enriched GO terms or KEGG pathways, proteasome, and protein deubiquitination is included. The ubiquitination-proteasome system is regarded as a key player in psoriasis, and a proteasome inhibitor to target ubiquitination pathway holds promises for treating psoriasis.
An integrative method such as iCluster for multiple data integration can be adopted directly to analyze longitudinal gene expression data, which offers more promising options for longitudinal big data analysis. A comprehensive evaluation and validation of the resulting 20-lncRNA signature is highly desirable.
银屑病是一种免疫介导的、炎症性皮肤疾病,具有慢性炎症和表皮过度增殖的特点。由于银屑病具有遗传成分,且银屑病的病变组织易于获取,因此很自然地可以利用高通量技术对银屑病进行特征描述,从而寻求靶向治疗。干预后转录谱会相应改变。与横断面基因表达数据不同,纵向基因表达数据可以捕捉到动态变化,从而有助于因果推断。
本研究使用 iCluster 方法作为构建块,提出了一种集成方法,并将其应用于银屑病的纵向基因表达数据集,目的是识别能够区分两种银屑病免疫治疗应答者和非应答者的关键 lncRNAs。
使用支持向量机模型,通过该集成方法识别的 20 个 lncRNA 特征的留一法预测准确性估计为 80%,优于几种竞争方法。此外,对鉴定出的 lncRNAs 的靶 mRNAs 进行了通路富集分析。在富集的 GO 术语或 KEGG 通路中,包括蛋白酶体和蛋白质去泛素化。泛素蛋白酶体系统被认为是银屑病的关键参与者,靶向泛素化途径的蛋白酶体抑制剂有望用于治疗银屑病。
可以直接采用 iCluster 等集成方法对纵向基因表达数据进行分析,为纵向大数据分析提供了更有前景的选择。对得到的 20 个 lncRNA 特征进行全面评估和验证是非常必要的。