Xu Chengfeng, Zhang Ruochi, Duan Meiyu, Zhou Yongming, Bao Jizhang, Lu Hao, Wang Jie, Hu Minghui, Hu Zhaoyang, Zhou Fengfeng, Zhu Wenwei
Department of Hematology, Yueyang Hospital of Integrated Traditional Chinese and Western Medicine, Shanghai University of Traditional Chinese Medicine, 110 Ganhe Road, Hongkou District, Shanghai 200437, China.
College of Computer Science and Technology, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, Jilin 130012, China.
Mol Ther Nucleic Acids. 2022 Apr 6;28:477-487. doi: 10.1016/j.omtn.2022.04.004. eCollection 2022 Jun 14.
Immune thrombocytopenia (ITP) is an autoimmune disease with the typical symptom of a low platelet count in blood. ITP demonstrated age and sex biases in both occurrences and prognosis, and adult ITP was mainly induced by the living environments. The current diagnosis guideline lacks the integration of molecular heterogenicity. This study recruited the largest cohort of platelet transcriptome samples. A comprehensive procedure of feature selection, feature engineering, and stacking classification was carried out to detect the ITP biomarkers using RNA sequencing (RNA-seq) transcriptomes. The 40 detected biomarkers were loaded to train the final ITP detection model, with an overall accuracy 0.974. The biomarkers suggested that ITP onset may be associated with various transcribed components, including protein-coding genes, long intergenic non-coding RNA (lincRNA) genes, and pseudogenes with apparent transcriptions. The delivered ITP detection model may also be utilized as a complementary ITP diagnosis tool. The code and the example dataset is freely available on http://www.healthinformaticslab.org/supp/resources.php.
免疫性血小板减少症(ITP)是一种自身免疫性疾病,典型症状是血液中血小板计数低。ITP在发病率和预后方面均表现出年龄和性别差异,成人ITP主要由生活环境诱发。当前的诊断指南缺乏对分子异质性的整合。本研究收集了最大规模的血小板转录组样本队列。通过RNA测序(RNA-seq)转录组,开展了特征选择、特征工程和堆叠分类的综合流程来检测ITP生物标志物。将检测到的40种生物标志物用于训练最终的ITP检测模型,总体准确率为0.974。这些生物标志物表明,ITP的发病可能与多种转录成分有关,包括蛋白质编码基因、长链基因间非编码RNA(lincRNA)基因和具有明显转录的假基因。所提供的ITP检测模型也可作为ITP诊断的补充工具。代码和示例数据集可在http://www.healthinformaticslab.org/supp/resources.php上免费获取。