Groth Theodore, Gunawan Rudiyanto, Neelamegham Sriram
Chemical and Biological Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA.
Biomedical Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA.
Beilstein J Org Chem. 2021 Jul 22;17:1712-1724. doi: 10.3762/bjoc.17.119. eCollection 2021.
Glycosylation is a common posttranslational modification, and glycan biosynthesis is regulated by a set of glycogenes. The role of transcription factors (TFs) in regulating the glycogenes and related glycosylation pathways is largely unknown. In this work, we performed data mining of TF-glycogene relationships from the Cistrome Cancer database (DB), which integrates chromatin immunoprecipitation sequencing (ChIP-Seq) and RNA-Seq data to constitute regulatory relationships. In total, we observed 22,654 potentially significant TF-glycogene relationships, which include interactions involving 526 unique TFs and 341 glycogenes that span 29 the Cancer Genome Atlas (TCGA) cancer types. Here, TF-glycogene interactions appeared in clusters or so-called communities, suggesting that changes in single TF expression during both health and disease may affect multiple carbohydrate structures. Upon applying the Fisher's exact test along with glycogene pathway classification, we identified TFs that may specifically regulate the biosynthesis of individual glycan types. Integration with Reactome DB knowledge provided an avenue to relate cell-signaling pathways to TFs and cellular glycosylation state. Whereas analysis results are presented for all 29 cancer types, specific focus is placed on human luminal and basal breast cancer disease progression. Overall, the article presents a computational approach to describe TF-glycogene relationships, the starting point for experimental system-wide validation.
糖基化是一种常见的翻译后修饰,聚糖生物合成受一组糖基因调控。转录因子(TFs)在调控糖基因和相关糖基化途径中的作用在很大程度上尚不清楚。在这项工作中,我们从Cistrome癌症数据库(DB)中对TF-糖基因关系进行了数据挖掘,该数据库整合了染色质免疫沉淀测序(ChIP-Seq)和RNA-Seq数据以构建调控关系。我们总共观察到22,654个潜在的显著TF-糖基因关系,其中包括涉及526个独特TF和341个糖基因的相互作用,这些糖基因涵盖了29种癌症基因组图谱(TCGA)癌症类型。在这里,TF-糖基因相互作用以簇或所谓的群落形式出现,这表明在健康和疾病状态下单个TF表达的变化可能会影响多种碳水化合物结构。在应用Fisher精确检验并结合糖基因途径分类后,我们确定了可能特异性调节单个聚糖类型生物合成的TF。与Reactome数据库知识的整合为将细胞信号通路与TF和细胞糖基化状态联系起来提供了一条途径。虽然分析结果针对所有29种癌症类型呈现,但特别关注人类管腔型和基底型乳腺癌的疾病进展。总体而言,本文提出了一种计算方法来描述TF-糖基因关系,这是全系统实验验证的起点。