Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, Esch-sur-Alzette, L-4367 Belvaux, Luxembourg.
Liver Disease Laboratory, Center for Cooperative Research in Biosciences (CIC bioGUNE), Basque Research and Technology Alliance (BRTA), Bizkaia Technology Park, Derio, Spain.
Nucleic Acids Res. 2023 Jan 6;51(D1):D877-D889. doi: 10.1093/nar/gkac862.
Prior knowledge of perturbation data can significantly assist in inferring the relationship between chemical perturbations and their specific transcriptional response. However, current databases mostly contain cancer cell lines, which are unsuitable for the aforementioned inference in non-cancer cells, such as cells related to non-cancer disease, immunology and aging. Here, we present ChemPert (https://chempert.uni.lu/), a database consisting of 82 270 transcriptional signatures in response to 2566 unique perturbagens (drugs, small molecules and protein ligands) across 167 non-cancer cell types, as well as the protein targets of 57 818 perturbagens. In addition, we develop a computational tool that leverages the non-cancer cell datasets, which enables more accurate predictions of perturbation responses and drugs in non-cancer cells compared to those based onto cancer databases. In particular, ChemPert correctly predicted drug effects for treating hepatitis and novel drugs for osteoarthritis. The ChemPert web interface is user-friendly and allows easy access of the entire datasets and the computational tool, providing valuable resources for both experimental researchers who wish to find datasets relevant to their research and computational researchers who need comprehensive non-cancer perturbation transcriptomics datasets for developing novel algorithms. Overall, ChemPert will facilitate future in silico compound screening for non-cancer cells.
扰动数据的先验知识可以极大地帮助推断化学扰动与其特定转录反应之间的关系。然而,目前的数据库主要包含癌细胞系,这些数据库不适合对非癌细胞(如与非癌症疾病、免疫学和衰老相关的细胞)进行上述推断。在这里,我们展示了 ChemPert(https://chempert.uni.lu/),这是一个包含 82270 个转录特征的数据库,这些特征是对 167 种非癌细胞类型中 2566 种独特扰动剂(药物、小分子和蛋白质配体)的响应,以及 57818 种扰动剂的蛋白质靶标。此外,我们开发了一种利用非癌细胞数据集的计算工具,与基于癌症数据库的预测相比,该工具能够更准确地预测非癌细胞中的扰动反应和药物。特别是,ChemPert 正确预测了治疗肝炎和骨关节炎的新药的药物作用。ChemPert 的网络界面用户友好,允许轻松访问整个数据集和计算工具,为希望找到与自己研究相关的数据集的实验研究人员和需要用于开发新算法的综合非癌细胞扰动转录组学数据集的计算研究人员提供了有价值的资源。总体而言,ChemPert 将促进未来针对非癌细胞的计算机化合物筛选。