• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CrossICC:无需调整批次效应的跨平台基因表达数据的迭代一致性聚类。

CrossICC: iterative consensus clustering of cross-platform gene expression data without adjusting batch effect.

机构信息

State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou 510060, China.

School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou, 510060, China.

出版信息

Brief Bioinform. 2020 Sep 25;21(5):1818-1824. doi: 10.1093/bib/bbz116.

DOI:10.1093/bib/bbz116
PMID:32978617
Abstract

UNLABELLED

Unsupervised clustering of high-throughput gene expression data is widely adopted for cancer subtyping. However, cancer subtypes derived from a single dataset are usually not applicable across multiple datasets from different platforms. Merging different datasets is necessary to determine accurate and applicable cancer subtypes but is still embarrassing due to the batch effect. CrossICC is an R package designed for the unsupervised clustering of gene expression data from multiple datasets/platforms without the requirement of batch effect adjustment. CrossICC utilizes an iterative strategy to derive the optimal gene signature and cluster numbers from a consensus similarity matrix generated by consensus clustering. This package also provides abundant functions to visualize the identified subtypes and evaluate subtyping performance. We expected that CrossICC could be used to discover the robust cancer subtypes with significant translational implications in personalized care for cancer patients.

AVAILABILITY AND IMPLEMENTATION

The package is implemented in R and available at GitHub (https://github.com/bioinformatist/CrossICC) and Bioconductor (http://bioconductor.org/packages/release/bioc/html/CrossICC.html) under the GPL v3 License.

摘要

未加标签

非监督聚类的高通量基因表达数据被广泛应用于癌症分型。然而,从单个数据集获得的癌症亚型通常不适用于来自不同平台的多个数据集。合并不同的数据集对于确定准确和适用的癌症亚型是必要的,但由于批次效应仍然令人尴尬。CrossICC 是一个 R 包,用于在没有批次效应调整要求的情况下对来自多个数据集/平台的基因表达数据进行无监督聚类。CrossICC 利用迭代策略从共识聚类生成的共识相似性矩阵中推导出最优基因特征和聚类数量。该软件包还提供了丰富的功能来可视化鉴定的亚型,并评估亚型划分性能。我们期望 CrossICC 能够用于发现具有显著转化意义的稳健癌症亚型,从而为癌症患者的个性化治疗提供帮助。

可用性和实现

该软件包是用 R 编写的,并可在 GitHub(https://github.com/bioinformatist/CrossICC)和 Bioconductor(http://bioconductor.org/packages/release/bioc/html/CrossICC.html)上使用,遵循 GPLv3 许可证。

相似文献

1
CrossICC: iterative consensus clustering of cross-platform gene expression data without adjusting batch effect.CrossICC:无需调整批次效应的跨平台基因表达数据的迭代一致性聚类。
Brief Bioinform. 2020 Sep 25;21(5):1818-1824. doi: 10.1093/bib/bbz116.
2
MethylMix 2.0: an R package for identifying DNA methylation genes.MethylMix 2.0:用于鉴定 DNA 甲基化基因的 R 包。
Bioinformatics. 2018 Sep 1;34(17):3044-3046. doi: 10.1093/bioinformatics/bty156.
3
mAPKL: R/ Bioconductor package for detecting gene exemplars and revealing their characteristics.mAPKL:用于检测基因范例并揭示其特征的R/Bioconductor软件包。
BMC Bioinformatics. 2015 Sep 15;16(1):291. doi: 10.1186/s12859-015-0719-5.
4
Genefu: an R/Bioconductor package for computation of gene expression-based signatures in breast cancer.Genefu:一个用于计算基于基因表达的乳腺癌特征的R/Bioconductor软件包。
Bioinformatics. 2016 Apr 1;32(7):1097-9. doi: 10.1093/bioinformatics/btv693. Epub 2015 Nov 24.
5
MultiBaC: an R package to remove batch effects in multi-omic experiments.MultiBaC:一个用于去除多组学实验中批次效应的 R 包。
Bioinformatics. 2022 Apr 28;38(9):2657-2658. doi: 10.1093/bioinformatics/btac132.
6
Pathway-based deep clustering for molecular subtyping of cancer.基于通路的深度聚类在癌症分子分型中的应用。
Methods. 2020 Feb 15;173:24-31. doi: 10.1016/j.ymeth.2019.06.017. Epub 2019 Jun 25.
7
Overcoming the impacts of two-step batch effect correction on gene expression estimation and inference.克服两步批处理效应校正对基因表达估计和推断的影响。
Biostatistics. 2023 Jul 14;24(3):635-652. doi: 10.1093/biostatistics/kxab039.
8
GeneExpressionSignature: an R package for discovering functional connections using gene expression signatures.基因表达特征:一个使用基因表达特征发现功能关联的 R 包。
OMICS. 2013 Feb;17(2):116-8. doi: 10.1089/omi.2012.0087.
9
Robust clustering of noisy high-dimensional gene expression data for patients subtyping.对噪声高维基因表达数据进行稳健聚类,以对患者进行亚型划分。
Bioinformatics. 2018 Dec 1;34(23):4064-4072. doi: 10.1093/bioinformatics/bty502.
10
intCC: An efficient weighted integrative consensus clustering of multimodal data.intCC:一种高效的多模态数据加权综合共识聚类方法。
Pac Symp Biocomput. 2024;29:627-640.

引用本文的文献

1
DNA methylation in esophageal cancer: technological advances and early detection clinical applications.食管癌中的DNA甲基化:技术进展与早期检测的临床应用
Front Oncol. 2025 Jul 11;15:1543190. doi: 10.3389/fonc.2025.1543190. eCollection 2025.
2
The molecular subtypes of autoimmune diseases.自身免疫性疾病的分子亚型
Comput Struct Biotechnol J. 2024 Mar 28;23:1348-1363. doi: 10.1016/j.csbj.2024.03.026. eCollection 2024 Dec.
3
A precise molecular subtyping of ulcerative colitis reveals the immune heterogeneity and predicts clinical drug responses.
溃疡性结肠炎的精确分子分型揭示了免疫异质性,并预测了临床药物反应。
J Transl Med. 2023 Jul 13;21(1):466. doi: 10.1186/s12967-023-04326-w.
4
PredPromoter-MF(2L): A Novel Approach of Promoter Prediction Based on Multi-source Feature Fusion and Deep Forest.启动子预测-MF(2L):一种基于多源特征融合和深度森林的新型启动子预测方法。
Interdiscip Sci. 2022 Sep;14(3):697-711. doi: 10.1007/s12539-022-00520-4. Epub 2022 Apr 30.
5
An integrated bioinformatic investigation of mitochondrial solute carrier family 25 (SLC25) in colon cancer followed by preliminary validation of member 5 (SLC25A5) in tumorigenesis.对结肠癌中线粒体溶质载体家族 25(SLC25)的综合生物信息学研究,随后对成员 5(SLC25A5)在肿瘤发生中的初步验证。
Cell Death Dis. 2022 Mar 14;13(3):237. doi: 10.1038/s41419-022-04692-1.
6
Multi-Omics Characterization of Tumor Microenvironment Heterogeneity and Immunotherapy Resistance Through Cell States-Based Subtyping in Bladder Cancer.通过基于细胞状态的亚型分析对膀胱癌肿瘤微环境异质性和免疫治疗耐药性进行多组学表征
Front Cell Dev Biol. 2022 Feb 9;9:809588. doi: 10.3389/fcell.2021.809588. eCollection 2021.
7
Multivariate meta-analysis reveals global transcriptomic signatures underlying distinct human naive-like pluripotent states.多变量荟萃分析揭示了不同人类原始样多能状态的全局转录组特征。
PLoS One. 2021 May 13;16(5):e0251461. doi: 10.1371/journal.pone.0251461. eCollection 2021.