• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用新的差异样本方差基因集检验提高数据的可解释性。

Improving data interpretability with new differential sample variance gene set tests.

作者信息

Rahmatallah Yasir, Glazko Galina

机构信息

Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA.

出版信息

BMC Bioinformatics. 2025 Apr 14;26(1):103. doi: 10.1186/s12859-025-06117-0.

DOI:10.1186/s12859-025-06117-0
PMID:40229677
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11998189/
Abstract

BACKGROUND

Gene set analysis methods have played a major role in generating biological interpretations of omics data such as gene expression datasets. However, most methods focus on detecting homogenous pattern changes in mean expression while methods detecting pattern changes in variance remain poorly explored. While a few studies attempted to use gene-level variance analysis, such approach remains under-utilized. When comparing two phenotypes, gene sets with distinct changes in subgroups under one phenotype are overlooked by available methods although they reflect meaningful biological differences between two phenotypes. Multivariate sample-level variance analysis methods are needed to detect such pattern changes.

RESULTS

We used ranking schemes based on minimum spanning tree to generalize the Cramer-Von Mises and Anderson-Darling univariate statistics into multivariate gene set analysis methods to detect differential sample variance or mean. We characterized the detection power and Type I error rate of these methods in addition to two methods developed earlier using simulation results with different parameters. We applied the developed methods to microarray gene expression dataset of prednisolone-resistant and prednisolone-sensitive children diagnosed with B-lineage acute lymphoblastic leukemia and bulk RNA-sequencing gene expression dataset of benign hyperplastic polyps and potentially malignant sessile serrated adenoma/polyps. One or both of the two compared phenotypes in each of these datasets have distinct molecular subtypes that contribute to within phenotype variability and to heterogeneous differences between two compared phenotypes. Our results show that methods designed to detect differential sample variance provide meaningful biological interpretations by detecting specific hallmark gene sets associated with the two compared phenotypes as documented in available literature.

CONCLUSIONS

The results of this study demonstrate the usefulness of methods designed to detect differential sample variance in providing biological interpretations when biologically relevant but heterogeneous changes between two phenotypes are prevalent in specific signaling pathways. Software implementation of the methods is available with detailed documentation from Bioconductor package GSAR. The available methods are applicable to gene expression datasets in a normalized matrix form and could be used with other omics datasets in a normalized matrix form with available collection of feature sets.

摘要

背景

基因集分析方法在对诸如基因表达数据集等组学数据进行生物学解释方面发挥了重要作用。然而,大多数方法侧重于检测平均表达中的同质模式变化,而检测方差模式变化的方法仍未得到充分探索。虽然有一些研究尝试使用基因水平的方差分析,但这种方法仍未得到充分利用。在比较两种表型时,现有方法会忽略在一种表型下亚组中具有明显变化的基因集,尽管它们反映了两种表型之间有意义的生物学差异。需要多变量样本水平的方差分析方法来检测这种模式变化。

结果

我们使用基于最小生成树的排序方案,将克莱默 - 冯·米塞斯和安德森 - 达林单变量统计量推广为多变量基因集分析方法,以检测差异样本方差或均值。除了早期使用不同参数的模拟结果开发的两种方法外,我们还对这些方法的检测能力和I型错误率进行了表征。我们将开发的方法应用于诊断为B系急性淋巴细胞白血病的泼尼松龙耐药和泼尼松龙敏感儿童的微阵列基因表达数据集,以及良性增生性息肉和潜在恶性无蒂锯齿状腺瘤/息肉的批量RNA测序基因表达数据集。在这些数据集中的每一个中,两种比较的表型中的一种或两种都具有不同的分子亚型,这些亚型导致表型内的变异性以及两种比较表型之间的异质性差异。我们的结果表明,设计用于检测差异样本方差的方法通过检测与两种比较表型相关的特定标志性基因集,如现有文献中所记录的,提供了有意义的生物学解释。

结论

本研究结果证明,当两种表型之间生物学相关但异质性变化在特定信号通路中普遍存在时,设计用于检测差异样本方差的方法在提供生物学解释方面是有用的。这些方法的软件实现可从Bioconductor包GSAR获得详细文档。现有方法适用于归一化矩阵形式的基因表达数据集,并且可以与具有可用特征集集合的归一化矩阵形式的其他组学数据集一起使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/e43421ce7354/12859_2025_6117_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/03f585cbd8c4/12859_2025_6117_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/4110baa67775/12859_2025_6117_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/0aacba0998e2/12859_2025_6117_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/e43421ce7354/12859_2025_6117_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/03f585cbd8c4/12859_2025_6117_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/4110baa67775/12859_2025_6117_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/0aacba0998e2/12859_2025_6117_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b95/11998189/e43421ce7354/12859_2025_6117_Fig4_HTML.jpg

相似文献

1
Improving data interpretability with new differential sample variance gene set tests.使用新的差异样本方差基因集检验提高数据的可解释性。
BMC Bioinformatics. 2025 Apr 14;26(1):103. doi: 10.1186/s12859-025-06117-0.
2
Improving data interpretability with new differential sample variance gene set tests.使用新的差异样本方差基因集测试提高数据可解释性。
Res Sq. 2024 Sep 9:rs.3.rs-4888767. doi: 10.21203/rs.3.rs-4888767/v1.
3
GSAR: Bioconductor package for Gene Set analysis in R.GSAR:用于R中基因集分析的Bioconductor软件包。
BMC Bioinformatics. 2017 Jan 24;18(1):61. doi: 10.1186/s12859-017-1482-6.
4
Platform-independent gene expression signature differentiates sessile serrated adenomas/polyps and hyperplastic polyps of the colon.与平台无关的基因表达特征可区分结肠无蒂锯齿状腺瘤/息肉和增生性息肉。
BMC Med Genomics. 2017 Dec 28;10(1):81. doi: 10.1186/s12920-017-0317-7.
5
Erratum: High-Throughput Identification of Resistance to Pseudomonas syringae pv. Tomato in Tomato using Seedling Flood Assay.勘误:利用幼苗浸没法高通量鉴定番茄对丁香假单胞菌 pv.番茄的抗性。
J Vis Exp. 2023 Oct 18(200). doi: 10.3791/6576.
6
Comparative evaluation of gene-set analysis methods.基因集分析方法的比较评估
BMC Bioinformatics. 2007 Nov 7;8:431. doi: 10.1186/1471-2105-8-431.
7
GSVA: gene set variation analysis for microarray and RNA-seq data.GSVA:用于微阵列和 RNA-seq 数据的基因集变异分析。
BMC Bioinformatics. 2013 Jan 16;14:7. doi: 10.1186/1471-2105-14-7.
8
Many accurate small-discriminatory feature subsets exist in microarray transcript data: biomarker discovery.微阵列转录数据中存在许多准确的小判别特征子集:生物标志物发现。
BMC Bioinformatics. 2005 Apr 13;6:97. doi: 10.1186/1471-2105-6-97.
9
Gene set analysis using sufficient dimension reduction.使用充分降维的基因集分析。
BMC Bioinformatics. 2016 Feb 6;17:74. doi: 10.1186/s12859-016-0928-6.
10
Improving detection of differentially expressed gene sets by applying cluster enrichment analysis to Gene Ontology.通过将聚类富集分析应用于基因本体论来改进差异表达基因集的检测。
BMC Bioinformatics. 2009 Aug 5;10:240. doi: 10.1186/1471-2105-10-240.

本文引用的文献

1
Role of Epithelial to Mesenchymal Transition in Colorectal Cancer.上皮-间质转化在结直肠癌中的作用
Int J Mol Sci. 2023 Oct 1;24(19):14815. doi: 10.3390/ijms241914815.
2
Modelling acquired resistance to DOT1L inhibition exhibits the adaptive potential of KMT2A-rearranged acute lymphoblastic leukemia.对DOT1L抑制产生获得性耐药的模型展示了KMT2A重排急性淋巴细胞白血病的适应潜力。
Exp Hematol Oncol. 2023 Sep 22;12(1):81. doi: 10.1186/s40164-023-00445-8.
3
Th1 cytokines in pediatric acute lymphoblastic leukemia.小儿急性淋巴细胞白血病中的 Th1 细胞因子。
Cancer Immunol Immunother. 2023 Nov;72(11):3621-3634. doi: 10.1007/s00262-023-03512-5. Epub 2023 Aug 23.
4
Different Roles of Apoptosis and Autophagy in the Development of Human Colorectal Cancer.细胞凋亡和自噬在人类结直肠癌发生发展中的不同作用。
Int J Mol Sci. 2023 Jun 15;24(12):10201. doi: 10.3390/ijms241210201.
5
A convergent malignant phenotype in B-cell acute lymphoblastic leukemia involving the splicing factor SRRM1.B细胞急性淋巴细胞白血病中涉及剪接因子SRRM1的一种趋同恶性表型。
NAR Cancer. 2022 Dec 9;4(4):zcac041. doi: 10.1093/narcan/zcac041. eCollection 2022 Dec.
6
Regulome analysis in B-acute lymphoblastic leukemia exposes Core Binding Factor addiction as a therapeutic vulnerability.B 急性淋巴细胞白血病中的调控组分析揭示了核心结合因子成瘾是一种治疗上的脆弱性。
Nat Commun. 2022 Nov 21;13(1):7124. doi: 10.1038/s41467-022-34653-3.
7
Single-cell Transcriptomics Reveals Early Molecular and Immune Alterations Underlying the Serrated Neoplasia Pathway Toward Colorectal Cancer.单细胞转录组学揭示了锯齿状肿瘤途径导致结直肠癌的早期分子和免疫改变。
Cell Mol Gastroenterol Hepatol. 2023;15(2):393-424. doi: 10.1016/j.jcmgh.2022.10.001. Epub 2022 Oct 8.
8
Wnt signaling in colorectal cancer: pathogenic role and therapeutic target.结直肠癌中的 Wnt 信号通路:致病作用和治疗靶点。
Mol Cancer. 2022 Jul 14;21(1):144. doi: 10.1186/s12943-022-01616-7.
9
Microenvironmental Reactive Oxygen Species in Colorectal Cancer: Involved Processes and Therapeutic Opportunities.结直肠癌中的微环境活性氧:相关过程及治疗机会
Cancers (Basel). 2021 Oct 9;13(20):5037. doi: 10.3390/cancers13205037.
10
Targeted Therapy in the Treatment of Pediatric Acute Lymphoblastic Leukemia-Therapy and Toxicity Mechanisms.靶向治疗在儿科急性淋巴细胞白血病治疗中的应用——治疗机制与毒性机制。
Int J Mol Sci. 2021 Sep 11;22(18):9827. doi: 10.3390/ijms22189827.