• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SCIA:一种适用于具有不同特征数据的新型基因集分析方法。

SCIA: A Novel Gene Set Analysis Applicable to Data With Different Characteristics.

作者信息

Li Yiqun, Wu Ying, Zhang Xiaohan, Bai Yunfan, Akthar Luqman Muhammad, Lu Xin, Shi Ming, Zhao Jianxiang, Jiang Qinghua, Li Yu

机构信息

Department of Laboratory of Cancer Biology, School of Life Science and Technology, Harbin Institute of Technology, Harbin, China.

Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou, China.

出版信息

Front Genet. 2019 Jun 25;10:598. doi: 10.3389/fgene.2019.00598. eCollection 2019.

DOI:10.3389/fgene.2019.00598
PMID:31293623
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6603225/
Abstract

Gene set analysis is commonly used in functional enrichment and molecular pathway analyses. Most of the present methods are based on the competitive testing methods which assume each gene is independent of the others. However, the false discovery rates of competitive methods are amplified when they are applied to datasets with high inter-gene correlations. The self-contained testing methods could solve this problem, but there are other restrictions on data characteristics. Therefore, a statistically rigorous testing method applicable to different datasets with various complex characteristics is needed to obtain unbiased and comparable results. We propose a self-contained and competitive incorporated analysis (SCIA) to alleviate the bias caused by the limited application scope of existing gene set analysis methods. This is accomplished through a novel permutation strategy using biological networks to selectively permute gene labels with different probabilities. In simulation studies, SCIA was compared with four representative analysis methods (GSEA, CAMERA, ROAST, and NES), and produced the best performance in both false discovery rate and sensitivity under most conditions with different parameter settings. Further, the KEGG pathway analysis on two real datasets of lung cancer showed that the results found by SCIA in both of the two datasets are much more than that of GSEA and most of them could be supported by literature. Overall, SCIA promisingly offers researchers more reliable and comparable results with different datasets.

摘要

基因集分析常用于功能富集和分子通路分析。目前大多数方法基于竞争测试方法,这些方法假定每个基因彼此独立。然而,当将竞争方法应用于基因间相关性高的数据集时,其错误发现率会被放大。自包含测试方法可以解决这个问题,但对数据特征还有其他限制。因此,需要一种统计严格的测试方法,适用于具有各种复杂特征的不同数据集,以获得无偏且可比的结果。我们提出一种自包含与竞争相结合的分析方法(SCIA),以减轻现有基因集分析方法应用范围有限所导致的偏差。这是通过一种新颖的置换策略实现的,该策略利用生物网络以不同概率选择性地置换基因标签。在模拟研究中,将SCIA与四种代表性分析方法(GSEA、CAMERA、ROAST和NES)进行了比较,在大多数不同参数设置的条件下,SCIA在错误发现率和灵敏度方面均表现最佳。此外,对两个肺癌真实数据集的KEGG通路分析表明,SCIA在两个数据集中发现的结果比GSEA多得多,并且其中大多数结果都能得到文献支持。总体而言,SCIA有望为研究人员提供与不同数据集更可靠且可比的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6603225/9f5a9be85bab/fgene-10-00598-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6603225/dc8ecfe1edab/fgene-10-00598-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6603225/9f5a9be85bab/fgene-10-00598-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6603225/dc8ecfe1edab/fgene-10-00598-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d37/6603225/9f5a9be85bab/fgene-10-00598-g0002.jpg

相似文献

1
SCIA: A Novel Gene Set Analysis Applicable to Data With Different Characteristics.SCIA:一种适用于具有不同特征数据的新型基因集分析方法。
Front Genet. 2019 Jun 25;10:598. doi: 10.3389/fgene.2019.00598. eCollection 2019.
2
Comparative study on gene set and pathway topology-based enrichment methods.基于基因集和通路拓扑结构的富集方法的比较研究
BMC Bioinformatics. 2015 Oct 22;16:334. doi: 10.1186/s12859-015-0751-5.
3
Gene set analysis methods: a systematic comparison.基因集分析方法:系统比较
BioData Min. 2018 May 31;11:8. doi: 10.1186/s13040-018-0166-8. eCollection 2018.
4
Group testing for pathway analysis improves comparability of different microarray datasets.用于通路分析的分组检验可提高不同微阵列数据集的可比性。
Bioinformatics. 2006 Oct 15;22(20):2500-6. doi: 10.1093/bioinformatics/btl424. Epub 2006 Aug 7.
5
Silver: Forging almost Gold Standard Datasets.银:锻造近乎黄金标准数据集。
Genes (Basel). 2021 Sep 28;12(10):1523. doi: 10.3390/genes12101523.
6
BubbleGUM: automatic extraction of phenotype molecular signatures and comprehensive visualization of multiple Gene Set Enrichment Analyses.BubbleGUM:表型分子特征的自动提取及多种基因集富集分析的综合可视化
BMC Genomics. 2015 Oct 19;16:814. doi: 10.1186/s12864-015-2012-4.
7
Rotation testing in gene set enrichment analysis for small direct comparison experiments.小型直接比较实验的基因集富集分析中的旋转测试。
Stat Appl Genet Mol Biol. 2009;8:Article34. doi: 10.2202/1544-6115.1418. Epub 2009 Jul 27.
8
Improving Gene-Set Enrichment Analysis of RNA-Seq Data with Small Replicates.利用小样本重复改进RNA测序数据的基因集富集分析
PLoS One. 2016 Nov 9;11(11):e0165919. doi: 10.1371/journal.pone.0165919. eCollection 2016.
9
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline.RNA测序数据的基因集分析方法:性能评估与应用指南
Brief Bioinform. 2016 May;17(3):393-407. doi: 10.1093/bib/bbv069. Epub 2015 Sep 4.
10
Assessment of gene set analysis methods based on microarray data.基于微阵列数据的基因集分析方法评估。
Gene. 2014 Jan 25;534(2):383-9. doi: 10.1016/j.gene.2013.08.063. Epub 2013 Sep 3.

引用本文的文献

1
Thyroid hormone receptor α1 acts as a new squamous cell lung cancer diagnostic marker and poor prognosis predictor.甲状腺激素受体α1作为一种新型的鳞状细胞肺癌诊断标志物及预后不良的预测指标。
Sci Rep. 2021 Apr 12;11(1):7944. doi: 10.1038/s41598-021-86754-6.
2
PMINR: Pointwise Mutual Information-Based Network Regression - With Application to Studies of Lung Cancer and Alzheimer's Disease.基于逐点互信息的网络回归——及其在肺癌和阿尔茨海默病研究中的应用
Front Genet. 2020 Oct 15;11:556259. doi: 10.3389/fgene.2020.556259. eCollection 2020.

本文引用的文献

1
Regulatory T-cell Genes Drive Altered Immune Microenvironment in Adult Solid Cancers and Allow for Immune Contextual Patient Subtyping.调节性 T 细胞基因驱动成人实体瘤中免疫微环境的改变,并允许进行免疫背景的患者亚群分型。
Cancer Epidemiol Biomarkers Prev. 2018 Jan;27(1):103-112. doi: 10.1158/1055-9965.EPI-17-0461. Epub 2017 Nov 13.
2
Powerful differential expression analysis incorporating network topology for next-generation sequencing data.结合网络拓扑结构用于下一代测序数据的强大差异表达分析。
Bioinformatics. 2017 May 15;33(10):1505-1513. doi: 10.1093/bioinformatics/btw833.
3
A novel chi-square statistic for detecting group differences between pathways in systems epidemiology.
一种用于检测系统流行病学中各通路间组间差异的新型卡方统计量。
Stat Med. 2016 Dec 20;35(29):5512-5524. doi: 10.1002/sim.7094. Epub 2016 Sep 7.
4
ESEA: Discovering the Dysregulated Pathways based on Edge Set Enrichment Analysis.ESEA:基于边集富集分析发现失调通路。
Sci Rep. 2015 Aug 12;5:13044. doi: 10.1038/srep13044.
5
Methods and approaches in the topology-based analysis of biological pathways.基于拓扑学的生物途径分析方法与途径
Front Physiol. 2013 Oct 10;4:278. doi: 10.3389/fphys.2013.00278.
6
Along signal paths: an empirical gene set approach exploiting pathway topology.沿信号通路:一种利用通路拓扑结构的经验基因集方法。
Nucleic Acids Res. 2013 Jan 7;41(1):e19. doi: 10.1093/nar/gks866. Epub 2012 Sep 21.
7
Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes.基于中心性的通路富集:一种寻找由关键基因主导的重要通路的系统方法。
BMC Syst Biol. 2012 Jun 6;6:56. doi: 10.1186/1752-0509-6-56.
8
Camera: a competitive gene set test accounting for inter-gene correlation.相机:一种竞争性基因集测试,考虑了基因间的相关性。
Nucleic Acids Res. 2012 Sep 1;40(17):e133. doi: 10.1093/nar/gks461. Epub 2012 May 25.
9
Ten years of pathway analysis: current approaches and outstanding challenges.十年的通路分析:当前方法和突出挑战。
PLoS Comput Biol. 2012;8(2):e1002375. doi: 10.1371/journal.pcbi.1002375. Epub 2012 Feb 23.
10
KEGG for integration and interpretation of large-scale molecular data sets.KEGG 用于整合和解释大规模分子数据集。
Nucleic Acids Res. 2012 Jan;40(Database issue):D109-14. doi: 10.1093/nar/gkr988. Epub 2011 Nov 10.