• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

箭头图:一种新的图形工具,用于选择上调和下调的基因以及在样本亚组中差异表达的基因。

Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups.

机构信息

Natural and Exact Sciences Department, Higher School of Health Technology of Lisbon of Polytechnic Institute of Lisbon and Center of Statistics and Applications of University of Lisbon, Lisbon, Portugal.

出版信息

BMC Bioinformatics. 2012 Jun 26;13:147. doi: 10.1186/1471-2105-13-147.

DOI:10.1186/1471-2105-13-147
PMID:22734592
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3542259/
Abstract

BACKGROUND

A common task in analyzing microarray data is to determine which genes are differentially expressed across two (or more) kind of tissue samples or samples submitted under experimental conditions. Several statistical methods have been proposed to accomplish this goal, generally based on measures of distance between classes. It is well known that biological samples are heterogeneous because of factors such as molecular subtypes or genetic background that are often unknown to the experimenter. For instance, in experiments which involve molecular classification of tumors it is important to identify significant subtypes of cancer. Bimodal or multimodal distributions often reflect the presence of subsamples mixtures. Consequently, there can be genes differentially expressed on sample subgroups which are missed if usual statistical approaches are used. In this paper we propose a new graphical tool which not only identifies genes with up and down regulations, but also genes with differential expression in different subclasses, that are usually missed if current statistical methods are used. This tool is based on two measures of distance between samples, namely the overlapping coefficient (OVL) between two densities and the area under the receiver operating characteristic (ROC) curve. The methodology proposed here was implemented in the open-source R software.

RESULTS

This method was applied to a publicly available dataset, as well as to a simulated dataset. We compared our results with the ones obtained using some of the standard methods for detecting differentially expressed genes, namely Welch t-statistic, fold change (FC), rank products (RP), average difference (AD), weighted average difference (WAD), moderated t-statistic (modT), intensity-based moderated t-statistic (ibmT), significance analysis of microarrays (samT) and area under the ROC curve (AUC). On both datasets all differentially expressed genes with bimodal or multimodal distributions were not selected by all standard selection procedures. We also compared our results with (i) area between ROC curve and rising area (ABCR) and (ii) the test for not proper ROC curves (TNRC). We found our methodology more comprehensive, because it detects both bimodal and multimodal distributions and different variances can be considered on both samples. Another advantage of our method is that we can analyze graphically the behavior of different kinds of differentially expressed genes.

CONCLUSION

Our results indicate that the arrow plot represents a new flexible and useful tool for the analysis of gene expression profiles from microarrays.

摘要

背景

分析微阵列数据的一个常见任务是确定哪些基因在两种(或更多种)组织样本或在实验条件下提交的样本之间差异表达。已经提出了几种统计方法来实现这一目标,这些方法通常基于类之间的距离度量。众所周知,由于分子亚型或遗传背景等因素,生物样本是异质的,而这些因素通常是实验人员所不知道的。例如,在涉及肿瘤分子分类的实验中,识别重要的癌症亚型是很重要的。双峰或多峰分布通常反映了亚样本混合物的存在。因此,如果使用通常的统计方法,可能会错过在样本亚组中差异表达的基因。在本文中,我们提出了一种新的图形工具,该工具不仅可以识别上调和下调的基因,还可以识别通常在使用当前统计方法时会错过的不同亚类中的差异表达基因。该工具基于两个样本之间的距离度量,即两个密度之间的重叠系数(OVL)和接收器操作特性(ROC)曲线下的面积。这里提出的方法是在开源 R 软件中实现的。

结果

该方法应用于一个公开可用的数据集,以及一个模拟数据集。我们将我们的结果与一些用于检测差异表达基因的标准方法的结果进行了比较,这些方法包括 Welch t 统计量、折叠变化(FC)、秩乘积(RP)、平均差异(AD)、加权平均差异(WAD)、适度 t 统计量(modT)、基于强度的适度 t 统计量(ibmT)、微阵列显著性分析(samT)和 ROC 曲线下的面积(AUC)。在两个数据集上,所有具有双峰或多峰分布的差异表达基因都没有被所有标准选择程序选择。我们还将我们的结果与(i)ROC 曲线下面积和上升面积(ABCR)和(ii)ROC 曲线不当检验(TNRC)进行了比较。我们发现我们的方法更全面,因为它可以检测双峰和多峰分布,并且可以考虑两个样本之间的不同方差。我们方法的另一个优点是,我们可以图形化地分析不同类型的差异表达基因的行为。

结论

我们的结果表明,箭头图代表了一种用于分析微阵列基因表达谱的新的灵活而有用的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/80579229e111/1471-2105-13-147-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/f77d53d627e7/1471-2105-13-147-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/1cc3950bb468/1471-2105-13-147-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/12b20d31d79f/1471-2105-13-147-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/99bbcf38efcd/1471-2105-13-147-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/6fb144153256/1471-2105-13-147-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/4783b312a4d6/1471-2105-13-147-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/6f228b3c96f0/1471-2105-13-147-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/55f53feb2d73/1471-2105-13-147-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/80579229e111/1471-2105-13-147-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/f77d53d627e7/1471-2105-13-147-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/1cc3950bb468/1471-2105-13-147-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/12b20d31d79f/1471-2105-13-147-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/99bbcf38efcd/1471-2105-13-147-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/6fb144153256/1471-2105-13-147-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/4783b312a4d6/1471-2105-13-147-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/6f228b3c96f0/1471-2105-13-147-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/55f53feb2d73/1471-2105-13-147-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/682c/3542259/80579229e111/1471-2105-13-147-9.jpg

相似文献

1
Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups.箭头图:一种新的图形工具,用于选择上调和下调的基因以及在样本亚组中差异表达的基因。
BMC Bioinformatics. 2012 Jun 26;13:147. doi: 10.1186/1471-2105-13-147.
2
A weighted average difference method for detecting differentially expressed genes from microarray data.一种用于从微阵列数据中检测差异表达基因的加权平均差异法。
Algorithms Mol Biol. 2008 Jun 26;3:8. doi: 10.1186/1748-7188-3-8.
3
Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments.不适当的ROC曲线作为微阵列实验中差异表达基因分析的新工具。
BMC Bioinformatics. 2008 Oct 3;9:410. doi: 10.1186/1471-2105-9-410.
4
Ranking differentially expressed genes from Affymetrix gene expression data: methods with reproducibility, sensitivity, and specificity.对Affymetrix基因表达数据中的差异表达基因进行排名:具有可重复性、敏感性和特异性的方法。
Algorithms Mol Biol. 2009 Apr 22;4:7. doi: 10.1186/1748-7188-4-7.
5
Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data.从微阵列数据生成差异表达基因列表的方法的比较与评估
BMC Bioinformatics. 2006 Jul 26;7:359. doi: 10.1186/1471-2105-7-359.
6
A unified framework for finding differentially expressed genes from microarray experiments.一种从微阵列实验中寻找差异表达基因的统一框架。
BMC Bioinformatics. 2007 Sep 18;8:347. doi: 10.1186/1471-2105-8-347.
7
Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments.基于强度的分层贝叶斯方法改进了微阵列实验中差异表达基因的检测。
BMC Bioinformatics. 2006 Dec 19;7:538. doi: 10.1186/1471-2105-7-538.
8
Evaluating methods for ranking differentially expressed genes applied to microArray quality control data.评估应用于微阵列质量控制数据的差异表达基因排序方法。
BMC Bioinformatics. 2011 Jun 6;12:227. doi: 10.1186/1471-2105-12-227.
9
Selecting differentially expressed genes from microarray experiments.从微阵列实验中选择差异表达基因。
Biometrics. 2003 Mar;59(1):133-42. doi: 10.1111/1541-0420.00016.
10
Identifying differentially expressed genes from microarray experiments via statistic synthesis.通过统计合成从微阵列实验中识别差异表达基因。
Bioinformatics. 2005 Apr 1;21(7):1084-93. doi: 10.1093/bioinformatics/bti108. Epub 2004 Oct 28.

引用本文的文献

1
Cut-points of the 6-min walk test, the six spot step test, and the timed-25-foot walk test discriminating impaired from non-impaired walking capacity in persons with Multiple sclerosis.在多发性硬化症患者中,6分钟步行测试、六点阶梯试验和25英尺定时步行测试区分步行能力受损与未受损的切点。
Mult Scler J Exp Transl Clin. 2025 Mar 27;11(1):20552173251324600. doi: 10.1177/20552173251324600. eCollection 2025 Jan-Mar.
2
Bayesian nonparametric inference for the overlap coefficient: With an application to disease diagnosis.贝叶斯非参数推断重叠系数:在疾病诊断中的应用。
Stat Med. 2022 Sep 10;41(20):3879-3898. doi: 10.1002/sim.9480. Epub 2022 Jun 27.
3

本文引用的文献

1
Genome-wide survey of post-meiotic segregation during yeast recombination.酵母重组过程中减数分裂后分离的全基因组调查。
Genome Biol. 2011;12(4):R36. doi: 10.1186/gb-2011-12-4-r36. Epub 2011 Apr 11.
2
Mapping of signaling networks through synthetic genetic interaction analysis by RNAi.通过 RNAi 进行的合成遗传交互作用分析来绘制信号转导网络。
Nat Methods. 2011 Apr;8(4):341-6. doi: 10.1038/nmeth.1581. Epub 2011 Mar 6.
3
Antisense expression increases gene expression variability and locus interdependency.反义表达增加基因表达的可变性和基因座的相互依赖性。
Transcriptome profiling implicated in beneficiary actions of kimchi extracts against infection.
转录组分析揭示了泡菜提取物抗感染的有益作用。
J Clin Biochem Nutr. 2021 Sep;69(2):171-187. doi: 10.3164/jcbn.20-116. Epub 2021 Mar 27.
4
The length of the receiver operating characteristic curve and the two cutoff Youden index within a robust framework for discovery, evaluation, and cutoff estimation in biomarker studies involving improper receiver operating characteristic curves.在涉及不当接收者操作特征曲线的生物标志物研究中,用于发现、评估和截止值估计的稳健框架内,接收者操作特征曲线的长度和两个截止 Youden 指数。
Stat Med. 2021 Mar 30;40(7):1767-1789. doi: 10.1002/sim.8869. Epub 2021 Feb 2.
5
Tumor Biomarkers for the Prediction of Distant Metastasis in Head and Neck Squamous Cell Carcinoma.用于预测头颈部鳞状细胞癌远处转移的肿瘤生物标志物
Cancers (Basel). 2020 Apr 9;12(4):922. doi: 10.3390/cancers12040922.
6
knnAUC: an open-source R package for detecting nonlinear dependence between one continuous variable and one binary variable.knnAUC:一个用于检测一个连续变量和一个二分类变量之间非线性关系的开源 R 包。
BMC Bioinformatics. 2018 Nov 22;19(1):448. doi: 10.1186/s12859-018-2427-4.
Mol Syst Biol. 2011 Feb 15;7:468. doi: 10.1038/msb.2011.1.
4
Genome-wide analysis of mRNA decay patterns during early Drosophila development.果蝇早期发育过程中 mRNA 降解模式的全基因组分析。
Genome Biol. 2010;11(9):R93. doi: 10.1186/gb-2010-11-9-r93. Epub 2010 Sep 21.
5
Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments.不适当的ROC曲线作为微阵列实验中差异表达基因分析的新工具。
BMC Bioinformatics. 2008 Oct 3;9:410. doi: 10.1186/1471-2105-9-410.
6
A weighted average difference method for detecting differentially expressed genes from microarray data.一种用于从微阵列数据中检测差异表达基因的加权平均差异法。
Algorithms Mol Biol. 2008 Jun 26;3:8. doi: 10.1186/1748-7188-3-8.
7
PADGE: analysis of heterogeneous patterns of differential gene expression.PADGE:差异基因表达的异质性模式分析
Physiol Genomics. 2007 Dec 19;32(1):154-9. doi: 10.1152/physiolgenomics.00259.2006. Epub 2007 Oct 9.
8
Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments.基于强度的分层贝叶斯方法改进了微阵列实验中差异表达基因的检测。
BMC Bioinformatics. 2006 Dec 19;7:538. doi: 10.1186/1471-2105-7-538.
9
Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data.从微阵列数据生成差异表达基因列表的方法的比较与评估
BMC Bioinformatics. 2006 Jul 26;7:359. doi: 10.1186/1471-2105-7-359.
10
Rank-based methods as a non-parametric alternative of the T-statistic for the analysis of biological microarray data.基于秩的方法作为用于生物微阵列数据分析的T统计量的非参数替代方法。
J Bioinform Comput Biol. 2005 Oct;3(5):1171-89. doi: 10.1142/s0219720005001442.