顶级排名基因列表的重叠概率、超几何分布以及基因选择标准的严格性。

Overlapping probabilities of top ranking gene lists, hypergeometric distribution, and stringency of gene selection criterion.

作者信息

Fury Wen, Batliwalla Franak, Gregersen Peter K, Li Wentian

机构信息

Regeneron Pharmaceutical Inc., Tarrytown, NY 10591, USA.

出版信息

Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5531-4. doi: 10.1109/IEMBS.2006.260828.

DOI:10.1109/IEMBS.2006.260828

PMID:17947148

Abstract

When the same set of genes appear in two top ranking gene lists in two different studies, it is often of interest to estimate the probability for this being a chance event. This overlapping probability is well known to follow the hypergeometric distribution. Usually, the lengths of top-ranking gene lists are assumed to be fixed, by using a pre-set criterion on, e.g., p-value for the t-test. We investigate how overlapping probability changes with the gene selection criterion, or simply, with the length of the top-ranking gene lists. It is concluded that overlapping probability is indeed a function of the gene list length, and its statistical significance should be quoted in the context of gene selection criterion.

摘要

当同一组基因出现在两项不同研究的两个顶级基因列表中时，人们通常会对估计这是一个偶然事件的概率感兴趣。众所周知，这种重叠概率遵循超几何分布。通常，通过使用例如t检验的p值等预设标准，假设顶级基因列表的长度是固定的。我们研究了重叠概率如何随基因选择标准变化，或者简单地说，随顶级基因列表的长度变化。得出的结论是，重叠概率确实是基因列表长度的函数，其统计显著性应在基因选择标准的背景下引用。

相似文献

Overlapping probabilities of top ranking gene lists, hypergeometric distribution, and stringency of gene selection criterion.顶级排名基因列表的重叠概率、超几何分布以及基因选择标准的严格性。

Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5531-4. doi: 10.1109/IEMBS.2006.260828.

The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies.微阵列研究中差异表达基因列表的可重复性、敏感性和特异性之间的平衡。

BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S10. doi: 10.1186/1471-2105-9-S9-S10.

Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data.事后模式匹配：赋予单通道微阵列数据中统计定义的表达模式以显著性

BMC Bioinformatics. 2007 Jul 5;8:240. doi: 10.1186/1471-2105-8-240.

Is cross-validation better than resubstitution for ranking genes?在对基因进行排名时，交叉验证是否比重替代法更好？

Bioinformatics. 2004 Jan 22;20(2):253-8. doi: 10.1093/bioinformatics/btg399.

Using weighted permutation scores to detect differential gene expression with microarray data.使用加权排列分数通过微阵列数据检测差异基因表达。

J Bioinform Comput Biol. 2005 Aug;3(4):989-1006. doi: 10.1142/s021972000500134x.

Evaluation of gene importance in microarray data based upon probability of selection.基于选择概率评估微阵列数据中的基因重要性。

BMC Bioinformatics. 2005 Mar 22;6:67. doi: 10.1186/1471-2105-6-67.

Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类

BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.

Quadratic regression analysis for gene discovery and pattern recognition for non-cyclic short time-course microarray experiments.用于非循环短时间进程微阵列实验的基因发现和模式识别的二次回归分析。

BMC Bioinformatics. 2005 Apr 25;6:106. doi: 10.1186/1471-2105-6-106.

Computational selection of distinct class- and subclass-specific gene expression signatures.不同类别和亚类特异性基因表达特征的计算选择。

J Biomed Inform. 2002 Jun;35(3):160-70. doi: 10.1016/s1532-0464(02)00525-7.

Sample size calculations based on ranking and selection in microarray experiments.基于微阵列实验中排序与选择的样本量计算。

Biometrics. 2008 Mar;64(1):217-26. doi: 10.1111/j.1541-0420.2007.00875.x. Epub 2007 Aug 3.

引用本文的文献

Common molecular links and therapeutic insights between type 2 diabetes and kidney cancer.2型糖尿病与肾癌之间的常见分子联系及治疗见解

PLoS One. 2025 Aug 20;20(8):e0330619. doi: 10.1371/journal.pone.0330619. eCollection 2025.

Cancer/testis antigens FBXO39 and CEP55 expression correlates with survival in GBM patients.癌/睾丸抗原FBXO39和CEP55的表达与胶质母细胞瘤患者的生存情况相关。

PLoS One. 2025 Jun 12;20(6):e0326054. doi: 10.1371/journal.pone.0326054. eCollection 2025.

Transcriptome Complexity Disentangled: A Regulatory Molecules Approach.解析转录组复杂性：一种调控分子方法

Int J Mol Sci. 2025 Mar 11;26(6):2510. doi: 10.3390/ijms26062510.

NCOA3 knockdown delays human embryo development.NCOA3基因敲低会延迟人类胚胎发育。

Heliyon. 2024 Sep 13;10(18):e37639. doi: 10.1016/j.heliyon.2024.e37639. eCollection 2024 Sep 30.

RankCompV3: a differential expression analysis algorithm based on relative expression orderings and applications in single-cell RNA transcriptomics.RankCompV3：一种基于相对表达顺序的差异表达分析算法及其在单细胞 RNA 转录组学中的应用。

BMC Bioinformatics. 2024 Aug 7;25(1):259. doi: 10.1186/s12859-024-05889-1.

mRNA markers for survival prediction in glioblastoma multiforme patients: a systematic review with bioinformatic analyses.mRNA 标志物在多形性胶质母细胞瘤患者生存预测中的应用：系统评价及生物信息学分析。

BMC Cancer. 2024 May 21;24(1):612. doi: 10.1186/s12885-024-12345-z.

Integrating Transcriptomics, Genomics, and Imaging in Alzheimer's Disease: A Federated Model.整合转录组学、基因组学和影像学用于阿尔茨海默病：一种联邦模型。

Front Radiol. 2022 Jan 21;1:777030. doi: 10.3389/fradi.2021.777030. eCollection 2021.

Transcriptome Complexity Disentangled: A Regulatory Molecules Approach.解开转录组复杂性：一种调控分子方法。

bioRxiv. 2025 Mar 10:2023.04.17.537241. doi: 10.1101/2023.04.17.537241.

A regulatory network of Sox and Six transcription factors initiate a cell fate transformation during hearing regeneration in adult zebrafish.在成年斑马鱼听力再生过程中，Sox和Six转录因子的调控网络启动细胞命运转变。

Cell Genom. 2022 Sep 14;2(9). doi: 10.1016/j.xgen.2022.100170. Epub 2022 Aug 22.

Sequence and tissue targeting specificity of ZFP36L2 reveals Elavl2 as a novel target with co-regulation potential.ZFP36L2 的序列和组织靶向特异性揭示了 Elavl2 是一个具有潜在共调控作用的新靶点。

Nucleic Acids Res. 2022 Apr 22;50(7):4068-4082. doi: 10.1093/nar/gkac209.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

顶级排名基因列表的重叠概率、超几何分布以及基因选择标准的严格性。

Overlapping probabilities of top ranking gene lists, hypergeometric distribution, and stringency of gene selection criterion.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献