• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

当前通话过滤对微阵列实验分析的影响。

Effects of filtering by Present call on analysis of microarray experiments.

作者信息

McClintick Jeanette N, Edenberg Howard J

机构信息

Department of Medical and Molecular Genetics, Indiana University, Indianapolis, Indiana, USA.

出版信息

BMC Bioinformatics. 2006 Jan 31;7:49. doi: 10.1186/1471-2105-7-49.

DOI:10.1186/1471-2105-7-49
PMID:16448562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1409797/
Abstract

BACKGROUND

Affymetrix GeneChips are widely used for expression profiling of tens of thousands of genes. The large number of comparisons can lead to false positives. Various methods have been used to reduce false positives, but they have rarely been compared or quantitatively evaluated. Here we describe and evaluate a simple method that uses the detection (Present/Absent) call generated by the Affymetrix microarray suite version 5 software (MAS5) to remove data that is not reliably detected before further analysis, and compare this with filtering by expression level. We explore the effects of various thresholds for removing data in experiments of different size (from 3 to 10 arrays per treatment), as well as their relative power to detect significant differences in expression.

RESULTS

Our approach sets a threshold for the fraction of arrays called Present in at least one treatment group. This method removes a large percentage of probe sets called Absent before carrying out the comparisons, while retaining most of the probe sets called Present. It preferentially retains the more significant probe sets (p < or = 0.001) and those probe sets that are turned on or off, and improves the false discovery rate. Permutations to estimate false positives indicate that probe sets removed by the filter contribute a disproportionate number of false positives. Filtering by fraction Present is effective when applied to data generated either by the MAS5 algorithm or by other probe-level algorithms, for example RMA (robust multichip average). Experiment size greatly affects the ability to reproducibly detect significant differences, and also impacts the effect of filtering; smaller experiments (3-5 samples per treatment group) benefit from more restrictive filtering (> or =50% Present).

CONCLUSION

Use of a threshold fraction of Present detection calls (derived by MAS5) provided a simple method that effectively eliminated from analysis probe sets that are unlikely to be reliable while preserving the most significant probe sets and those turned on or off; it thereby increased the ratio of true positives to false positives.

摘要

背景

Affymetrix基因芯片被广泛用于对数以万计基因的表达谱分析。大量的比较可能导致假阳性。已经使用了各种方法来减少假阳性,但它们很少被比较或定量评估。在这里,我们描述并评估一种简单的方法,该方法使用Affymetrix微阵列套件版本5软件(MAS5)生成的检测(存在/不存在)调用,在进一步分析之前去除未可靠检测到的数据,并将其与按表达水平进行的过滤进行比较。我们探讨了在不同规模的实验(每个处理3至10个阵列)中去除数据的各种阈值的影响,以及它们检测表达显著差异的相对能力。

结果

我们的方法为至少一个处理组中被称为“存在”的阵列比例设置了一个阈值。该方法在进行比较之前去除了很大比例被称为“不存在”的探针集,同时保留了大多数被称为“存在”的探针集。它优先保留更显著的探针集(p≤0.001)以及那些开启或关闭的探针集,并提高了错误发现率。用于估计假阳性的排列表明,被过滤器去除的探针集贡献了不成比例数量的假阳性。按“存在”比例进行过滤在应用于由MAS5算法或其他探针水平算法(例如RMA,稳健多芯片平均)生成的数据时是有效的。实验规模极大地影响了可重复检测显著差异的能力,也影响了过滤效果;较小的实验(每个处理组3至5个样本)受益于更严格的过滤(≥50%“存在”)。

结论

使用由MAS5得出的“存在”检测调用的阈值比例提供了一种简单的方法,该方法有效地从分析中消除了不太可能可靠的探针集,同时保留了最显著的探针集以及那些开启或关闭的探针集;从而提高了真阳性与假阳性的比例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/6bc2e6d3d248/1471-2105-7-49-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/f9cf99aed03b/1471-2105-7-49-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/0747466a1c64/1471-2105-7-49-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/b8744c7884b0/1471-2105-7-49-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/d32502476e1c/1471-2105-7-49-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/7a973a1361fc/1471-2105-7-49-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/1988acd5533f/1471-2105-7-49-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/057a2c2b0537/1471-2105-7-49-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/c552cef960d2/1471-2105-7-49-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/6bc2e6d3d248/1471-2105-7-49-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/f9cf99aed03b/1471-2105-7-49-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/0747466a1c64/1471-2105-7-49-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/b8744c7884b0/1471-2105-7-49-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/d32502476e1c/1471-2105-7-49-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/7a973a1361fc/1471-2105-7-49-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/1988acd5533f/1471-2105-7-49-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/057a2c2b0537/1471-2105-7-49-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/c552cef960d2/1471-2105-7-49-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7445/1409797/6bc2e6d3d248/1471-2105-7-49-9.jpg

相似文献

1
Effects of filtering by Present call on analysis of microarray experiments.当前通话过滤对微阵列实验分析的影响。
BMC Bioinformatics. 2006 Jan 31;7:49. doi: 10.1186/1471-2105-7-49.
2
Filtering for increased power for microarray data analysis.为提高微阵列数据分析的功效进行筛选。
BMC Bioinformatics. 2009 Jan 8;10:11. doi: 10.1186/1471-2105-10-11.
3
Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model.高密度寡核苷酸阵列的统计分析:一种乘性噪声模型。
Bioinformatics. 2002 Dec;18(12):1633-40. doi: 10.1093/bioinformatics/18.12.1633.
4
How to decide? Different methods of calculating gene expression from short oligonucleotide array data will give different results.如何做出决定?从短寡核苷酸阵列数据计算基因表达的不同方法会得出不同的结果。
BMC Bioinformatics. 2006 Mar 15;7:137. doi: 10.1186/1471-2105-7-137.
5
Filtering genes to improve sensitivity in oligonucleotide microarray data analysis.在寡核苷酸微阵列数据分析中筛选基因以提高灵敏度。
Nucleic Acids Res. 2007;35(16):e102. doi: 10.1093/nar/gkm537. Epub 2007 Aug 15.
6
An interactive power analysis tool for microarray hypothesis testing and generation.用于微阵列假设检验和生成的交互式功效分析工具。
Bioinformatics. 2006 Apr 1;22(7):808-14. doi: 10.1093/bioinformatics/btk052. Epub 2006 Jan 17.
7
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
8
The utility of MAS5 expression summary and detection call algorithms.MAS5表达汇总及检测调用算法的效用。
BMC Bioinformatics. 2007 Jul 30;8:273. doi: 10.1186/1471-2105-8-273.
9
Utilization of two sample t-test statistics from redundant probe sets to evaluate different probe set algorithms in GeneChip studies.利用来自冗余探针集的双样本t检验统计量来评估基因芯片研究中的不同探针集算法。
BMC Bioinformatics. 2006 Jan 10;7:12. doi: 10.1186/1471-2105-7-12.
10
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.

引用本文的文献

1
Genomic-Thermodynamic Phase Synchronization: Maxwell's Demon-like Regulation of Cell Fate Transition.基因组热力学相位同步:类麦克斯韦妖对细胞命运转变的调控
Int J Mol Sci. 2025 May 20;26(10):4911. doi: 10.3390/ijms26104911.
2
Clinical and pathological findings in neurolymphomatosis: Preliminary association with gene expression profiles in sural nerves.神经淋巴瘤病的临床和病理表现:与腓肠神经基因表达谱的初步关联
Front Oncol. 2022 Sep 26;12:974751. doi: 10.3389/fonc.2022.974751. eCollection 2022.
3
Relationship between insulin sensitivity and gene expression in human skeletal muscle.

本文引用的文献

1
Global effects of vitamin A deficiency on gene expression in rat liver: evidence for hypoandrogenism.维生素A缺乏对大鼠肝脏基因表达的整体影响:雄激素缺乏的证据。
J Nutr Biochem. 2006 May;17(5):345-55. doi: 10.1016/j.jnutbio.2005.08.006. Epub 2005 Sep 19.
2
Immediate gene expression changes after the first course of neoadjuvant chemotherapy in patients with primary breast cancer disease.原发性乳腺癌患者新辅助化疗第一疗程后即刻发生的基因表达变化。
Clin Cancer Res. 2004 Oct 1;10(19):6418-31. doi: 10.1158/1078-0432.CCR-04-1031.
3
Bioconductor: open software development for computational biology and bioinformatics.
胰岛素敏感性与人类骨骼肌基因表达的关系。
BMC Endocr Disord. 2021 Feb 27;21(1):32. doi: 10.1186/s12902-021-00687-9.
4
Assessing technical and biological variation in SWATH-MS-based proteomic analysis of chronic lymphocytic leukaemia cells.评估基于 SWATH-MS 的慢性淋巴细胞白血病细胞蛋白质组学分析中的技术和生物学变异性。
Sci Rep. 2021 Feb 3;11(1):2932. doi: 10.1038/s41598-021-82609-2.
5
Bayesian penalized cumulative logit model for high-dimensional data with an ordinal response.贝叶斯惩罚累积对数模型在有序响应的高维数据中的应用。
Stat Med. 2021 Mar 15;40(6):1453-1481. doi: 10.1002/sim.8851. Epub 2020 Dec 18.
6
Targeting fibroblast growth factor 23-responsive pathways uncovers controlling genes in kidney mineral metabolism.靶向成纤维细胞生长因子 23 反应途径揭示了肾脏矿物质代谢中的控制基因。
Kidney Int. 2021 Mar;99(3):598-608. doi: 10.1016/j.kint.2020.10.024. Epub 2020 Nov 4.
7
Cell-Fate Determination from Embryo to Cancer Development: Genomic Mechanism Elucidated.从胚胎到癌症发展的细胞命运决定:基因组机制阐明。
Int J Mol Sci. 2020 Jun 27;21(13):4581. doi: 10.3390/ijms21134581.
8
Large-Scale Differential Gene Expression Transcriptomic Analysis Identifies a Metabolic Signature Shared by All Cancer Cells.大规模差异基因表达转录组分析鉴定了所有癌细胞共有的代谢特征。
Biomolecules. 2020 Apr 30;10(5):701. doi: 10.3390/biom10050701.
9
Seasonal nitrogen remobilization and the role of auxin transport in poplar trees.季节性氮再转移及其在杨树中的生长素运输作用。
J Exp Bot. 2020 Jul 25;71(15):4512-4530. doi: 10.1093/jxb/eraa130.
10
Effects of chronic intermittent ethanol exposure and withdrawal on neuroblastoma cell transcriptome.慢性间歇性乙醇暴露和戒断对神经母细胞瘤细胞转录组的影响。
Alcohol. 2020 Jun;85:119-126. doi: 10.1016/j.alcohol.2019.12.004. Epub 2020 Jan 7.
生物导体:用于计算生物学和生物信息学的开源软件开发。
Genome Biol. 2004;5(10):R80. doi: 10.1186/gb-2004-5-10-r80. Epub 2004 Sep 15.
4
Transcriptional profiling reveals evidence for signaling and oligodendroglial abnormalities in the temporal cortex from patients with major depressive disorder.转录谱分析揭示了重度抑郁症患者颞叶皮质中信号传导和少突胶质细胞异常的证据。
Mol Psychiatry. 2005 Mar;10(3):309-22. doi: 10.1038/sj.mp.4001565.
5
Effects of cigarette smoke on the human airway epithelial cell transcriptome.香烟烟雾对人呼吸道上皮细胞转录组的影响。
Proc Natl Acad Sci U S A. 2004 Jul 6;101(27):10143-8. doi: 10.1073/pnas.0401422101. Epub 2004 Jun 21.
6
Distinct transcriptional programs activated by interleukin-10 with or without lipopolysaccharide in dendritic cells: induction of the B cell-activating chemokine, CXC chemokine ligand 13.白细胞介素-10在有或无脂多糖情况下激活树突状细胞中不同的转录程序:诱导B细胞活化趋化因子CXC趋化因子配体13
J Immunol. 2004 Jun 1;172(11):7031-42. doi: 10.4049/jimmunol.172.11.7031.
7
Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix microarrays.在表达谱分析中交互式优化信噪比:Affymetrix微阵列中特定项目的算法选择和检测p值加权
Bioinformatics. 2004 Nov 1;20(16):2534-44. doi: 10.1093/bioinformatics/bth280. Epub 2004 Apr 29.
8
Gene expression changes in peripheral blood-derived dendritic cells following exposure to a contact allergen.接触性变应原暴露后外周血来源树突状细胞中的基因表达变化
Toxicol Lett. 2004 May 2;150(3):301-16. doi: 10.1016/j.toxlet.2004.02.002.
9
Gene expression profiling during the transition to failure in TNF-alpha over-expressing mice demonstrates the development of autoimmune myocarditis.在肿瘤坏死因子-α过表达小鼠向衰竭转变过程中的基因表达谱分析显示了自身免疫性心肌炎的发展。
J Mol Cell Cardiol. 2004 Apr;36(4):515-30. doi: 10.1016/j.yjmcc.2004.01.008.
10
Transcriptional profiling of estrogen-regulated gene expression via estrogen receptor (ER) alpha or ERbeta in human osteosarcoma cells: distinct and common target genes for these receptors.通过雌激素受体(ER)α或ERβ对人骨肉瘤细胞中雌激素调节基因表达进行转录谱分析:这些受体的不同和共同靶基因
Endocrinology. 2004 Jul;145(7):3473-86. doi: 10.1210/en.2003-1682. Epub 2004 Mar 19.