• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过异常值检测识别差异表达基因。

Identification of differentially expressed genes by means of outlier detection.

机构信息

Department of Computation Science and Artificial Intelligence, University of the Basque Country UPV/EHU, Donostia, Spain.

Department of Genetics, Microbiology and Statistics, University of Barcelona, Barcelona, Spain.

出版信息

BMC Bioinformatics. 2018 Sep 10;19(1):317. doi: 10.1186/s12859-018-2318-8.

DOI:10.1186/s12859-018-2318-8
PMID:30200879
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6131896/
Abstract

BACKGROUND

An important issue in microarray data is to select, from thousands of genes, a small number of informative differentially expressed (DE) genes which may be key elements for a disease. If each gene is analyzed individually, there is a big number of hypotheses to test and a multiple comparison correction method must be used. Consequently, the resulting cut-off value may be too small. Moreover, an important issue is the selection's replicability of the DE genes. We present a new method, called ORdensity, to obtain a reproducible selection of DE genes. It takes into account the relation between all genes and it is not a gene-by-gene approach, unlike the usually applied techniques to DE gene selection.

RESULTS

The proposed method returns three measures, related to the concepts of outlier and density of false positives in a neighbourhood, which allow us to identify the DE genes with high classification accuracy. To assess the performance of ORdensity, we used simulated microarray data and four real microarray cancer data sets. The results indicated that the method correctly detects the DE genes; it is competitive with other well accepted methods; the list of DE genes that it obtains is useful for the correct classification or diagnosis of new future samples and, in general, it is more stable than other procedures.

CONCLUSIONS

ORdensity is a new method for identifying DE genes that avoids some of the shortcomings of the individual gene identification and it is stable when the original sample is changed by subsamples.

摘要

背景

在微阵列数据中,一个重要的问题是从数千个基因中选择少数有信息的差异表达(DE)基因,这些基因可能是疾病的关键因素。如果逐个分析每个基因,那么需要测试的假设数量非常多,必须使用多重比较校正方法。因此,得出的截止值可能太小。此外,一个重要的问题是 DE 基因选择的可重复性。我们提出了一种新的方法,称为 ORdensity,以获得可重复的 DE 基因选择。它考虑了所有基因之间的关系,而不是像通常应用于 DE 基因选择的技术那样逐个基因进行分析。

结果

所提出的方法返回三个度量值,与邻域中伪阳性的异常值和密度的概念有关,这些度量值可用于识别具有高分类准确性的 DE 基因。为了评估 ORdensity 的性能,我们使用了模拟微阵列数据和四个真实的微阵列癌症数据集。结果表明,该方法可以正确检测 DE 基因;它与其他公认的方法具有竞争力;它获得的 DE 基因列表对于新未来样本的正确分类或诊断非常有用,并且通常比其他程序更稳定。

结论

ORdensity 是一种识别 DE 基因的新方法,它避免了个别基因识别的一些缺点,并且在原始样本通过子样本发生变化时也很稳定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/dedfe1e3f773/12859_2018_2318_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/c3a218026119/12859_2018_2318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/dc9d0cb78672/12859_2018_2318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/06517498f18c/12859_2018_2318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/7181760b26d1/12859_2018_2318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/53f7d907b138/12859_2018_2318_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/033d61de7a39/12859_2018_2318_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/dedfe1e3f773/12859_2018_2318_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/c3a218026119/12859_2018_2318_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/dc9d0cb78672/12859_2018_2318_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/06517498f18c/12859_2018_2318_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/7181760b26d1/12859_2018_2318_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/53f7d907b138/12859_2018_2318_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/033d61de7a39/12859_2018_2318_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d13/6131896/dedfe1e3f773/12859_2018_2318_Fig7_HTML.jpg

相似文献

1
Identification of differentially expressed genes by means of outlier detection.通过异常值检测识别差异表达基因。
BMC Bioinformatics. 2018 Sep 10;19(1):317. doi: 10.1186/s12859-018-2318-8.
2
ORdensity: user-friendly R package to identify differentially expressed genes.ORdensity:一个用户友好的 R 包,用于识别差异表达基因。
BMC Bioinformatics. 2020 Apr 7;21(1):135. doi: 10.1186/s12859-020-3463-4.
3
Ranking analysis for identifying differentially expressed genes.差异表达基因识别的排名分析。
Genomics. 2011 May;97(5):326-9. doi: 10.1016/j.ygeno.2011.03.002. Epub 2011 Mar 22.
4
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.混合遗传算法-神经网络:未预处理微阵列数据的特征提取。
Artif Intell Med. 2011 Sep;53(1):47-56. doi: 10.1016/j.artmed.2011.06.008. Epub 2011 Jul 19.
5
Confident difference criterion: a new Bayesian differentially expressed gene selection algorithm with applications.置信差异准则:一种新的贝叶斯差异表达基因选择算法及其应用
BMC Bioinformatics. 2015 Aug 7;16:245. doi: 10.1186/s12859-015-0664-3.
6
Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups.箭头图:一种新的图形工具,用于选择上调和下调的基因以及在样本亚组中差异表达的基因。
BMC Bioinformatics. 2012 Jun 26;13:147. doi: 10.1186/1471-2105-13-147.
7
A Robust Approach for Identification of Cancer Biomarkers and Candidate Drugs.一种稳健的癌症生物标志物和候选药物鉴定方法。
Medicina (Kaunas). 2019 Jun 11;55(6):269. doi: 10.3390/medicina55060269.
8
A new outlier removal approach for cDNA microarray normalization.一种用于cDNA微阵列标准化的新离群值去除方法。
Biotechniques. 2009 Aug;47(2):691-2, 694-700. doi: 10.2144/000113195.
9
MOST: detecting cancer differential gene expression.MOST:检测癌症差异基因表达。
Biostatistics. 2008 Jul;9(3):411-8. doi: 10.1093/biostatistics/kxm042. Epub 2007 Nov 29.
10
Robust gene selection methods using weighting schemes for microarray data analysis.用于微阵列数据分析的采用加权方案的稳健基因选择方法。
BMC Bioinformatics. 2017 Sep 2;18(1):389. doi: 10.1186/s12859-017-1810-x.

引用本文的文献

1
Transcriptomic Profile of Early Antral Follicles: Predictive Somatic Gene Markers of Oocyte Maturation Outcome.早期窦卵泡的转录组图谱:卵母细胞成熟结果的预测性体细胞基因标志物
Cells. 2025 May 12;14(10):704. doi: 10.3390/cells14100704.
2
ScatLay: utilizing transcriptome-wide noise for identifying and visualizing differentially expressed genes.ScatLay:利用全转录组噪声来识别和可视化差异表达基因。
Sci Rep. 2020 Oct 15;10(1):17483. doi: 10.1038/s41598-020-74564-1.
3
Identifying the Salient Genes in Microarray Data: A Novel Game Theoretic Model for the Co-Expression Network.

本文引用的文献

1
A Flexible Microarray Data Simulation Model.一种灵活的微阵列数据模拟模型。
Microarrays (Basel). 2013 Apr 17;2(2):115-30. doi: 10.3390/microarrays2020115.
2
limma powers differential expression analyses for RNA-sequencing and microarray studies.limma为RNA测序和微阵列研究提供差异表达分析的动力。
Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.
3
Empirical evaluation of consistency and accuracy of methods to detect differentially expressed genes based on microarray data.基于微阵列数据检测差异表达基因方法的一致性和准确性的实证评估。
识别微阵列数据中的显著基因:一种用于共表达网络的新型博弈论模型
Diagnostics (Basel). 2020 Aug 13;10(8):586. doi: 10.3390/diagnostics10080586.
4
ORdensity: user-friendly R package to identify differentially expressed genes.ORdensity:一个用户友好的 R 包,用于识别差异表达基因。
BMC Bioinformatics. 2020 Apr 7;21(1):135. doi: 10.1186/s12859-020-3463-4.
Comput Biol Med. 2014 Mar;46:1-10. doi: 10.1016/j.compbiomed.2013.12.002. Epub 2013 Dec 13.
4
Evaluating reproducibility of differential expression discoveries in microarray studies by considering correlated molecular changes.通过考虑相关分子变化评估微阵列研究中差异表达发现的可重复性。
Bioinformatics. 2009 Jul 1;25(13):1662-8. doi: 10.1093/bioinformatics/btp295. Epub 2009 May 5.
5
GO-2D: identifying 2-dimensional cellular-localized functional modules in Gene Ontology.GO-2D:在基因本体论中识别二维细胞定位功能模块。
BMC Genomics. 2007 Jan 24;8:30. doi: 10.1186/1471-2164-8-30.
6
Rat toxicogenomic study reveals analytical consistency across microarray platforms.大鼠毒理基因组学研究揭示了不同微阵列平台间的分析一致性。
Nat Biotechnol. 2006 Sep;24(9):1162-9. doi: 10.1038/nbt1238.
7
Microarray analysis and tumor classification.微阵列分析与肿瘤分类
N Engl J Med. 2006 Jun 8;354(23):2463-72. doi: 10.1056/NEJMra042342.
8
Linear models and empirical bayes methods for assessing differential expression in microarray experiments.用于评估微阵列实验中差异表达的线性模型和经验贝叶斯方法。
Stat Appl Genet Mol Biol. 2004;3:Article3. doi: 10.2202/1544-6115.1027. Epub 2004 Feb 12.
9
Gene selection and classification of microarray data using random forest.使用随机森林进行微阵列数据的基因选择与分类
BMC Bioinformatics. 2006 Jan 6;7:3. doi: 10.1186/1471-2105-7-3.
10
Microarray data analysis: from disarray to consolidation and consensus.微阵列数据分析:从混乱到整合与共识。
Nat Rev Genet. 2006 Jan;7(1):55-65. doi: 10.1038/nrg1749.