• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因表达的信息性或非信息性调用:一种潜在变量方法。

Informative or noninformative calls for gene expression: a latent variable approach.

作者信息

Kasim Adetayo, Lin Dan, Van Sanden Suzy, Clevert Djork-Arné, Bijnens Luc, Göhlmann Hinrich, Amaratunga Dhammika, Hochreiter Sepp, Shkedy Ziv, Talloen Willem

机构信息

Universiteit Hasselt & Katholieke Universiteit Leuven.

出版信息

Stat Appl Genet Mol Biol. 2010;9:Article 4. doi: 10.2202/1544-6115.1460. Epub 2010 Jan 6.

DOI:10.2202/1544-6115.1460
PMID:20196754
Abstract

The strength and weakness of microarray technology can be attributed to the enormous amount of information it is generating. To fully enhance the benefit of microarray technology for testing differentially expressed genes and classification, there is a need to minimize the amount of irrelevant genes present in microarray data. A major interest is to use probe-level data to call genes informative or noninformative based on the trade-off between the array-to-array variability and the measurement error. Existing works in this direction include filtering likely uninformative sets of hybridization (FLUSH; Calza et al., 2007) and I/NI calls for the exclusion of noninformative genes using FARMS (I/NI calls; Talloen et al., 2007; Hochreiter et al., 2006). In this paper, we propose a linear mixed model as a more flexible method that performs equally good as I/NI calls and outperforms FLUSH. We also introduce other criteria for gene filtering, such as, R2 and intra-cluster correlation. Additionally, we include some objective criteria based on likelihood ratio testing, the Akaike information criteria (AIC; Akaike, 1973) and the Bayesian information criterion (BIC; Schwarz, 1978 ). Based on the HGU-133A Spiked-in data set, it is shown that the linear mixed model approach outperforms FLUSH, a method that filters genes based on a quantile regression. The linear model is equivalent to a factor analysis model when either the factor loadings are set to a constant with the variance of the latent factor equal to one, or if the factor loadings are set to one together with unconstrained variance of the latent factor. Filtering based on conditional variance calls a probe set informative when the intensity of one or more probes is consistent across the arrays, while filtering using R2 or intra-cluster correlation calls a probe set informative only when average intensity of a probe set is consistent across the arrays. Filtering based on likelihood ratio test AIC and BIC are less stringent compared to the other criteria.

摘要

微阵列技术的优势与不足可归因于它所产生的海量信息。为了充分提高微阵列技术在检测差异表达基因和分类方面的效益,有必要尽量减少微阵列数据中无关基因的数量。一个主要的关注点是根据阵列间变异性与测量误差之间的权衡,利用探针水平的数据来判断基因是否具有信息性。这一方向上现有的工作包括过滤可能无信息的杂交集(FLUSH;卡尔扎等人,2007年)以及使用FARMS进行I/NI调用以排除无信息基因(I/NI调用;塔洛恩等人,2007年;霍赫雷特等人,2006年)。在本文中,我们提出一种线性混合模型,作为一种更灵活的方法,其性能与I/NI调用相当且优于FLUSH。我们还引入了其他基因过滤标准,如R2和簇内相关性。此外,我们纳入了一些基于似然比检验、赤池信息准则(AIC;赤池,1973年)和贝叶斯信息准则(BIC;施瓦茨,1978年)的客观标准。基于HGU - 133A加标数据集的结果表明,线性混合模型方法优于FLUSH,后者是一种基于分位数回归过滤基因的方法。当因子载荷设置为常数且潜在因子的方差等于1时,或者当因子载荷设置为1且潜在因子的方差不受约束时,线性模型等同于因子分析模型。基于条件方差进行过滤时,当一个或多个探针的强度在各阵列间一致时,就判定一个探针集具有信息性;而使用R2或簇内相关性进行过滤时,只有当一个探针集的平均强度在各阵列间一致时,才判定该探针集具有信息性。与其他标准相比,基于似然比检验、AIC和BIC的过滤要求没那么严格。

相似文献

1
Informative or noninformative calls for gene expression: a latent variable approach.基因表达的信息性或非信息性调用:一种潜在变量方法。
Stat Appl Genet Mol Biol. 2010;9:Article 4. doi: 10.2202/1544-6115.1460. Epub 2010 Jan 6.
2
I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data.I/NI-要求排除无信息基因:一种用于微阵列数据的高效筛选工具。
Bioinformatics. 2007 Nov 1;23(21):2897-902. doi: 10.1093/bioinformatics/btm478. Epub 2007 Oct 5.
3
Gene filtering in the analysis of Illumina microarray experiments.Illumina微阵列实验分析中的基因筛选
Stat Appl Genet Mol Biol. 2012 Jan 6;11(2):/j/sagmb.2012.11.issue-2/1544-6115.1710/1544-6115.1710.xml. doi: 10.2202/1544-6115.1710.
4
A Laplace mixture model for identification of differential expression in microarray experiments.一种用于识别微阵列实验中差异表达的拉普拉斯混合模型。
Biostatistics. 2006 Oct;7(4):630-41. doi: 10.1093/biostatistics/kxj032. Epub 2006 Mar 24.
5
An empirical Bayesian method for estimating biological networks from temporal microarray data.一种从时间微阵列数据估计生物网络的经验贝叶斯方法。
Stat Appl Genet Mol Biol. 2010;9:Article 9. doi: 10.2202/1544-6115.1513. Epub 2010 Jan 15.
6
Variance component estimation for mixed model analysis of cDNA microarray data.用于cDNA微阵列数据混合模型分析的方差成分估计
Biom J. 2008 Dec;50(6):927-39. doi: 10.1002/bimj.200810476.
7
An alternative model of type A dependence in a gene set of correlated genes.相关基因集中A型依赖性的另一种模型。
Stat Appl Genet Mol Biol. 2010;9:Article 12. doi: 10.2202/1544-6115.1525. Epub 2010 Jan 26.
8
Shrinkage estimation of effect sizes as an alternative to hypothesis testing followed by estimation in high-dimensional biology: applications to differential gene expression.作为高维生物学中假设检验后进行估计的替代方法的效应量收缩估计:在差异基因表达中的应用
Stat Appl Genet Mol Biol. 2010;9:Article23. doi: 10.2202/1544-6115.1504. Epub 2010 Jun 8.
9
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
10
Leveraging two-way probe-level block design for identifying differential gene expression with high-density oligonucleotide arrays.利用双向探针水平块设计通过高密度寡核苷酸阵列鉴定差异基因表达。
BMC Bioinformatics. 2004 Apr 20;5:42. doi: 10.1186/1471-2105-5-42.

引用本文的文献

1
The Usage of Exon-Exon Splice Junctions for the Detection of Alternative Splicing using the REIDS model.使用外显子-外显子剪接接头检测使用 REIDS 模型的可变剪接。
Sci Rep. 2018 May 29;8(1):8331. doi: 10.1038/s41598-018-26695-9.
2
A random effects model for the identification of differential splicing (REIDS) using exon and HTA arrays.一种使用外显子和HTA阵列识别差异剪接的随机效应模型(REIDS)。
BMC Bioinformatics. 2017 May 25;18(1):273. doi: 10.1186/s12859-017-1687-8.
3
Identification of in vitro and in vivo disconnects using transcriptomic data.
利用转录组数据识别体外和体内的脱节现象。
BMC Genomics. 2015 Aug 18;16(1):615. doi: 10.1186/s12864-015-1726-7.
4
Joint analysis of transcriptional and post- transcriptional brain tumor data: searching for emergent properties of cellular systems.联合分析转录和转录后脑肿瘤数据:寻找细胞系统的新属性。
BMC Bioinformatics. 2011 Mar 30;12:86. doi: 10.1186/1471-2105-12-86.