• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过一种新的集成方法提高基于微阵列的上位性数据缺失值估计的准确性。

Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach.

机构信息

Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China.

出版信息

Genomics. 2011 May;97(5):257-64. doi: 10.1016/j.ygeno.2011.03.001. Epub 2011 Mar 21.

DOI:10.1016/j.ygeno.2011.03.001
PMID:21397683
Abstract

Epistatic miniarray profiling (E-MAP) is a powerful tool for analyzing gene functions and their biological relevance. However, E-MAP data suffers from large proportion of missing values, which often results in misleading and biased analysis results. It is urgent to develop effective missing value estimation methods for E-MAP. Although several independent algorithms can be applied to achieve this goal, their performance varies significantly on different datasets, indicating different algorithms having their own advantages and disadvantages. In this paper, we propose a novel ensemble approach EMDI based on the high-level diversity to impute missing values that consists of two global and four local base estimators. Experimental results on five E-MAP datasets show that EMDI outperforms all single base algorithms, demonstrating an appropriate combination providing complementarity among different methods. Comparison results between several fusion strategies also demonstrate that the proposed high-level diversity scheme is superior to others. EMDI is freely available at www.csbio.sjtu.edu.cn/bioinf/EMDI/.

摘要

上位性微阵列分析(E-MAP)是分析基因功能及其生物学相关性的有力工具。然而,E-MAP 数据存在大量缺失值,这常常导致误导和有偏差的分析结果。因此,迫切需要开发用于 E-MAP 的有效缺失值估计方法。尽管可以应用几种独立的算法来实现这一目标,但它们在不同的数据集上的性能差异很大,这表明不同的算法各有优缺点。在本文中,我们提出了一种基于高层多样性的新型集成方法 EMDI 来估计缺失值,它由两个全局和四个局部基估计器组成。在五个 E-MAP 数据集上的实验结果表明,EMDI 优于所有单基算法,这表明适当的组合在不同方法之间提供了互补性。几种融合策略的比较结果也表明,所提出的高层多样性方案优于其他方案。EMDI 可在 www.csbio.sjtu.edu.cn/bioinf/EMDI/ 上免费获得。

相似文献

1
Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach.通过一种新的集成方法提高基于微阵列的上位性数据缺失值估计的准确性。
Genomics. 2011 May;97(5):257-64. doi: 10.1016/j.ygeno.2011.03.001. Epub 2011 Mar 21.
2
Sequential imputation for missing values.缺失值的顺序插补法。
Comput Biol Chem. 2007 Oct;31(5-6):320-7. doi: 10.1016/j.compbiolchem.2007.07.001. Epub 2007 Jul 10.
3
Autoregressive-model-based missing value estimation for DNA microarray time series data.基于自回归模型的DNA微阵列时间序列数据缺失值估计
IEEE Trans Inf Technol Biomed. 2009 Jan;13(1):131-7. doi: 10.1109/TITB.2008.2007421.
4
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.并行缺失值插补:一种用于微阵列数据的新型稳健缺失值估计算法。
Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.
5
Ameliorative missing value imputation for robust biological knowledge inference.用于稳健生物学知识推理的改进型缺失值插补
J Biomed Inform. 2008 Aug;41(4):499-514. doi: 10.1016/j.jbi.2007.10.005. Epub 2007 Dec 31.
6
Robust imputation method for missing values in microarray data.微阵列数据中缺失值的稳健插补方法。
BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-8-S2-S6.
7
DNA microarray data imputation and significance analysis of differential expression.DNA微阵列数据插补与差异表达的显著性分析
Bioinformatics. 2005 Nov 15;21(22):4155-61. doi: 10.1093/bioinformatics/bti638. Epub 2005 Aug 23.
8
An ensemble approach to microarray data-based gene prioritization after missing value imputation.一种在缺失值插补后基于微阵列数据进行基因优先级排序的集成方法。
Bioinformatics. 2007 Mar 15;23(6):747-54. doi: 10.1093/bioinformatics/btm010. Epub 2007 Jan 31.
9
Missing value imputation in DNA microarrays based on conjugate gradient method.基于共轭梯度法的 DNA 微阵列缺失值插补。
Comput Biol Med. 2012 Feb;42(2):222-7. doi: 10.1016/j.compbiomed.2011.11.011. Epub 2011 Dec 10.
10
Possibilistic approach for biclustering microarray data.用于双聚类微阵列数据的可能性方法。
Comput Biol Med. 2007 Oct;37(10):1426-36. doi: 10.1016/j.compbiomed.2007.01.005. Epub 2007 Mar 8.

引用本文的文献

1
A comprehensive survey on computational learning methods for analysis of gene expression data.关于用于基因表达数据分析的计算学习方法的全面综述。
Front Mol Biosci. 2022 Nov 7;9:907150. doi: 10.3389/fmolb.2022.907150. eCollection 2022.
2
SAWRPI: A Stacking Ensemble Framework With Adaptive Weight for Predicting ncRNA-Protein Interactions Using Sequence Information.SAWRPI:一种基于序列信息预测非编码RNA-蛋白质相互作用的具有自适应权重的堆叠集成框架。
Front Genet. 2022 Feb 28;13:839540. doi: 10.3389/fgene.2022.839540. eCollection 2022.
3
LncRBase V.2: an updated resource for multispecies lncRNAs and ClinicLSNP hosting genetic variants in lncRNAs for cancer patients.
LncRBase V.2:一个更新的多物种 lncRNA 资源,以及为癌症患者的 lncRNA 提供基因变异的 ClinicLSNP 宿主。
RNA Biol. 2021 Aug;18(8):1136-1151. doi: 10.1080/15476286.2020.1833529. Epub 2020 Oct 28.
4
BGFE: A Deep Learning Model for ncRNA-Protein Interaction Predictions Based on Improved Sequence Information.BGFE:一种基于改进序列信息的 ncRNA-蛋白质相互作用预测深度学习模型。
Int J Mol Sci. 2019 Feb 23;20(4):978. doi: 10.3390/ijms20040978.
5
Identification of the copy number variant biomarkers for breast cancer subtypes.鉴定乳腺癌亚型的拷贝数变异生物标志物。
Mol Genet Genomics. 2019 Feb;294(1):95-110. doi: 10.1007/s00438-018-1488-4. Epub 2018 Sep 10.
6
A Deep Learning Framework for Robust and Accurate Prediction of ncRNA-Protein Interactions Using Evolutionary Information.一种利用进化信息对非编码RNA-蛋白质相互作用进行稳健且准确预测的深度学习框架。
Mol Ther Nucleic Acids. 2018 Jun 1;11:337-344. doi: 10.1016/j.omtn.2018.03.001. Epub 2018 Mar 9.
7
An Algorithm for Creating Virtual Controls Using Integrated and Harmonized Longitudinal Data.一种使用整合与协调的纵向数据创建虚拟对照的算法。
Eval Health Prof. 2018 Jun;41(2):183-215. doi: 10.1177/0163278718772882. Epub 2018 May 3.
8
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.基于新型混合深度学习跨域知识整合方法的RNA-蛋白质结合基序挖掘
BMC Bioinformatics. 2017 Feb 28;18(1):136. doi: 10.1186/s12859-017-1561-8.
9
IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction.IPMiner:基于堆叠自编码器的隐藏非编码RNA-蛋白质相互作用序列模式挖掘,用于准确的计算预测。
BMC Genomics. 2016 Aug 9;17:582. doi: 10.1186/s12864-016-2931-8.
10
A hybrid imputation approach for microarray missing value estimation.一种用于微阵列缺失值估计的混合插补方法。
BMC Genomics. 2015;16 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2164-16-S9-S1. Epub 2015 Aug 17.