• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因表达数据的缺失值填补:从现有信息中恢复缺失数据的计算技术。

Missing value imputation for gene expression data: computational techniques to recover missing data from available information.

机构信息

School of Information and Communication Technology, Gold Coast Campus, Griffith University, QLD4222, Australia.

出版信息

Brief Bioinform. 2011 Sep;12(5):498-513. doi: 10.1093/bib/bbq080. Epub 2010 Dec 14.

DOI:10.1093/bib/bbq080
PMID:21156727
Abstract

Microarray gene expression data generally suffers from missing value problem due to a variety of experimental reasons. Since the missing data points can adversely affect downstream analysis, many algorithms have been proposed to impute missing values. In this survey, we provide a comprehensive review of existing missing value imputation algorithms, focusing on their underlying algorithmic techniques and how they utilize local or global information from within the data, or their use of domain knowledge during imputation. In addition, we describe how the imputation results can be validated and the different ways to assess the performance of different imputation algorithms, as well as a discussion on some possible future research directions. It is hoped that this review will give the readers a good understanding of the current development in this field and inspire them to come up with the next generation of imputation algorithms.

摘要

微阵列基因表达数据通常由于各种实验原因而存在缺失值问题。由于缺失数据点会对下游分析产生不利影响,因此已经提出了许多算法来估算缺失值。在本调查中,我们全面回顾了现有的缺失值估算算法,重点介绍了它们的基本算法技术以及它们如何利用数据内部的局部或全局信息,或者在估算过程中利用领域知识。此外,我们还描述了如何验证估算结果以及评估不同估算算法性能的不同方法,以及对一些可能的未来研究方向的讨论。希望本综述能使读者很好地了解该领域的当前发展,并激发他们提出下一代估算算法。

相似文献

1
Missing value imputation for gene expression data: computational techniques to recover missing data from available information.基因表达数据的缺失值填补:从现有信息中恢复缺失数据的计算技术。
Brief Bioinform. 2011 Sep;12(5):498-513. doi: 10.1093/bib/bbq080. Epub 2010 Dec 14.
2
Dealing with missing values in large-scale studies: microarray data imputation and beyond.处理大规模研究中的缺失值:微阵列数据插补及其他方法。
Brief Bioinform. 2010 Mar;11(2):253-64. doi: 10.1093/bib/bbp059. Epub 2009 Dec 4.
3
DNA microarray data imputation and significance analysis of differential expression.DNA微阵列数据插补与差异表达的显著性分析
Bioinformatics. 2005 Nov 15;21(22):4155-61. doi: 10.1093/bioinformatics/bti638. Epub 2005 Aug 23.
4
Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data.并行缺失值插补:一种用于微阵列数据的新型稳健缺失值估计算法。
Bioinformatics. 2005 May 15;21(10):2417-23. doi: 10.1093/bioinformatics/bti345. Epub 2005 Feb 24.
5
Ameliorative missing value imputation for robust biological knowledge inference.用于稳健生物学知识推理的改进型缺失值插补
J Biomed Inform. 2008 Aug;41(4):499-514. doi: 10.1016/j.jbi.2007.10.005. Epub 2007 Dec 31.
6
Robust imputation method for missing values in microarray data.微阵列数据中缺失值的稳健插补方法。
BMC Bioinformatics. 2007 May 3;8 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-8-S2-S6.
7
A meta-data based method for DNA microarray imputation.一种基于元数据的DNA微阵列插补方法。
BMC Bioinformatics. 2007 Mar 29;8:109. doi: 10.1186/1471-2105-8-109.
8
Integrative analysis of transcriptomic and proteomic data of Shewanella oneidensis: missing value imputation using temporal datasets.嗜温栖热放线菌转录组学和蛋白质组学数据的综合分析:利用时间数据集进行缺失值插补
Mol Biosyst. 2011 Apr;7(4):1093-104. doi: 10.1039/c0mb00260g. Epub 2011 Jan 7.
9
An ensemble approach to microarray data-based gene prioritization after missing value imputation.一种在缺失值插补后基于微阵列数据进行基因优先级排序的集成方法。
Bioinformatics. 2007 Mar 15;23(6):747-54. doi: 10.1093/bioinformatics/btm010. Epub 2007 Jan 31.
10
Improving missing value imputation of microarray data by using spot quality weights.利用斑点质量权重改进微阵列数据的缺失值插补
BMC Bioinformatics. 2006 Jun 16;7:306. doi: 10.1186/1471-2105-7-306.

引用本文的文献

1
Application of Multi-Omics Techniques in Aquatic Ecotoxicology: A Review.多组学技术在水生生态毒理学中的应用:综述
Toxics. 2025 Jul 31;13(8):653. doi: 10.3390/toxics13080653.
2
Missing data imputation of climate time series: A review.气候时间序列的缺失数据插补:综述
MethodsX. 2025 Jun 19;15:103455. doi: 10.1016/j.mex.2025.103455. eCollection 2025 Dec.
3
Evaluating patient experience in maternity services using a Bayesian belief network model.使用贝叶斯信念网络模型评估产科服务中的患者体验。
PLoS One. 2025 Feb 20;20(2):e0318612. doi: 10.1371/journal.pone.0318612. eCollection 2025.
4
Functional effects of mutations in proteins can be predicted and interpreted by guided selection of sequence covariation information.通过对序列协变信息的有针对性选择,可以预测和解释蛋白质突变的功能影响。
Proc Natl Acad Sci U S A. 2024 Jun 25;121(26):e2312335121. doi: 10.1073/pnas.2312335121. Epub 2024 Jun 18.
5
Integrative approaches based on genomic techniques in the functional studies on enhancers.基于基因组技术的增强子功能研究的综合方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad442.
6
Combining data discretization and missing value imputation for incomplete medical datasets.对不完整的医学数据集进行数据离散化和缺失值插补的组合。
PLoS One. 2023 Nov 30;18(11):e0295032. doi: 10.1371/journal.pone.0295032. eCollection 2023.
7
Plasma-Derived Exosome Proteins as Novel Diagnostic and Prognostic Biomarkers in Neuroblastoma Patients.血浆衍生外泌体蛋白作为神经母细胞瘤患者新型诊断和预后生物标志物。
Cells. 2023 Oct 25;12(21):2516. doi: 10.3390/cells12212516.
8
Infer global, predict local: Quantity-relevance trade-off in protein fitness predictions from sequence data.从序列数据推断全局,预测局部:蛋白质适应性预测中的数量-相关性权衡。
PLoS Comput Biol. 2023 Oct 26;19(10):e1011521. doi: 10.1371/journal.pcbi.1011521. eCollection 2023 Oct.
9
Applications of multi-omics analysis in human diseases.多组学分析在人类疾病中的应用。
MedComm (2020). 2023 Jul 31;4(4):e315. doi: 10.1002/mco2.315. eCollection 2023 Aug.
10
Generating in silico CODEX from a small set of immunofluorescence markers.从一小组免疫荧光标记物生成计算机模拟的CODEX。
PNAS Nexus. 2023 May 19;2(6):pgad171. doi: 10.1093/pnasnexus/pgad171. eCollection 2023 Jun.