Fast and robust imputation for miRNA expression data using constrained least squares.

Authors

Webber James W, Elias Kevin M

Affiliation

Department of Oncology and Gynecology, Brigham and Women's Hospital, Boston, MA, USA.

Publication

BMC Bioinformatics. 2022 Apr 22;23(1):145. doi: 10.1186/s12859-022-04656-4.

DOI:10.1186/s12859-022-04656-4

PMID:35459087

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9027475/

Abstract

BACKGROUND

High dimensional transcriptome profiling, whether through next generation sequencing techniques or high-throughput arrays, may result in scattered variables with missing data. Data imputation is a common strategy to maximize the inclusion of samples by using statistical techniques to fill in missing values. However, many data imputation methods are cumbersome and risk introduction of systematic bias.

RESULTS

We present a new data imputation method that uses constrained least squares and algorithms from the inverse problems literature, and we present applications of this technique in miRNA expression analysis. The proposed technique is shown to perform imputation orders of magnitude faster than similar methods from the literature, with equal or greater accuracy.
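
The abstract does not spell out the formulation, so the following is only an illustrative sketch of what a constrained least squares imputation can look like, not the authors' algorithm. It assumes a complete reference matrix (`reference`, a hypothetical name), fits non-negative weights on the observed features of a sample by projected gradient descent, and uses those weights to predict the missing features. The non-negativity constraint, the function name `impute_sample`, and the toy data are all assumptions made for this example.

```python
import numpy as np


def impute_sample(x, reference, n_iter=2000):
    """Fill NaN entries of `x` using a non-negative least squares fit.

    x         : 1-D array of length n_features; NaN marks a missing value.
    reference : 2-D array (n_samples, n_features) with no missing values.
    Assumes at least one observed feature with non-zero reference values.
    """
    x = np.asarray(x, dtype=float)
    reference = np.asarray(reference, dtype=float)
    missing = np.isnan(x)
    if not missing.any():
        return x.copy()

    # Fit the weights on the observed features only.
    A = reference[:, ~missing].T          # shape (n_observed, n_samples)
    b = x[~missing]

    # Projected gradient descent on 0.5 * ||A w - b||^2 subject to w >= 0.
    step = 1.0 / np.linalg.norm(A, 2) ** 2    # 1 / largest eigenvalue of A^T A
    w = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ w - b)
        w = np.maximum(w - step * grad, 0.0)  # projection onto w >= 0

    # Predict the missing features from the fitted weights.
    filled = x.copy()
    filled[missing] = reference[:, missing].T @ w
    return filled


# Toy data (made up): three complete reference profiles over six features.
reference = np.array([
    [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    [6.0, 5.0, 4.0, 3.0, 2.0, 1.0],
    [1.0, 3.0, 1.0, 3.0, 1.0, 3.0],
])
# A sample whose last two features are missing.
x = np.array([2.25, 3.0, 2.75, 3.5, np.nan, np.nan])
filled = impute_sample(x, reference)
```

Observed entries are left untouched and only the NaN positions are filled. A fixed iteration count keeps the sketch short; a practical implementation would more likely use a convergence test or a library solver.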

CONCLUSIONS

This study offers a robust and efficient algorithm for data imputation, which can be used, e.g., to improve cancer prediction accuracy in the presence of missing data.

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cc10/9027475/1578e6787d1f/12859_2022_4656_Fig1_HTML.jpg
