RNAseqCovarImpute：一种优于完全案例和单值插补差异表达分析的多重插补程序。

RNAseqCovarImpute: a multiple imputation procedure that outperforms complete case and single imputation differential expression analysis.

机构信息

Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA, USA.

Center for Child Health, Behavior, and Development, Seattle Children's Research Institute, Seattle, WA, USA.

出版信息

Genome Biol. 2024 Sep 3;25(1):236. doi: 10.1186/s13059-024-03376-7.

DOI:10.1186/s13059-024-03376-7

PMID:39227979

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370143/

Abstract

Missing covariate data is a common problem that has not been addressed in observational studies of gene expression. Here, we present a multiple imputation method that accommodates high dimensional gene expression data by incorporating principal component analysis of the transcriptome into the multiple imputation prediction models to avoid bias. Simulation studies using three datasets show that this method outperforms complete case and single imputation analyses at uncovering true positive differentially expressed genes, limiting false discovery rates, and minimizing bias. This method is easily implemented via an R Bioconductor package, RNAseqCovarImpute that integrates with the limma-voom pipeline for differential expression analysis.

摘要

缺失协变量数据是观察性基因表达研究中尚未解决的一个常见问题。在这里，我们提出了一种多重插补方法，通过将转录组的主成分分析纳入多重插补预测模型，来避免偏差，从而适应高维基因表达数据。使用三个数据集的模拟研究表明，这种方法在发现真正的差异表达基因、限制假发现率和最小化偏差方面优于完全案例分析和单插补分析。这种方法可以通过一个 R Bioconductor 包 RNAseqCovarImpute 轻松实现，该包与 limma-voom 管道集成，用于差异表达分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d388/11370143/7afbb224674e/13059_2024_3376_Fig1_HTML.jpg

相似文献

RNAseqCovarImpute: a multiple imputation procedure that outperforms complete case and single imputation differential expression analysis.RNAseqCovarImpute：一种优于完全案例和单值插补差异表达分析的多重插补程序。

Genome Biol. 2024 Sep 3;25(1):236. doi: 10.1186/s13059-024-03376-7.

Regulatory network-based imputation of dropouts in single-cell RNA sequencing data.基于调控网络的单细胞 RNA 测序数据缺失值插补。

PLoS Comput Biol. 2022 Feb 17;18(2):e1009849. doi: 10.1371/journal.pcbi.1009849. eCollection 2022 Feb.

Accurate and interpretable gene expression imputation on scRNA-seq data using IGSimpute.使用 IGSimpute 实现 scRNA-seq 数据的准确和可解释的基因表达推断。

Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad124.

A robust (re-)annotation approach to generate unbiased mapping references for RNA-seq-based analyses of differential expression across closely related species.一种强大的（重新）注释方法，用于为基于RNA测序的密切相关物种间差异表达分析生成无偏映射参考。

BMC Genomics. 2016 May 24;17:392. doi: 10.1186/s12864-016-2646-x.

SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.SigEMD：一种用于单细胞 RNA 测序数据分析中差异基因表达分析的强大方法。

Methods. 2018 Aug 1;145:25-32. doi: 10.1016/j.ymeth.2018.04.017. Epub 2018 Apr 24.

CDSImpute: An ensemble similarity imputation method for single-cell RNA sequence dropouts.CDSImpute：一种用于单细胞 RNA 序列缺失的集成相似性插补方法。

Comput Biol Med. 2022 Jul;146:105658. doi: 10.1016/j.compbiomed.2022.105658. Epub 2022 May 21.

Improvements Achieved by Multiple Imputation for Single-Cell RNA-Seq Data in Clustering Analysis and Differential Expression Analysis.单细胞 RNA-Seq 数据在聚类分析和差异表达分析中通过多重插补实现的改进。

J Comput Biol. 2022 Jul;29(7):634-649. doi: 10.1089/cmb.2021.0597. Epub 2022 May 16.

Collaborative Structure-Preserved Missing Data Imputation for Single-Cell RNA-Seq Clustering.单细胞 RNA-Seq 聚类的协作结构保留缺失数据插补。

IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1480-1491. doi: 10.1109/TCBB.2024.3404013. Epub 2024 Oct 9.

SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA：用于基于参考的细菌RNA测序转录组自动分析的简单程序。

BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.

DaMiRseq-an R/Bioconductor package for data mining of RNA-Seq data: normalization, feature selection and classification.DaMiRseq-一个用于 RNA-Seq 数据挖掘的 R/Bioconductor 包：归一化、特征选择和分类。

Bioinformatics. 2018 Apr 15;34(8):1416-1418. doi: 10.1093/bioinformatics/btx795.

本文引用的文献

TRIP6 a potential diagnostic marker for colorectal cancer with glycolysis and immune infiltration association.TRIP6 是结直肠癌的一个潜在诊断标志物，与糖酵解和免疫浸润有关。

Sci Rep. 2024 Feb 19;14(1):4042. doi: 10.1038/s41598-024-54670-0.

Porphyromonas gingivalis promotes malignancy and chemo-resistance via GSK3β-mediated mitochondrial oxidative phosphorylation in human esophageal squamous cell carcinoma.牙龈卟啉单胞菌通过GSK3β介导的线粒体氧化磷酸化促进人食管鳞状细胞癌的恶性进展和化疗耐药。

Transl Oncol. 2023 Jun;32:101656. doi: 10.1016/j.tranon.2023.101656. Epub 2023 Mar 27.

Placental transcriptomic signatures of prenatal exposure to Hydroxy-Polycyclic aromatic hydrocarbons.产前暴露于羟多环芳烃的胎盘转录组特征。

Environ Int. 2023 Feb;172:107763. doi: 10.1016/j.envint.2023.107763. Epub 2023 Jan 18.

Metal mixtures modeling identifies birth weight-associated gene networks in the placentas of children born extremely preterm.金属混合物建模确定了极早产儿胎盘与出生体重相关的基因网络。

Chemosphere. 2023 Feb;313:137469. doi: 10.1016/j.chemosphere.2022.137469. Epub 2022 Dec 6.

Maternal age at birth and child attention-deficit hyperactivity disorder: causal association or familial confounding?产妇生育年龄与儿童注意缺陷多动障碍：因果关联还是家族性混杂？

J Child Psychol Psychiatry. 2023 Feb;64(2):299-310. doi: 10.1111/jcpp.13726. Epub 2022 Nov 28.

UniProt: the Universal Protein Knowledgebase in 2023.UniProt：2023 年的通用蛋白质知识库。

Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.

Machine learning with in silico analysis markedly improves survival prediction modeling in colon cancer patients.基于计算机的分析的机器学习显著改善了结肠癌患者的生存预测模型。

Cancer Med. 2023 Mar;12(6):7603-7615. doi: 10.1002/cam4.5420. Epub 2022 Nov 7.

Cohort profile: the ECHO prenatal and early childhood pathways to health consortium (ECHO-PATHWAYS).队列简介：ECHO 产前和儿童早期健康途径联盟 (ECHO-PATHWAYS)。

BMJ Open. 2022 Oct 21;12(10):e064288. doi: 10.1136/bmjopen-2022-064288.

Placental transcriptomic signatures of spontaneous preterm birth.自发性早产的胎盘转录组特征。

Am J Obstet Gynecol. 2023 Jan;228(1):73.e1-73.e18. doi: 10.1016/j.ajog.2022.07.015. Epub 2022 Jul 19.

Prenatal exposure to particulate matter and placental gene expression.产前暴露于颗粒物与胎盘基因表达。

Environ Int. 2022 Jul;165:107310. doi: 10.1016/j.envint.2022.107310. Epub 2022 May 25.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

RNAseqCovarImpute：一种优于完全案例和单值插补差异表达分析的多重插补程序。

RNAseqCovarImpute: a multiple imputation procedure that outperforms complete case and single imputation differential expression analysis.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献