Suppr超能文献

采用二维结构见解的自我基准测试方法解决所有共存偏差,从而增强 RNA-seq 分析。

Enhancing RNA-seq analysis by addressing all co-existing biases using a self-benchmarking approach with 2D structural insights.

机构信息

Faculty of Synthetic Biology, Shenzhen University of Advanced Technology, Shenzhen Key Laboratory of Quantitative Synthetic Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, 1068 Xueyuan Avenue, Nanshan District, Shenzhen, 518055, China.

State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, 2199 Lishui Avenue, Nanshan District, Shenzhen, 518055, China.

出版信息

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae532.

Abstract

We introduce a groundbreaking approach: the minimum free energy-based Gaussian Self-Benchmarking (MFE-GSB) framework, designed to combat the myriad of biases inherent in RNA-seq data. Central to our methodology is the MFE concept, facilitating the adoption of a Gaussian distribution model tailored to effectively mitigate all co-existing biases within a k-mer counting scheme. The MFE-GSB framework operates on a sophisticated dual-model system, juxtaposing modeling data of uniform k-mer distribution against the real, observed sequencing data characterized by nonuniform k-mer distributions. The framework applies a Gaussian function, guided by the predetermined parameters-mean and SD-derived from modeling data, to fit unknown sequencing data. This dual comparison allows for the accurate prediction of k-mer abundances across MFE categories, enabling simultaneous correction of biases at the single k-mer level. Through validation with both engineered RNA constructs and human tissue RNA samples, its wide-ranging efficacy and applicability are demonstrated.

摘要

我们引入了一种开创性的方法

基于最小自由能的高斯自基准(MFE-GSB)框架,旨在克服 RNA-seq 数据中固有的多种偏差。我们方法的核心是 MFE 概念,它促进了采用高斯分布模型,该模型可针对有效减轻 k-mer 计数方案中所有共存偏差进行定制。MFE-GSB 框架基于复杂的双模型系统运行,将均匀 k-mer 分布的数据建模与以非均匀 k-分布特征的真实观察测序数据并列。该框架应用高斯函数,由来自建模数据的预定参数-平均值和标准差来指导,以拟合未知测序数据。这种双重比较允许在 MFE 类别中准确预测 k-mer 的丰度,从而能够在单个 k-mer 级别上同时校正偏差。通过对工程 RNA 构建体和人类组织 RNA 样本的验证,证明了其广泛的功效和适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f757/11491153/7e7e58d95127/bbae532f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验