Department of Radiology, The Methodist Hospital Research Institute, Weill Medical College of Cornell University, Houston, TX 77030, USA.
BMC Bioinformatics. 2012 Dec 27;13:337. doi: 10.1186/1471-2105-13-337.
RNA interference (RNAi) becomes an increasingly important and effective genetic tool to study the function of target genes by suppressing specific genes of interest. This system approach helps identify signaling pathways and cellular phase types by tracking intensity and/or morphological changes of cells. The traditional RNAi screening scheme, in which one siRNA is designed to knockdown one specific mRNA target, needs a large library of siRNAs and turns out to be time-consuming and expensive.
In this paper, we propose a conceptual model, called compressed sensing RNAi (csRNAi), which employs a unique combination of group of small interfering RNAs (siRNAs) to knockdown a much larger size of genes. This strategy is based on the fact that one gene can be partially bound with several small interfering RNAs (siRNAs) and conversely, one siRNA can bind to a few genes with distinct binding affinity. This model constructs a multi-to-multi correspondence between siRNAs and their targets, with siRNAs much fewer than mRNA targets, compared with the conventional scheme. Mathematically this problem involves an underdetermined system of equations (linear or nonlinear), which is ill-posed in general. However, the recently developed compressed sensing (CS) theory can solve this problem. We present a mathematical model to describe the csRNAi system based on both CS theory and biological concerns. To build this model, we first search nucleotide motifs in a target gene set. Then we propose a machine learning based method to find the effective siRNAs with novel features, such as image features and speech features to describe an siRNA sequence. Numerical simulations show that we can reduce the siRNA library to one third of that in the conventional scheme. In addition, the features to describe siRNAs outperform the existing ones substantially.
This csRNAi system is very promising in saving both time and cost for large-scale RNAi screening experiments which may benefit the biological research with respect to cellular processes and pathways.
RNA 干扰(RNAi)通过抑制特定的感兴趣基因成为研究靶基因功能的一种越来越重要和有效的遗传工具。通过跟踪细胞强度和/或形态变化,这种系统方法有助于识别信号通路和细胞相类型。传统的 RNAi 筛选方案,其中一个 siRNA 设计用于敲低一个特定的 mRNA 靶标,需要一个大型的 siRNA 文库,结果既耗时又昂贵。
在本文中,我们提出了一种称为压缩感知 RNAi(csRNAi)的概念模型,该模型采用了一组小干扰 RNA(siRNA)的独特组合来敲低更大尺寸的基因。该策略基于这样一个事实,即一个基因可以与几个小干扰 RNA(siRNA)部分结合,反之亦然,一个 siRNA 可以与几个具有不同结合亲和力的基因结合。该模型在 siRNA 与其靶标之间建立了一个多对多的对应关系,与传统方案相比,siRNA 比 mRNA 靶标少得多。从数学上讲,这个问题涉及到一个不定方程组(线性或非线性),一般来说是不适定的。然而,最近发展的压缩感知(CS)理论可以解决这个问题。我们提出了一个基于 CS 理论和生物学考虑的 csRNAi 系统的数学模型。为了建立这个模型,我们首先在靶基因集中搜索核苷酸基序。然后,我们提出了一种基于机器学习的方法,用新的特征来寻找有效的 siRNA,如图像特征和语音特征来描述 siRNA 序列。数值模拟表明,我们可以将 siRNA 文库减少到传统方案的三分之一。此外,描述 siRNA 的特征大大优于现有特征。
csRNAi 系统在节省大规模 RNAi 筛选实验的时间和成本方面非常有前景,这可能会使细胞过程和通路方面的生物学研究受益。