Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong.
BMC Bioinformatics. 2010 Apr 10;11:181. doi: 10.1186/1471-2105-11-181.
Gene silencing using exogenous small interfering RNAs (siRNAs) is now a widespread molecular tool for gene functional study and new-drug target identification. The key mechanism in this technique is to design efficient siRNAs that incorporated into the RNA-induced silencing complexes (RISC) to bind and interact with the mRNA targets to repress their translations to proteins. Although considerable progress has been made in the computational analysis of siRNA binding efficacy, few joint analysis of different RNAi experiments conducted under different experimental scenarios has been done in research so far, while the joint analysis is an important issue in cross-platform siRNA efficacy prediction. A collective analysis of RNAi mechanisms for different datasets and experimental conditions can often provide new clues on the design of potent siRNAs.
An elegant multi-task learning paradigm for cross-platform siRNA efficacy prediction is proposed. Experimental studies were performed on a large dataset of siRNA sequences which encompass several RNAi experiments recently conducted by different research groups. By using our multi-task learning method, the synergy among different experiments is exploited and an efficient multi-task predictor for siRNA efficacy prediction is obtained. The 19 most popular biological features for siRNA according to their jointly importance in multi-task learning were ranked. Furthermore, the hypothesis is validated out that the siRNA binding efficacy on different messenger RNAs(mRNAs) have different conditional distribution, thus the multi-task learning can be conducted by viewing tasks at an "mRNA"-level rather than at the "experiment"-level. Such distribution diversity derived from siRNAs bound to different mRNAs help indicate that the properties of target mRNA have important implications on the siRNA binding efficacy.
The knowledge gained from our study provides useful insights on how to analyze various cross-platform RNAi data for uncovering of their complex mechanism.
利用外源小干扰 RNA(siRNAs)进行基因沉默,现在是一种广泛应用的分子工具,用于基因功能研究和新药靶标鉴定。该技术的关键机制是设计有效的 siRNAs,使其整合到 RNA 诱导的沉默复合物(RISC)中,与 mRNA 靶标结合并相互作用,抑制其翻译为蛋白质。尽管在 siRNA 结合效力的计算分析方面已经取得了相当大的进展,但迄今为止,研究中很少对不同实验条件下进行的不同 RNAi 实验进行联合分析,而联合分析是跨平台 siRNA 效力预测中的一个重要问题。对不同数据集和实验条件下的 RNAi 机制进行集体分析,通常可以为设计有效的 siRNA 提供新的线索。
提出了一种用于跨平台 siRNA 效力预测的多任务学习方法。在一个涵盖了最近由不同研究小组进行的几个 RNAi 实验的 siRNA 序列的大型数据集上进行了实验研究。通过使用我们的多任务学习方法,利用不同实验之间的协同作用,获得了一个有效的多任务 siRNA 效力预测器。根据其在多任务学习中的共同重要性,对 19 种最受欢迎的 siRNA 生物学特征进行了排名。此外,验证了一个假设,即不同信使 RNA(mRNA)上的 siRNA 结合效力具有不同的条件分布,因此可以通过在“mRNA”水平而不是“实验”水平上进行多任务学习。这种来自与不同 mRNAs 结合的 siRNAs 的分布多样性有助于表明靶 mRNA 的特性对 siRNA 结合效力有重要影响。
我们的研究获得的知识为如何分析各种跨平台 RNAi 数据以揭示其复杂机制提供了有用的见解。