Department of Computer Science and Technology, Jilin University, Changchun 130012, China; Department of Electrical Engineering & Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA.
Department of Computer Science and Technology, Jilin University, Changchun 130012, China.
Methods. 2019 Aug 15;166:22-30. doi: 10.1016/j.ymeth.2019.05.010. Epub 2019 May 21.
Detecting cancer-related genes and their interactions is a crucial task in cancer research. For this purpose, we proposed an efficient method, to detect coding genes, microRNAs (miRNAs), and their interactions related to a particular cancer or a cancer subtype using their expression data from the same set of samples. Firstly, biclusters specific to a particular type of cancer are detected based on rectified factor networks and ranked according to their associations with general cancers. Secondly, coding genes and miRNAs in each bicluster are prioritized by considering their differential expression and differential correlation values, protein-protein interaction data, and potential cancer markers. Finally, a rank fusion process is used to obtain the final comprehensive rank by combining multiple ranking results. We applied our proposed method on breast cancer datasets. Results show that our method outperforms other methods in detecting breast cancer-related coding genes and miRNAs. Furthermore, our method is very efficient in computing time, which can handle tens of thousands genes/miRNAs and hundreds of patients in hours on a desktop. This work may aid researchers in studying the genetic architecture of complex diseases, and improving the accuracy of diagnosis.
检测与癌症相关的基因及其相互作用是癌症研究中的一项关键任务。为此,我们提出了一种有效的方法,使用来自同一组样本的表达数据,检测与特定癌症或癌症亚型相关的编码基因、微小 RNA(miRNA)及其相互作用。首先,基于校正因子网络检测特定类型癌症特有的双聚类,并根据它们与一般癌症的关联进行排序。其次,通过考虑差异表达和差异相关值、蛋白质-蛋白质相互作用数据和潜在的癌症标志物,对每个双聚类中的编码基因和 miRNA 进行优先级排序。最后,通过结合多个排序结果,使用排名融合过程获得最终的综合排名。我们在乳腺癌数据集上应用了我们提出的方法。结果表明,我们的方法在检测乳腺癌相关编码基因和 miRNA 方面优于其他方法。此外,我们的方法在计算时间上非常高效,在台式机上可以在数小时内处理数万个基因/miRNA 和数百个患者。这项工作可能有助于研究人员研究复杂疾病的遗传结构,并提高诊断的准确性。