基于深度学习的方法,通过结合卷积神经网络和极限学习机来预测 LncRNA-疾病关联。
A learning-based method to predict LncRNA-disease associations by combining CNN and ELM.
机构信息
School of Electronics and Information Engineering, Tongji University, No. 4800 Cao'an Road, Shanghai, 201804, China.
College of Computer Science and Engineering, Shenzhen University, Shenzhen, 518060, China.
出版信息
BMC Bioinformatics. 2022 Mar 22;22(Suppl 5):622. doi: 10.1186/s12859-022-04611-3.
BACKGROUND
lncRNAs play a critical role in numerous biological processes and life activities, especially diseases. Considering that traditional wet experiments for identifying uncovered lncRNA-disease associations is limited in terms of time consumption and labor cost. It is imperative to construct reliable and efficient computational models as addition for practice. Deep learning technologies have been proved to make impressive contributions in many areas, but the feasibility of it in bioinformatics has not been adequately verified.
RESULTS
In this paper, a machine learning-based model called LDACE was proposed to predict potential lncRNA-disease associations by combining Extreme Learning Machine (ELM) and Convolutional Neural Network (CNN). Specifically, the representation vectors are constructed by integrating multiple types of biology information including functional similarity and semantic similarity. Then, CNN is applied to mine both local and global features. Finally, ELM is chosen to carry out the prediction task to detect the potential lncRNA-disease associations. The proposed method achieved remarkable Area Under Receiver Operating Characteristic Curve of 0.9086 in Leave-one-out cross-validation and 0.8994 in fivefold cross-validation, respectively. In addition, 2 kinds of case studies based on lung cancer and endometrial cancer indicate the robustness and efficiency of LDACE even in a real environment.
CONCLUSIONS
Substantial results demonstrated that the proposed model is expected to be an auxiliary tool to guide and assist biomedical research, and the close integration of deep learning and biology big data will provide life sciences with novel insights.
背景
lncRNAs 在许多生物过程和生命活动中发挥着关键作用,尤其是在疾病方面。鉴于传统的湿实验在时间消耗和劳动力成本方面都受到限制,因此构建可靠且高效的计算模型作为补充是当务之急。深度学习技术已被证明在许多领域做出了令人印象深刻的贡献,但它在生物信息学中的可行性尚未得到充分验证。
结果
在本文中,我们提出了一种基于机器学习的模型 LDACE,通过结合极端学习机(ELM)和卷积神经网络(CNN)来预测潜在的 lncRNA-疾病关联。具体来说,通过整合多种生物学信息(包括功能相似性和语义相似性)来构建表示向量。然后,应用 CNN 来挖掘局部和全局特征。最后,选择 ELM 来执行预测任务,以检测潜在的 lncRNA-疾病关联。在留一交叉验证和五重交叉验证中,该方法的 Receiver Operating Characteristic 曲线下面积分别达到了 0.9086 和 0.8994,取得了显著的效果。此外,基于肺癌和子宫内膜癌的 2 种案例研究表明,即使在真实环境中,LDACE 也具有稳健性和高效性。
结论
大量结果表明,所提出的模型有望成为指导和辅助生物医学研究的辅助工具,深度学习与生物大数据的紧密结合将为生命科学提供新的视角。
相似文献
BMC Bioinformatics. 2022-3-22
IEEE/ACM Trans Comput Biol Bioinform. 2022
Int J Mol Sci. 2020-2-22
BMC Bioinformatics. 2021-6-16
引用本文的文献
本文引用的文献
Brief Bioinform. 2022-1-17
IEEE Trans Cybern. 2023-1
Brief Bioinform. 2021-9-2
Brief Bioinform. 2021-7-20
IEEE/ACM Trans Comput Biol Bioinform. 2021
IEEE/ACM Trans Comput Biol Bioinform. 2021
IEEE/ACM Trans Comput Biol Bioinform. 2022
IEEE Trans Cybern. 2020-5
IEEE/ACM Trans Comput Biol Bioinform. 2020