• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SaPt-CNN-LSTM-AR-EA:一种用于基于时间序列的多变量DNA序列预测的混合集成学习框架。

SaPt-CNN-LSTM-AR-EA: a hybrid ensemble learning framework for time series-based multivariate DNA sequence prediction.

作者信息

Yan Wu, Tan Li, Meng-Shan Li, Sheng Sheng, Jun Wang, Fu-An Wu

机构信息

School of Biotechnology, Jiangsu University of Science & Technology, Zhenjiang, China.

School of Mathematics and Computer Science, Gannan Normal University, Ganzhou, Jiangxi, China.

出版信息

PeerJ. 2023 Oct 4;11:e16192. doi: 10.7717/peerj.16192. eCollection 2023.

DOI:10.7717/peerj.16192
PMID:37810796
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10559882/
Abstract

Biological sequence data mining is hot spot in bioinformatics. A biological sequence can be regarded as a set of characters. Time series is similar to biological sequences in terms of both representation and mechanism. Therefore, in the article, biological sequences are represented with time series to obtain biological time sequence (BTS). Hybrid ensemble learning framework (SaPt-CNN-LSTM-AR-EA) for BTS is proposed. Single-sequence and multi-sequence models are respectively constructed with self-adaption pre-training one-dimensional convolutional recurrent neural network and autoregressive fractional integrated moving average fused evolutionary algorithm. In DNA sequence experiments with six viruses, SaPt-CNN-LSTM-AR-EA realized the good overall prediction performance and the prediction accuracy and correlation respectively reached 1.7073 and 0.9186. SaPt-CNN-LSTM-AR-EA was compared with other five benchmark models so as to verify its effectiveness and stability. SaPt-CNN-LSTM-AR-EA increased the average accuracy by about 30%. The framework proposed in this article is significant in biology, biomedicine, and computer science, and can be widely applied in sequence splicing, computational biology, bioinformation, and other fields.

摘要

生物序列数据挖掘是生物信息学中的热点。生物序列可被视为一组字符。时间序列在表示和机制方面与生物序列相似。因此,在本文中,用时间序列来表示生物序列以获得生物时间序列(BTS)。提出了用于BTS的混合集成学习框架(SaPt-CNN-LSTM-AR-EA)。分别用自适应预训练一维卷积递归神经网络和自回归分数整合移动平均融合进化算法构建单序列和多序列模型。在六种病毒的DNA序列实验中,SaPt-CNN-LSTM-AR-EA实现了良好的整体预测性能,预测准确率和相关性分别达到1.7073和0.9186。将SaPt-CNN-LSTM-AR-EA与其他五个基准模型进行比较,以验证其有效性和稳定性。SaPt-CNN-LSTM-AR-EA使平均准确率提高了约30%。本文提出的框架在生物学、生物医学和计算机科学中具有重要意义,可广泛应用于序列拼接、计算生物学、生物信息学等领域。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/9364026a3219/peerj-11-16192-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/021e79bf057f/peerj-11-16192-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/94c40823ecbd/peerj-11-16192-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/5b7d0bf0ca33/peerj-11-16192-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/7bd3fa7362ed/peerj-11-16192-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/c2c5b13337a6/peerj-11-16192-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/6699e51381fa/peerj-11-16192-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/8713bbb0549f/peerj-11-16192-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/8dfd403b9aa0/peerj-11-16192-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/4165856e5b52/peerj-11-16192-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/4f827b4b33d3/peerj-11-16192-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/ce160dc2190b/peerj-11-16192-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/9364026a3219/peerj-11-16192-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/021e79bf057f/peerj-11-16192-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/94c40823ecbd/peerj-11-16192-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/5b7d0bf0ca33/peerj-11-16192-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/7bd3fa7362ed/peerj-11-16192-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/c2c5b13337a6/peerj-11-16192-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/6699e51381fa/peerj-11-16192-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/8713bbb0549f/peerj-11-16192-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/8dfd403b9aa0/peerj-11-16192-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/4165856e5b52/peerj-11-16192-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/4f827b4b33d3/peerj-11-16192-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/ce160dc2190b/peerj-11-16192-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/01dc/10559882/9364026a3219/peerj-11-16192-g012.jpg

相似文献

1
SaPt-CNN-LSTM-AR-EA: a hybrid ensemble learning framework for time series-based multivariate DNA sequence prediction.SaPt-CNN-LSTM-AR-EA:一种用于基于时间序列的多变量DNA序列预测的混合集成学习框架。
PeerJ. 2023 Oct 4;11:e16192. doi: 10.7717/peerj.16192. eCollection 2023.
2
Time series-based hybrid ensemble learning model with multivariate multidimensional feature coding for DNA methylation prediction.基于时间序列的混合集成学习模型,具有多维多维特征编码,用于 DNA 甲基化预测。
BMC Genomics. 2023 Dec 11;24(1):758. doi: 10.1186/s12864-023-09866-5.
3
A transfer learning-based CNN and LSTM hybrid deep learning model to classify motor imagery EEG signals.一种基于迁移学习的卷积神经网络和长短期记忆网络混合深度学习模型,用于对运动想象脑电信号进行分类。
Comput Biol Med. 2022 Apr;143:105288. doi: 10.1016/j.compbiomed.2022.105288. Epub 2022 Feb 10.
4
EMDLP: Ensemble multiscale deep learning model for RNA methylation site prediction.EMDLP:用于 RNA 甲基化位点预测的集成多尺度深度学习模型。
BMC Bioinformatics. 2022 Jun 8;23(1):221. doi: 10.1186/s12859-022-04756-1.
5
Machine Learning Analysis of RNA-seq Data for Diagnostic and Prognostic Prediction of Colon Cancer.基于 RNA-seq 数据的机器学习分析用于结直肠癌的诊断和预后预测。
Sensors (Basel). 2023 Mar 13;23(6):3080. doi: 10.3390/s23063080.
6
DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence.DeepD2V:一种基于深度学习的新型框架,用于从组合 DNA 序列预测转录因子结合位点。
Int J Mol Sci. 2021 May 24;22(11):5521. doi: 10.3390/ijms22115521.
7
PlncRNA-HDeep: plant long noncoding RNA prediction using hybrid deep learning based on two encoding styles.PlncRNA-HDeep:基于两种编码方式的混合深度学习进行植物长链非编码RNA预测
BMC Bioinformatics. 2021 May 12;22(Suppl 3):242. doi: 10.1186/s12859-020-03870-2.
8
Air quality index forecast in Beijing based on CNN-LSTM multi-model.基于 CNN-LSTM 多模型的北京市空气质量指数预报。
Chemosphere. 2022 Dec;308(Pt 1):136180. doi: 10.1016/j.chemosphere.2022.136180. Epub 2022 Sep 1.
9
Classification of benign and malignant subtypes of breast cancer histopathology imaging using hybrid CNN-LSTM based transfer learning.基于混合 CNN-LSTM 的迁移学习的乳腺癌组织病理学成像的良恶性亚型分类。
BMC Med Imaging. 2023 Jan 30;23(1):19. doi: 10.1186/s12880-023-00964-0.
10
Streamflow prediction using an integrated methodology based on convolutional neural network and long short-term memory networks.基于卷积神经网络和长短时记忆网络的集成方法进行流域径流预测。
Sci Rep. 2021 Sep 1;11(1):17497. doi: 10.1038/s41598-021-96751-4.

本文引用的文献

1
Enhanced Neural Network-Based Univariate Time-Series Forecasting Model for Big Data.基于增强神经网络的大数据单变量时间序列预测模型
Big Data. 2024 Apr;12(2):83-99. doi: 10.1089/big.2022.0155. Epub 2023 Feb 24.
2
Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning.基于混合分子指纹图谱和集成学习的配体生物活性预测
ACS Omega. 2023 Feb 1;8(6):5561-5570. doi: 10.1021/acsomega.2c06944. eCollection 2023 Feb 14.
3
Computed tomography-based radiomics machine learning models for prediction of histological invasiveness with sub-centimeter subsolid pulmonary nodules: a retrospective study.
基于计算机断层扫描的放射组学机器学习模型预测亚厘米磨玻璃肺结节的组织学侵袭性:一项回顾性研究。
PeerJ. 2023 Jan 10;11:e14559. doi: 10.7717/peerj.14559. eCollection 2023.
4
An Empirical Quantitative Structure-Activity Relationship Equation Assists the Discovery of High-Affinity Phosphodiesterase 4D Inhibitors as Leads to PET Radioligands.一种经验性定量构效关系方程有助于发现高亲和力磷酸二酯酶 4D 抑制剂作为 PET 放射性配体的先导化合物。
J Med Chem. 2023 Jan 26;66(2):1543-1561. doi: 10.1021/acs.jmedchem.2c01745. Epub 2023 Jan 6.
5
Quantitative Structure-Activity Relationship Studies on Alkane Chemistry Tuning Ice Nucleation.关于烷烃化学调控冰核形成的定量构效关系研究
J Phys Chem Lett. 2022 Dec 15;13(49):11564-11570. doi: 10.1021/acs.jpclett.2c03183. Epub 2022 Dec 7.
6
Machine learning based estimation of field-scale daily, high resolution, multi-depth soil moisture for the Western and Midwestern United States.基于机器学习的美国西部和中西部大尺度、高分辨率、多深度逐日土壤湿度估算。
PeerJ. 2022 Nov 4;10:e14275. doi: 10.7717/peerj.14275. eCollection 2022.
7
Quantum Chemical Roots of Machine-Learning Molecular Similarity Descriptors.机器学习分子相似性描述符的量子化学根源
J Chem Theory Comput. 2022 Nov 8;18(11):6670-6689. doi: 10.1021/acs.jctc.2c00718. Epub 2022 Oct 11.
8
Deep Neural Networks for Human's Fall-risk Prediction using Force-Plate Time Series Signal.基于测力板时间序列信号的深度神经网络用于人体跌倒风险预测
Expert Syst Appl. 2021 Nov 15;182. doi: 10.1016/j.eswa.2021.115220. Epub 2021 May 26.
9
MERRIN: MEtabolic regulation rule INference from time series data.梅林:从时间序列数据中推断代谢调节规则。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii127-ii133. doi: 10.1093/bioinformatics/btac479.
10
Mapping the dynamics of insulin-responsive pathways in the blood-brain barrier endothelium using time-series transcriptomics data.利用时间序列转录组学数据绘制血脑屏障内皮细胞中胰岛素反应途径的动态变化。
NPJ Syst Biol Appl. 2022 Aug 16;8(1):29. doi: 10.1038/s41540-022-00235-8.