• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过迭代特征表示计算预测物种特异性酵母 DNA 复制原点。

Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

机构信息

Department of Physiology, Ajou University School of Medicine, Republic of Korea.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa304.

DOI:10.1093/bib/bbaa304
PMID:33232970
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8294535/
Abstract

Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to the regulation of gene expression. Considering the important tasks performed by ORIs, several experimental and computational approaches have been developed in the prediction of such sites. However, existing computational predictors for ORIs have certain curbs, such as building only single-feature encoding models, limited systematic feature engineering efforts and failure to validate model robustness. Hence, we developed a novel species-specific yeast predictor called yORIpred that accurately identify ORIs in the yeast genomes. To develop yORIpred, we first constructed optimal 40 baseline models by exploring eight different sequence-based encodings and five different machine learning classifiers. Subsequently, the predicted probability of 40 models was considered as the novel feature vector and carried out iterative feature learning approach independently using five different classifiers. Our systematic analysis revealed that the feature representation learned by the support vector machine algorithm (yORIpred) could well discriminate the distribution characteristics between ORIs and non-ORIs when compared with the other four algorithms. Comprehensive benchmarking experiments showed that yORIpred achieved superior and stable performance when compared with the existing predictors on the same training datasets. Furthermore, independent evaluation showcased the best and accurate performance of yORIpred thus underscoring the significance of iterative feature representation. To facilitate the users in obtaining their desired results without undergoing any mathematical, statistical or computational hassles, we developed a web server for the yORIpred predictor, which is available at: http://thegleelab.org/yORIpred.

摘要

脱氧核糖核酸复制是细胞中进行的最重要的任务之一,必须进行精确的调控。这个过程从复制起点(ORIs)开始,因此,识别这些位点对于深入了解与基因表达调控相关的细胞过程和功能至关重要。考虑到 ORIs 执行的重要任务,已经开发了几种实验和计算方法来预测这些位点。然而,现有的 ORIs 计算预测器存在某些限制,例如仅构建单特征编码模型、系统特征工程工作有限以及未能验证模型稳健性。因此,我们开发了一种新的物种特异性酵母预测器,称为 yORIpred,可准确识别酵母基因组中的 ORIs。为了开发 yORIpred,我们首先通过探索八种不同的基于序列的编码和五种不同的机器学习分类器来构建最佳的 40 个基线模型。随后,将 40 个模型的预测概率作为新的特征向量,并使用五种不同的分类器独立进行迭代特征学习方法。我们的系统分析表明,与其他四种算法相比,支持向量机算法(yORIpred)学习的特征表示可以很好地区分 ORIs 和非 ORIs 之间的分布特征。综合基准测试实验表明,与同一训练数据集上的现有预测器相比,yORIpred 具有优越和稳定的性能。此外,独立评估展示了 yORIpred 的最佳和准确性能,从而强调了迭代特征表示的重要性。为了方便用户在不进行任何数学、统计或计算麻烦的情况下获得所需的结果,我们开发了一个 yORIpred 预测器的网络服务器,可在以下网址获得:http://thegleelab.org/yORIpred。

相似文献

1
Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.通过迭代特征表示计算预测物种特异性酵母 DNA 复制原点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa304.
2
Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework.利用堆积框架从多种真核生物中计算预测和解释细胞特异性复制起始位点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa275.
3
A computational platform to identify origins of replication sites in eukaryotes.一种用于鉴定真核生物复制起始位点的计算平台。
Brief Bioinform. 2021 Mar 22;22(2):1940-1950. doi: 10.1093/bib/bbaa017.
4
Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework.Meta-i6mA:利用集成机器学习框架中的信息特征,用于识别植物基因组中 DNA N6-甲基腺嘌呤位点的种间预测因子。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa202.
5
Meta-4mCpred: A Sequence-Based Meta-Predictor for Accurate DNA 4mC Site Prediction Using Effective Feature Representation.Meta-4mCpred:一种基于序列的元预测器,用于通过有效特征表示准确预测DNA 4mC位点。
Mol Ther Nucleic Acids. 2019 Jun 7;16:733-744. doi: 10.1016/j.omtn.2019.04.019. Epub 2019 Apr 30.
6
iRO-3wPseKNC: identify DNA replication origins by three-window-based PseKNC.iRO-3wPseKNC:通过三窗口 PseKNC 识别 DNA 复制起点。
Bioinformatics. 2018 Sep 15;34(18):3086-3093. doi: 10.1093/bioinformatics/bty312.
7
ORI-Explorer: a unified cell-specific tool for origin of replication sites prediction by feature fusion.ORI-Explorer:通过特征融合进行复制起始位点预测的统一细胞特异性工具。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad664.
8
mAHTPred: a sequence-based meta-predictor for improving the prediction of anti-hypertensive peptides using effective feature representation.mAHTPred:一种基于序列的元预测器,用于使用有效的特征表示来提高抗高血压肽的预测。
Bioinformatics. 2019 Aug 15;35(16):2757-2765. doi: 10.1093/bioinformatics/bty1047.
9
Recent advances in the genome-wide study of DNA replication origins in yeast.酵母中DNA复制起点全基因组研究的最新进展
Front Microbiol. 2015 Feb 19;6:117. doi: 10.3389/fmicb.2015.00117. eCollection 2015.
10
A deep learning framework combined with word embedding to identify DNA replication origins.深度学习框架结合词嵌入技术识别 DNA 复制起点
Sci Rep. 2021 Jan 12;11(1):844. doi: 10.1038/s41598-020-80670-x.

引用本文的文献

1
Advancing the accuracy of tyrosinase inhibitory peptides prediction via a multiview feature fusion strategy.通过多视图特征融合策略提高酪氨酸酶抑制肽预测的准确性。
Sci Rep. 2025 Feb 8;15(1):4762. doi: 10.1038/s41598-024-81807-y.
2
Meta-2OM: A multi-classifier meta-model for the accurate prediction of RNA 2'-O-methylation sites in human RNA.Meta-2OM:一种用于准确预测人类 RNA 2'-O-甲基化位点的多分类器元模型。
PLoS One. 2024 Jun 26;19(6):e0305406. doi: 10.1371/journal.pone.0305406. eCollection 2024.
3
Long extrachromosomal circular DNA identification by fusing sequence-derived features of physicochemical properties and nucleotide distribution patterns.

本文引用的文献

1
Ori-Finder 3: a web server for genome-wide prediction of replication origins in Saccharomyces cerevisiae.Ori-Finder 3:一个用于酿酒酵母全基因组复制起点预测的网络服务器。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa182.
2
Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework.利用堆积框架从多种真核生物中计算预测和解释细胞特异性复制起始位点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa275.
3
Meta-i6mA: an interspecies predictor for identifying DNA N6-methyladenine sites of plant genomes by exploiting informative features in an integrative machine-learning framework.
通过融合物理化学性质和核苷酸分布模式的序列衍生特征来鉴定长链染色体外环状DNA
Sci Rep. 2024 Apr 24;14(1):9466. doi: 10.1038/s41598-024-57457-5.
4
H2Opred: a robust and efficient hybrid deep learning model for predicting 2'-O-methylation sites in human RNA.H2Opred:一种用于预测人 RNA 2'-O-甲基化位点的稳健高效的混合深度学习模型。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad476.
5
Accurately identifying hemagglutinin using sequence information and machine learning methods.使用序列信息和机器学习方法准确识别血凝素。
Front Med (Lausanne). 2023 Oct 31;10:1281880. doi: 10.3389/fmed.2023.1281880. eCollection 2023.
6
Empirical comparison and recent advances of computational prediction of hormone binding proteins using machine learning methods.使用机器学习方法对激素结合蛋白进行计算预测的实证比较与最新进展
Comput Struct Biotechnol J. 2023 Mar 17;21:2253-2261. doi: 10.1016/j.csbj.2023.03.024. eCollection 2023.
7
Integrating LASSO Feature Selection and Soft Voting Classifier to Identify Origins of Replication Sites.整合套索特征选择与软投票分类器以识别复制起点位点
Curr Genomics. 2022 Jun 10;23(2):83-93. doi: 10.2174/1389202923666220214122506.
8
Accurate Identification of DNA Replication Origin by Fusing Epigenomics and Chromatin Interaction Information.通过融合表观基因组学和染色质相互作用信息准确鉴定DNA复制起点
Research (Wash D C). 2022 Oct 29;2022:9780293. doi: 10.34133/2022/9780293. eCollection 2022.
9
Clarion is a multi-label problem transformation method for identifying mRNA subcellular localizations.Clarion 是一种多标签问题转换方法,用于识别 mRNA 亚细胞定位。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac467.
10
TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization.TACOS:一种用于准确预测细胞特异性长非编码 RNA 亚细胞定位的新方法。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac243.
Meta-i6mA:利用集成机器学习框架中的信息特征,用于识别植物基因组中 DNA N6-甲基腺嘌呤位点的种间预测因子。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa202.
4
DeepTorrent: a deep learning-based approach for predicting DNA N4-methylcytosine sites.DeepTorrent:一种基于深度学习的方法,用于预测 DNA N4-甲基胞嘧啶位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa124.
5
DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy.DeepVF:一种基于深度学习的混合框架,使用堆叠策略识别毒力因子。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa125.
6
A Bioinformatics Tool for the Prediction of DNA N6-Methyladenine Modifications Based on Feature Fusion and Optimization Protocol.一种基于特征融合与优化协议的DNA N6-甲基腺嘌呤修饰预测的生物信息学工具。
Front Bioeng Biotechnol. 2020 Jun 4;8:502. doi: 10.3389/fbioe.2020.00502. eCollection 2020.
7
PASSION: an ensemble neural network approach for identifying the binding sites of RBPs on circRNAs.PASSION:一种用于识别 circRNAs 上 RBPs 结合位点的集成神经网络方法。
Bioinformatics. 2020 Aug 1;36(15):4276-4282. doi: 10.1093/bioinformatics/btaa522.
8
Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.利用堆叠集成学习框架对大肠杆菌中的一般和特定类型启动子进行计算预测和解释。
Brief Bioinform. 2021 Mar 22;22(2):2126-2140. doi: 10.1093/bib/bbaa049.
9
iDNA-MS: An Integrated Computational Tool for Detecting DNA Modification Sites in Multiple Genomes.iDNA-MS:一种用于检测多个基因组中DNA修饰位点的综合计算工具。
iScience. 2020 Apr 24;23(4):100991. doi: 10.1016/j.isci.2020.100991. Epub 2020 Mar 19.
10
HLPpred-Fuse: improved and robust prediction of hemolytic peptide and its activity by fusing multiple feature representation.HLPpred-Fuse:通过融合多种特征表示提高和增强溶血肽及其活性的预测
Bioinformatics. 2020 Jun 1;36(11):3350-3356. doi: 10.1093/bioinformatics/btaa160.