• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用两步特征选择技术鉴定酿酒酵母中的复制原点。

Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique.

机构信息

Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.

Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China.

出版信息

Bioinformatics. 2019 Jun 1;35(12):2075-2083. doi: 10.1093/bioinformatics/bty943.

DOI:10.1093/bioinformatics/bty943
PMID:30428009
Abstract

MOTIVATION

DNA replication is a key step to maintain the continuity of genetic information between parental generation and offspring. The initiation site of DNA replication, also called origin of replication (ORI), plays an extremely important role in the basic biochemical process. Thus, rapidly and effectively identifying the location of ORI in genome will provide key clues for genome analysis. Although biochemical experiments could provide detailed information for ORI, it requires high experimental cost and long experimental period. As good complements to experimental techniques, computational methods could overcome these disadvantages.

RESULTS

Thus, in this study, we developed a predictor called iORI-PseKNC2.0 to identify ORIs in the Saccharomyces cerevisiae genome based on sequence information. The PseKNC including 90 physicochemical properties was proposed to formulate ORI and non-ORI samples. In order to improve the accuracy, a two-step feature selection was proposed to exclude redundant and noise information. As a result, the overall success rate of 88.53% was achieved in the 5-fold cross-validation test by using support vector machine.

AVAILABILITY AND IMPLEMENTATION

Based on the proposed model, a user-friendly webserver was established and can be freely accessed at http://lin-group.cn/server/iORI-PseKNC2.0. The webserver will provide more convenience to most of wet-experimental scholars.

摘要

动机

DNA 复制是维持亲代和后代遗传信息连续性的关键步骤。DNA 复制的起始位点,也称为复制起点(ORI),在基本生化过程中起着极其重要的作用。因此,快速有效地确定基因组中 ORI 的位置将为基因组分析提供关键线索。虽然生化实验可以为 ORI 提供详细信息,但它需要高昂的实验成本和较长的实验周期。作为实验技术的良好补充,计算方法可以克服这些缺点。

结果

因此,在本研究中,我们开发了一种名为 iORI-PseKNC2.0 的预测器,用于基于序列信息识别酿酒酵母基因组中的 ORI。提出了包含 90 种物理化学性质的 PseKNC 来构建 ORI 和非 ORI 样本。为了提高准确性,我们提出了两步特征选择来排除冗余和噪声信息。结果,在 5 折交叉验证测试中,使用支持向量机获得了 88.53%的总体成功率。

可用性和实现

基于提出的模型,建立了一个用户友好的网络服务器,并可在 http://lin-group.cn/server/iORI-PseKNC2.0 上免费访问。该网络服务器将为大多数湿实验学者提供更多的便利。

相似文献

1
Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique.使用两步特征选择技术鉴定酿酒酵母中的复制原点。
Bioinformatics. 2019 Jun 1;35(12):2075-2083. doi: 10.1093/bioinformatics/bty943.
2
A computational platform to identify origins of replication sites in eukaryotes.一种用于鉴定真核生物复制起始位点的计算平台。
Brief Bioinform. 2021 Mar 22;22(2):1940-1950. doi: 10.1093/bib/bbaa017.
3
iOri-Human: identify human origin of replication by incorporating dinucleotide physicochemical properties into pseudo nucleotide composition.iOri-Human:通过将二核苷酸物理化学性质纳入伪核苷酸组成来识别人类复制起点。
Oncotarget. 2016 Oct 25;7(43):69783-69793. doi: 10.18632/oncotarget.11975.
4
Ori-Finder 3: a web server for genome-wide prediction of replication origins in Saccharomyces cerevisiae.Ori-Finder 3:一个用于酿酒酵母全基因组复制起点预测的网络服务器。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa182.
5
iRSpot-Pse6NC: Identifying recombination spots in by incorporating hexamer composition into general PseKNC.iRSpot-Pse6NC:通过将六聚体组成纳入通用 PseKNC 来识别 中的重组热点。
Int J Biol Sci. 2018 May 22;14(8):883-891. doi: 10.7150/ijbs.24616. eCollection 2018.
6
ORI-Deep: improving the accuracy for predicting origin of replication sites by using a blend of features and long short-term memory network.ORI-Deep:通过混合使用特征和长短期记忆网络来提高复制起始位点预测的准确性。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbac001.
7
iTerm-PseKNC: a sequence-based tool for predicting bacterial transcriptional terminators.iTerm-PseKNC:一种基于序列的细菌转录终止子预测工具。
Bioinformatics. 2019 May 1;35(9):1469-1477. doi: 10.1093/bioinformatics/bty827.
8
iORI-ENST: identifying origin of replication sites based on elastic net and stacking learning.基于弹性网络和堆叠学习的复制起始位点识别。
SAR QSAR Environ Res. 2021 Apr;32(4):317-331. doi: 10.1080/1062936X.2021.1895884. Epub 2021 Mar 18.
9
iCarPS: a computational tool for identifying protein carbonylation sites by novel encoded features.iCarPS:一种通过新型编码特征识别蛋白质羰基化位点的计算工具。
Bioinformatics. 2021 Apr 19;37(2):171-177. doi: 10.1093/bioinformatics/btaa702.
10
Computational prediction and interpretation of cell-specific replication origin sites from multiple eukaryotes by exploiting stacking framework.利用堆积框架从多种真核生物中计算预测和解释细胞特异性复制起始位点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa275.

引用本文的文献

1
BGATT-GR: accurate identification of glucocorticoid receptor antagonists based on data augmentation combined with BiGRU-attention.BGATT-GR:基于数据增强结合双向门控循环单元-注意力机制的糖皮质激素受体拮抗剂准确识别
Sci Rep. 2025 Jul 1;15(1):21402. doi: 10.1038/s41598-025-05839-8.
2
An efficient hybrid filter-wrapper method based on improved Harris Hawks optimization for feature selection.一种基于改进的哈里斯鹰优化算法的高效混合过滤-包装特征选择方法。
Bioimpacts. 2024 Oct 1;15:30340. doi: 10.34172/bi.30340. eCollection 2025.
3
MetaCGRP is a high-precision meta-model for large-scale identification of CGRP inhibitors using multi-view information.
MetaCGRP 是一种高精度的元模型,用于使用多视图信息大规模识别 CGRP 抑制剂。
Sci Rep. 2024 Oct 21;14(1):24764. doi: 10.1038/s41598-024-75487-x.
4
Identifying nucleotide-binding leucine-rich repeat receptor and pathogen effector pairing using transfer-learning and bilinear attention network.利用迁移学习和双线性注意网络识别核苷酸结合富含亮氨酸重复受体和病原体效应子的配对。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae581.
5
Leveraging a meta-learning approach to advance the accuracy of Na blocking peptides prediction.利用元学习方法提高 Na 阻断肽预测的准确性。
Sci Rep. 2024 Feb 23;14(1):4463. doi: 10.1038/s41598-024-55160-z.
6
Deep-STP: a deep learning-based approach to predict snake toxin proteins by using word embeddings.深度序列到蛋白预测(Deep-STP):一种基于深度学习的方法,通过词嵌入来预测蛇毒蛋白。
Front Med (Lausanne). 2024 Jan 17;10:1291352. doi: 10.3389/fmed.2023.1291352. eCollection 2023.
7
Advancing the accuracy of SARS-CoV-2 phosphorylation site detection via meta-learning approach.通过元学习方法提高 SARS-CoV-2 磷酸化位点检测的准确性。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad433.
8
Computational prediction of allergenic proteins based on multi-feature fusion.基于多特征融合的变应原蛋白的计算预测
Front Genet. 2023 Oct 19;14:1294159. doi: 10.3389/fgene.2023.1294159. eCollection 2023.
9
TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus.特罗洛普:一种基于新型序列的堆叠方法,用于加速发现丙型肝炎病毒的线性 T 细胞表位。
PLoS One. 2023 Aug 25;18(8):e0290538. doi: 10.1371/journal.pone.0290538. eCollection 2023.
10
Where and when to start: Regulating DNA replication origin activity in eukaryotic genomes.在何处以及何时开始:调控真核基因组中 DNA 复制起始点的活性。
Nucleus. 2023 Dec;14(1):2229642. doi: 10.1080/19491034.2023.2229642.