• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iFeatureOmega:一个综合性平台,用于对分子序列、结构和配体数据集的特征进行工程设计、可视化和分析。

iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets.

机构信息

Collaborative Innovation Center of Henan Grain Crops, Henan Agricultural University, Zhengzhou 450046, China.

Center for Crop Genome Engineering, Henan Agricultural University, Zhengzhou 450046, China.

出版信息

Nucleic Acids Res. 2022 Jul 5;50(W1):W434-W447. doi: 10.1093/nar/gkac351.

DOI:10.1093/nar/gkac351
PMID:35524557
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9252729/
Abstract

The rapid accumulation of molecular data motivates development of innovative approaches to computationally characterize sequences, structures and functions of biological and chemical molecules in an efficient, accessible and accurate manner. Notwithstanding several computational tools that characterize protein or nucleic acids data, there are no one-stop computational toolkits that comprehensively characterize a wide range of biomolecules. We address this vital need by developing a holistic platform that generates features from sequence and structural data for a diverse collection of molecule types. Our freely available and easy-to-use iFeatureOmega platform generates, analyzes and visualizes 189 representations for biological sequences, structures and ligands. To the best of our knowledge, iFeatureOmega provides the largest scope when directly compared to the current solutions, in terms of the number of feature extraction and analysis approaches and coverage of different molecules. We release three versions of iFeatureOmega including a webserver, command line interface and graphical interface to satisfy needs of experienced bioinformaticians and less computer-savvy biologists and biochemists. With the assistance of iFeatureOmega, users can encode their molecular data into representations that facilitate construction of predictive models and analytical studies. We highlight benefits of iFeatureOmega based on three research applications, demonstrating how it can be used to accelerate and streamline research in bioinformatics, computational biology, and cheminformatics areas. The iFeatureOmega webserver is freely available at http://ifeatureomega.erc.monash.edu and the standalone versions can be downloaded from https://github.com/Superzchen/iFeatureOmega-GUI/ and https://github.com/Superzchen/iFeatureOmega-CLI/.

摘要

分子数据的快速积累促使人们开发创新方法,以高效、可及和准确的方式计算生物和化学分子的序列、结构和功能。尽管有一些用于描述蛋白质或核酸数据的计算工具,但没有一个综合性的计算工具包可以全面描述广泛的生物分子。我们通过开发一个全面的平台来解决这一关键需求,该平台从序列和结构数据中生成各种分子类型的特征。我们的免费且易于使用的 iFeatureOmega 平台为多种分子类型生成生物序列、结构和配体的 189 种表示形式。据我们所知,在直接比较当前解决方案时,iFeatureOmega 在特征提取和分析方法的数量以及不同分子的覆盖范围方面提供了最大的范围。我们发布了三个版本的 iFeatureOmega,包括一个网络服务器、命令行接口和图形界面,以满足经验丰富的生物信息学家和不太精通计算机的生物学家和生物化学家的需求。有了 iFeatureOmega 的帮助,用户可以将他们的分子数据编码为表示形式,从而方便构建预测模型和分析研究。我们基于三个研究应用突出了 iFeatureOmega 的优势,展示了如何将其用于加速和简化生物信息学、计算生物学和化学信息学领域的研究。iFeatureOmega 网络服务器可免费在 http://ifeatureomega.erc.monash.edu 上使用,独立版本可从 https://github.com/Superzchen/iFeatureOmega-GUI/ 和 https://github.com/Superzchen/iFeatureOmega-CLI/ 下载。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/3a3c755b9b05/gkac351fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/6a3a1a38af8b/gkac351figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/fc9e149a108d/gkac351fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/24da8153f472/gkac351fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/32c69de5907e/gkac351fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/bf06fa8696ca/gkac351fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/3a3c755b9b05/gkac351fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/6a3a1a38af8b/gkac351figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/fc9e149a108d/gkac351fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/24da8153f472/gkac351fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/32c69de5907e/gkac351fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/bf06fa8696ca/gkac351fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5c4b/9252729/3a3c755b9b05/gkac351fig5.jpg

相似文献

1
iFeatureOmega: an integrative platform for engineering, visualization and analysis of features from molecular sequences, structural and ligand data sets.iFeatureOmega:一个综合性平台,用于对分子序列、结构和配体数据集的特征进行工程设计、可视化和分析。
Nucleic Acids Res. 2022 Jul 5;50(W1):W434-W447. doi: 10.1093/nar/gkac351.
2
iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.iLearnPlus:一个全面的、自动化的机器学习平台,用于核酸和蛋白质序列分析、预测和可视化。
Nucleic Acids Res. 2021 Jun 4;49(10):e60. doi: 10.1093/nar/gkab122.
3
Leaf-GP: an open and automated software application for measuring growth phenotypes for arabidopsis and wheat.Leaf-GP:一款用于测量拟南芥和小麦生长表型的开放且自动化的软件应用程序。
Plant Methods. 2017 Dec 22;13:117. doi: 10.1186/s13007-017-0266-3. eCollection 2017.
4
Realtime analysis and visualization of MinION sequencing data with npReader.使用 npReader 对 MinION 测序数据进行实时分析和可视化。
Bioinformatics. 2016 Mar 1;32(5):764-6. doi: 10.1093/bioinformatics/btv658. Epub 2015 Nov 10.
5
TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data.TBtools:一个用于生物大数据交互式分析的集成工具包。
Mol Plant. 2020 Aug 3;13(8):1194-1202. doi: 10.1016/j.molp.2020.06.009. Epub 2020 Jun 23.
6
MBEToolbox: a MATLAB toolbox for sequence data analysis in molecular biology and evolution.MBE工具包:用于分子生物学和进化中序列数据分析的MATLAB工具包。
BMC Bioinformatics. 2005 Mar 22;6:64. doi: 10.1186/1471-2105-6-64.
7
iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.iFeature:一个用于从蛋白质和肽序列中提取和选择特征的 Python 包和网络服务器。
Bioinformatics. 2018 Jul 15;34(14):2499-2502. doi: 10.1093/bioinformatics/bty140.
8
Automated programming for bioinformatics algorithm deployment.用于生物信息学算法部署的自动化编程。
Bioinformatics. 2008 Feb 1;24(3):450-1. doi: 10.1093/bioinformatics/btm602. Epub 2008 Jan 3.
9
GUIDEMOL: A Python graphical user interface for molecular descriptors based on RDKit.GUIDEMOL:一个基于 RDKit 的分子描述符的 Python 图形用户界面。
Mol Inform. 2024 Jan;43(1):e202300190. doi: 10.1002/minf.202300190. Epub 2023 Nov 20.
10
Eureka-DMA: an easy-to-operate graphical user interface for fast comprehensive investigation and analysis of DNA microarray data.Eureka-DMA:一个易于操作的图形用户界面,用于快速全面地调查和分析 DNA 微阵列数据。
BMC Bioinformatics. 2014 Feb 24;15:53. doi: 10.1186/1471-2105-15-53.

引用本文的文献

1
Exo-Tox: Identifying Exotoxins from secreted bacterial proteins.外毒素:从分泌的细菌蛋白中鉴定外毒素
BioData Min. 2025 Aug 8;18(1):52. doi: 10.1186/s13040-025-00469-2.
2
siRNA Features-Automated Machine Learning of 3D Molecular Fingerprints and Structures for Therapeutic Off-Target Data.小干扰RNA特征——用于治疗性脱靶数据的3D分子指纹和结构的自动机器学习
Int J Mol Sci. 2025 Jul 16;26(14):6795. doi: 10.3390/ijms26146795.
3
PREDAC-FluB: predicting antigenic clusters of seasonal influenza B viruses with protein language model embedding based convolutional neural network.

本文引用的文献

1
Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models.Pfeature:一种用于计算广泛的蛋白质特征并构建预测模型的工具。
J Comput Biol. 2023 Feb;30(2):204-222. doi: 10.1089/cmb.2022.0241. Epub 2022 Oct 13.
2
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
3
MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.
PREDAC-FluB:基于蛋白质语言模型嵌入的卷积神经网络预测季节性乙型流感病毒的抗原簇
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf308.
4
Graph-RPI: predicting RNA-protein interactions via graph autoencoder and self-supervised learning strategies.Graph-RPI:通过图自动编码器和自监督学习策略预测RNA-蛋白质相互作用
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf292.
5
m5CStack: An integrated framework for m5C site prediction using multi-feature stacking.m5CStack:一种使用多特征堆叠进行m5C位点预测的集成框架。
Comput Struct Biotechnol J. 2025 May 12;27:1901-1912. doi: 10.1016/j.csbj.2025.05.004. eCollection 2025.
6
Prediction and validation of nanowire proteins in G20 using machine learning and feature engineering.使用机器学习和特征工程对G20中的纳米线蛋白进行预测与验证。
Comput Struct Biotechnol J. 2025 Apr 19;27:1706-1718. doi: 10.1016/j.csbj.2025.04.022. eCollection 2025.
7
TetraRNA, a tetra-class machine learning model for deciphering the coding potential derivation of RNA world.四聚体RNA,一种用于解读RNA世界编码潜能衍生的四分类机器学习模型。
Comput Struct Biotechnol J. 2025 Mar 26;27:1305-1317. doi: 10.1016/j.csbj.2025.03.039. eCollection 2025.
8
SUMO-LMNet: Lossless mapping network for predicting SUMOylation sites in SUMO1 and SUMO2 using high-dimensional features.SUMO-LMNet:用于利用高维特征预测SUMO1和SUMO2中SUMO化位点的无损映射网络。
Comput Struct Biotechnol J. 2025 Mar 6;27:1048-1059. doi: 10.1016/j.csbj.2025.03.005. eCollection 2025.
9
Phased T2T genome assemblies facilitate the mining of disease-resistance genes in .分阶段的端粒到端粒基因组组装有助于挖掘其中的抗病基因。
Hortic Res. 2024 Nov 6;12(2):uhae306. doi: 10.1093/hr/uhae306. eCollection 2025 Feb.
10
Empirical Comparison and Analysis of Artificial Intelligence-Based Methods for Identifying Phosphorylation Sites of SARS-CoV-2 Infection.基于人工智能的新冠病毒感染磷酸化位点识别方法的实证比较与分析
Int J Mol Sci. 2024 Dec 21;25(24):13674. doi: 10.3390/ijms252413674.
MathFeature:基于数学描述符的 DNA、RNA 和蛋白质序列特征提取包。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab434.
4
Representation learning applications in biological sequence analysis.生物序列分析中的表示学习应用。
Comput Struct Biotechnol J. 2021 May 23;19:3198-3208. doi: 10.1016/j.csbj.2021.05.039. eCollection 2021.
5
Structure-based protein function prediction using graph convolutional networks.基于结构的蛋白质功能预测使用图卷积网络。
Nat Commun. 2021 May 26;12(1):3168. doi: 10.1038/s41467-021-23303-9.
6
iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.iLearnPlus:一个全面的、自动化的机器学习平台,用于核酸和蛋白质序列分析、预测和可视化。
Nucleic Acids Res. 2021 Jun 4;49(10):e60. doi: 10.1093/nar/gkab122.
7
Sequence representation approaches for sequence-based protein prediction tasks that use deep learning.用于基于序列的蛋白质预测任务的序列表示方法,这些任务使用深度学习。
Brief Funct Genomics. 2021 Mar 2;20(1):61-73. doi: 10.1093/bfgp/elaa030.
8
The lncRNA Toolkit: Databases and In Silico Tools for lncRNA Analysis.长链非编码RNA工具包:用于长链非编码RNA分析的数据库和计算机模拟工具
Noncoding RNA. 2020 Dec 16;6(4):49. doi: 10.3390/ncrna6040049.
9
UniProt: the universal protein knowledgebase in 2021.UniProt:2021 年的通用蛋白质知识库。
Nucleic Acids Res. 2021 Jan 8;49(D1):D480-D489. doi: 10.1093/nar/gkaa1100.
10
Ensembl 2021.Ensembl 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D884-D891. doi: 10.1093/nar/gkaa942.