• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SECLAF:一个用于分层生物序列分类的网络服务器和深度神经网络设计工具。

SECLAF: a webserver and deep neural network design tool for hierarchical biological sequence classification.

机构信息

PIT Bioinformatics Group, Institute of Mathematics, Eötvös University, H-1117 Budapest, Hungary.

Uratim Ltd, H-1118 Budapest, Hungary.

出版信息

Bioinformatics. 2018 Jul 15;34(14):2487-2489. doi: 10.1093/bioinformatics/bty116.

DOI:10.1093/bioinformatics/bty116
PMID:29490010
Abstract

SUMMARY

Artificial intelligence tools are gaining more and more ground each year in bioinformatics. Learning algorithms can be taught for specific tasks by using the existing enormous biological databases, and the resulting models can be used for the high-quality classification of novel, un-categorized data in numerous areas, including biological sequence analysis. Here, we introduce SECLAF, a webserver that uses deep neural networks for hierarchical biological sequence classification. By applying SECLAF for residue-sequences, we have reported [Methods (2018), https://doi.org/10.1016/j.ymeth.2017.06.034] the most accurate multi-label protein classifier to date (UniProt-into 698 classes-AUC 99.99%; Gene Ontology-into 983 classes-AUC 99.45%). Our framework SECLAF can be applied for other sequence classification tasks, as we describe in the present contribution.

AVAILABILITY AND IMPLEMENTATION

The program SECLAF is implemented in Python, and is available for download, with example datasets at the website https://pitgroup.org/seclaf/. For Gene Ontology and UniProt based classifications a webserver is also available at the address above.

摘要

摘要

人工智能工具在生物信息学中每年都在获得越来越多的关注。通过使用现有的大量生物数据库,可以为特定任务教授学习算法,并且可以将由此产生的模型用于许多领域的新型、未分类数据的高质量分类,包括生物序列分析。在这里,我们介绍了 SECLAF,这是一个使用深度神经网络进行层次生物序列分类的网络服务器。通过应用 SECLAF 进行残基序列,我们报告了[方法(2018 年),https://doi.org/10.1016/j.ymeth.2017.06.034]迄今为止最准确的多标签蛋白质分类器(UniProt 分为 698 类-AUC 99.99%;基因本体论分为 983 类-AUC 99.45%)。正如我们在本研究中所描述的,我们的 SECLAF 框架可应用于其他序列分类任务。

可用性和实现

SECLAF 程序是用 Python 编写的,可在网站 https://pitgroup.org/seclaf/ 上下载,同时还提供了示例数据集。对于基因本体论和 UniProt 分类,也可以在上述地址访问网络服务器。

相似文献

1
SECLAF: a webserver and deep neural network design tool for hierarchical biological sequence classification.SECLAF:一个用于分层生物序列分类的网络服务器和深度神经网络设计工具。
Bioinformatics. 2018 Jul 15;34(14):2487-2489. doi: 10.1093/bioinformatics/bty116.
2
Near perfect protein multi-label classification with deep neural networks.利用深度神经网络实现近乎完美的蛋白质多标签分类。
Methods. 2018 Jan 1;132:50-56. doi: 10.1016/j.ymeth.2017.06.034. Epub 2017 Jul 3.
3
DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.DeepGO:使用深度本体感知分类器从序列和相互作用预测蛋白质功能。
Bioinformatics. 2018 Feb 15;34(4):660-668. doi: 10.1093/bioinformatics/btx624.
4
ECO, the Evidence & Conclusion Ontology: community standard for evidence information.ECO,即证据与结论本体论:证据信息的社区标准。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1186-D1194. doi: 10.1093/nar/gky1036.
5
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.GOLabeler:通过学习排序提高基于序列的大规模蛋白质功能预测。
Bioinformatics. 2018 Jul 15;34(14):2465-2473. doi: 10.1093/bioinformatics/bty130.
6
PFP/ESG: automated protein function prediction servers enhanced with Gene Ontology visualization tool.PFP/ESG:通过基因本体可视化工具增强的自动化蛋白质功能预测服务器。
Bioinformatics. 2015 Jan 15;31(2):271-2. doi: 10.1093/bioinformatics/btu646. Epub 2014 Oct 1.
7
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
8
ProFET: Feature engineering captures high-level protein functions.ProFET:特征工程可捕捉高级蛋白质功能。
Bioinformatics. 2015 Nov 1;31(21):3429-36. doi: 10.1093/bioinformatics/btv345. Epub 2015 Jun 30.
9
A deep neural network based hierarchical multi-label classification method.一种基于深度神经网络的层次多标签分类方法。
Rev Sci Instrum. 2020 Feb 1;91(2):024103. doi: 10.1063/1.5141161.
10
GANN: genetic algorithm neural networks for the detection of conserved combinations of features in DNA.GANN:用于检测DNA中特征保守组合的遗传算法神经网络。
BMC Bioinformatics. 2005 Feb 22;6:36. doi: 10.1186/1471-2105-6-36.

引用本文的文献

1
Tree visualizations of protein sequence embedding space enable improved functional clustering of diverse protein superfamilies.蛋白质序列嵌入空间的树状图可视化可提高不同蛋白质超家族功能聚类的效果。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac619.
2
layerUMAP: A tool for visualizing and understanding deep learning models in biological sequence classification using UMAP.layerUMAP:一种使用UMAP对生物序列分类中的深度学习模型进行可视化和理解的工具。
iScience. 2022 Nov 7;25(12):105530. doi: 10.1016/j.isci.2022.105530. eCollection 2022 Dec 22.
3
Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.
蛋白质科学与人工智能相遇:跨领域的系统评价与生化荟萃分析
Front Bioeng Biotechnol. 2022 Jul 7;10:788300. doi: 10.3389/fbioe.2022.788300. eCollection 2022.
4
Identifying super-feminine, super-masculine and sex-defining connections in the human braingraph.识别人类脑图谱中超级女性化、超级男性化和性别定义连接。
Cogn Neurodyn. 2021 Dec;15(6):949-959. doi: 10.1007/s11571-021-09687-w. Epub 2021 Jul 15.
5
Protein function prediction with gene ontology: from traditional to deep learning models.利用基因本体进行蛋白质功能预测:从传统模型到深度学习模型
PeerJ. 2021 Aug 24;9:e12019. doi: 10.7717/peerj.12019. eCollection 2021.
6
Comparison of Methods for Picking the Operational Taxonomic Units From Amplicon Sequences.从扩增子序列中挑选操作分类单元的方法比较
Front Microbiol. 2021 Mar 24;12:644012. doi: 10.3389/fmicb.2021.644012. eCollection 2021.
7
Genome-wide microsatellite characteristics of five human Plasmodium species, focusing on Plasmodium malariae and P. ovale curtisi.五种人类疟原虫全基因组微卫星特征,重点关注间日疟原虫和卵形疟原虫 curtisi。
Parasite. 2020;27:34. doi: 10.1051/parasite/2020034. Epub 2020 May 15.
8
DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks.DEEPred:基于多任务前馈深度神经网络的蛋白质自动功能预测。
Sci Rep. 2019 May 14;9(1):7344. doi: 10.1038/s41598-019-43708-3.
9
A quantitative taxonomy of human hand grasps.人手抓握的定量分类学。
J Neuroeng Rehabil. 2019 Feb 15;16(1):28. doi: 10.1186/s12984-019-0488-x.