• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm.GIpred:一种使用机器学习算法预测巨蛋白的计算工具。
Physiol Mol Biol Plants. 2022 Jan;28(1):1-16. doi: 10.1007/s12298-022-01130-6. Epub 2022 Jan 24.
2
Improved recognition of splice sites in by incorporating secondary structure information into sequence-derived features: a computational study.通过将二级结构信息纳入序列衍生特征来提高对剪接位点的识别:一项计算研究。
3 Biotech. 2021 Nov;11(11):484. doi: 10.1007/s13205-021-03036-8. Epub 2021 Oct 31.
3
PredCRG: A computational method for recognition of plant circadian genes by employing support vector machine with Laplace kernel.PredCRG:一种通过使用带拉普拉斯核的支持向量机识别植物生物钟基因的计算方法。
Plant Methods. 2021 Apr 26;17(1):46. doi: 10.1186/s13007-021-00744-3.
4
mLoc-mRNA: predicting multiple sub-cellular localization of mRNAs using random forest algorithm coupled with feature selection via elastic net.mLoc-mRNA:使用随机森林算法和弹性网络特征选择预测 mRNAs 的多个亚细胞定位。
BMC Bioinformatics. 2021 Jun 24;22(1):342. doi: 10.1186/s12859-021-04264-8.
5
Improved Prediction of Protein-Protein Interaction Mapping on by Using Amino Acid Sequence Features in a Supervised Learning Framework.利用监督学习框架中的氨基酸序列特征改进蛋白质相互作用预测映射。
Protein Pept Lett. 2021;28(1):74-83. doi: 10.2174/0929866527666200610141258.
6
PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine.PVP-SVM:使用支持向量机基于序列预测噬菌体病毒粒子蛋白
Front Microbiol. 2018 Mar 16;9:476. doi: 10.3389/fmicb.2018.00476. eCollection 2018.
7
nifPred: Proteome-Wide Identification and Categorization of Nitrogen-Fixation Proteins of Diaztrophs Based on Composition-Transition-Distribution Features Using Support Vector Machine.固氮预测:基于组成-转变-分布特征,利用支持向量机对固氮菌的固氮蛋白进行全蛋白质组鉴定和分类
Front Microbiol. 2018 May 29;9:1100. doi: 10.3389/fmicb.2018.01100. eCollection 2018.
8
Positive-unlabelled learning of glycosylation sites in the human proteome.人类蛋白质组中天冬酰胺糖基化位点的阳性无标记学习。
BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1.
9
Accurate in silico identification of protein succinylation sites using an iterative semi-supervised learning technique.使用迭代半监督学习技术在计算机上准确识别蛋白质琥珀酰化位点
J Theor Biol. 2015 Jun 7;374:60-5. doi: 10.1016/j.jtbi.2015.03.029. Epub 2015 Apr 2.
10
HRGPred: Prediction of herbicide resistant genes with k-mer nucleotide compositional features and support vector machine.HRGPred:基于 k--mer 核苷酸组成特征和支持向量机预测除草剂抗性基因。
Sci Rep. 2019 Jan 28;9(1):778. doi: 10.1038/s41598-018-37309-9.

本文引用的文献

1
Functional Characterization of COG1713 (YqeK) as a Novel Diadenosine Tetraphosphate Hydrolase Family.功能表征 COG1713(YqeK)作为一种新型二腺苷四磷酸水解酶家族。
J Bacteriol. 2020 Apr 27;202(10). doi: 10.1128/JB.00053-20.
2
GIGANTEA Regulates the Timing Stabilization of CONSTANS by Altering the Interaction between FKF1 and ZEITLUPE.GIGANTEA 通过改变 FKF1 和 ZEITLUPE 之间的相互作用来调节 CONSTANS 的时间稳定性。
Mol Cells. 2019 Oct 31;42(10):693-701. doi: 10.14348/molcells.2019.0199.
3
Protocol Update for large-scale genome and gene function analysis with the PANTHER classification system (v.14.0).PANTHER 分类系统(版本 14.0)进行大规模基因组和基因功能分析的方案更新。
Nat Protoc. 2019 Mar;14(3):703-721. doi: 10.1038/s41596-019-0128-8. Epub 2019 Feb 25.
4
Time-resolved interaction proteomics of the GIGANTEA protein under diurnal cycles in Arabidopsis.拟南芥中日节律中 GIGANTEA 蛋白的时间分辨互作蛋白质组学。
FEBS Lett. 2019 Feb;593(3):319-338. doi: 10.1002/1873-3468.13311. Epub 2018 Dec 28.
5
GIGANTEA-like genes control seasonal growth cessation in Populus.GIGANTEA 样基因控制杨树季节性生长停止。
New Phytol. 2018 Jun;218(4):1491-1503. doi: 10.1111/nph.15087. Epub 2018 Mar 13.
6
Cloning and characterization of a novel GIGANTEA gene in sweet potato.甘薯中一个新的巨大基因的克隆与特性分析
Plant Physiol Biochem. 2017 Jul;116:27-35. doi: 10.1016/j.plaphy.2017.04.025. Epub 2017 Apr 27.
7
DIRProt: a computational approach for discriminating insecticide resistant proteins from non-resistant proteins.DIRProt:一种区分抗杀虫剂蛋白和非抗杀虫剂蛋白的计算方法。
BMC Bioinformatics. 2017 Mar 24;18(1):190. doi: 10.1186/s12859-017-1587-y.
8
CSL encodes a leucine-rich-repeat protein implicated in red/violet light signaling to the circadian clock in Chlamydomonas.CSL编码一种富含亮氨酸重复序列的蛋白质,该蛋白质与衣藻中昼夜节律钟的红/紫光信号传导有关。
PLoS Genet. 2017 Mar 23;13(3):e1006645. doi: 10.1371/journal.pgen.1006645. eCollection 2017 Mar.
9
CGDB: a database of circadian genes in eukaryotes.CGDB:真核生物中昼夜节律基因的数据库。
Nucleic Acids Res. 2017 Jan 4;45(D1):D397-D403. doi: 10.1093/nar/gkw1028. Epub 2016 Oct 26.
10
The Pepper RING-Type E3 Ligase, CaAIP1, Functions as a Positive Regulator of Drought and High Salinity Stress Responses.辣椒环型E3连接酶CaAIP1作为干旱和高盐胁迫反应的正调控因子发挥作用。
Plant Cell Physiol. 2016 Oct;57(10):2202-2212. doi: 10.1093/pcp/pcw139. Epub 2016 Aug 8.

GIpred:一种使用机器学习算法预测巨蛋白的计算工具。

GIpred: a computational tool for prediction of GIGANTEA proteins using machine learning algorithm.

作者信息

Meher Prabina Kumar, Dash Sagarika, Sahu Tanmaya Kumar, Satpathy Subhrajit, Pradhan Sukanta Kumar

机构信息

ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India.

Division of Statistical Genetics, ICAR-IASRI, New Delhi-12, India.

出版信息

Physiol Mol Biol Plants. 2022 Jan;28(1):1-16. doi: 10.1007/s12298-022-01130-6. Epub 2022 Jan 24.

DOI:10.1007/s12298-022-01130-6
PMID:35221569
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8847649/
Abstract

UNLABELLED

In plants, GIGANTEA (GI) protein plays different biological functions including carbon and sucrose metabolism, cell wall deposition, transpiration and hypocotyl elongation. This suggests that GI is an important class of proteins. So far, the resource-intensive experimental methods have been mostly utilized for identification of GI proteins. Thus, we made an attempt in this study to develop a computational model for fast and accurate prediction of GI proteins. Ten different supervised learning algorithms i.e., SVM, RF, JRIP, J48, LMT, IBK, NB, PART, BAGG and LGB were employed for prediction, where the amino acid composition (AAC), FASGAI features and physico-chemical (PHYC) properties were used as numerical inputs for the learning algorithms. Higher accuracies i.e., 96.75% of AUC-ROC and 86.7% of AUC-PR were observed for SVM coupled with AAC + PHYC feature combination, while evaluated with five-fold cross validation. With leave-one-out cross validation, 97.29% of AUC-ROC and 87.89% of AUC-PR were respectively achieved. While the performance of the model was evaluated with an independent dataset of 18 GI sequences, 17 were observed as correctly predicted. We have also performed proteome-wide identification of GI proteins in wheat, followed by functional annotation using Gene Ontology terms. A prediction server "GIpred" is freely accessible at http://cabgrid.res.in:8080/gipred/ for proteome-wide recognition of GI proteins.

SUPPLEMENTARY INFORMATION

The online version contains supplementary material available at 10.1007/s12298-022-01130-6.

摘要

未标注

在植物中,巨大蛋白(GIGANTEA,GI)发挥着不同的生物学功能,包括碳和蔗糖代谢、细胞壁沉积、蒸腾作用以及下胚轴伸长。这表明GI是一类重要的蛋白质。到目前为止,资源密集型的实验方法大多用于GI蛋白的鉴定。因此,我们在本研究中尝试开发一种计算模型,用于快速准确地预测GI蛋白。使用了十种不同的监督学习算法,即支持向量机(SVM)、随机森林(RF)、JRIP、J48、LMT、IBK、朴素贝叶斯(NB)、PART、BAGG和LightGBM(LGB)进行预测,其中氨基酸组成(AAC)、FASGAI特征和物理化学(PHYC)性质被用作学习算法的数值输入。在五折交叉验证评估中,支持向量机与AAC + PHYC特征组合的预测准确率较高,AUC-ROC为96.75%,AUC-PR为86.7%。采用留一法交叉验证时,分别实现了97.29%的AUC-ROC和87.89%的AUC-PR。当使用包含18个GI序列的独立数据集评估模型性能时,观察到17个被正确预测。我们还对小麦中的GI蛋白进行了全蛋白质组鉴定,随后使用基因本体术语进行功能注释。预测服务器“GIpred”可在http://cabgrid.res.in:8080/gipred/免费访问,用于全蛋白质组范围内GI蛋白的识别。

补充信息

在线版本包含可在10.1007/s12298-022-01130-6获取的补充材料。