• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ANPS:基于机器学习的植物抗营养蛋白鉴定服务器。

ANPS: machine learning based server for identification of anti-nutritional proteins in plants.

机构信息

Division of Computer Applications, ICAR-Indian Agricultural Statistics Research Institute, Pusa, New Delhi, 110012, India.

ICAR-National Institute for Plant Biotechnology, Pusa, New Delhi, 110012, India.

出版信息

Funct Integr Genomics. 2024 Oct 25;24(6):201. doi: 10.1007/s10142-024-01474-0.

DOI:10.1007/s10142-024-01474-0
PMID:39453508
Abstract

Anti-nutrient factors are inherently present in almost all major crops, which impede the absorption of crucial vitamins and minerals upon human consumption. The commonly found anti-nutrients in food crops are saponins, tannins, lectins, and phytates etc. Currently, there is a lack of computational server for identification of proteins that encode for anti-nutritional factors in plants. Consequently, this study represents a computational approach aimed at distinguishing between proteins encoding anti-nutritional factors and those providing essential nutrients. In this work, machine learning algorithms have been employed to identify plant specific anti-nutrient factor proteins from protein sequences by using compositional features. Achieving a five-fold cross-validation training performance of 94.34% AUC-ROC and 94.13% AUC-PR with extreme gradient boosting surpasses the performance of other methods such as support vector machine, random forest, and adaptive boosting. These results suggest the proposed approach is highly reliable in predicting plant-specific anti-nutritional factor proteins. The resulting prediction models have led to the development of an online server named ANPS, freely available at https://nipb-bi.icar.gov.in .

摘要

抗营养因子几乎存在于所有主要作物中,这些因子会在人类食用时阻碍关键维生素和矿物质的吸收。在食物作物中常见的抗营养因子有皂素、单宁、凝集素和植酸盐等。目前,缺乏用于鉴定植物中编码抗营养因子的蛋白质的计算服务器。因此,本研究代表了一种计算方法,旨在区分编码抗营养因子的蛋白质和提供必需营养素的蛋白质。在这项工作中,我们使用组成特征,通过机器学习算法从蛋白质序列中识别植物特异性抗营养因子蛋白。使用极端梯度提升算法在五重交叉验证训练中的性能达到 94.34% AUC-ROC 和 94.13% AUC-PR,超过了支持向量机、随机森林和自适应提升等其他方法的性能。这些结果表明,所提出的方法在预测植物特异性抗营养因子蛋白方面具有很高的可靠性。所得到的预测模型导致了一个名为 ANPS 的在线服务器的开发,可在 https://nipb-bi.icar.gov.in 免费获得。

相似文献

1
ANPS: machine learning based server for identification of anti-nutritional proteins in plants.ANPS:基于机器学习的植物抗营养蛋白鉴定服务器。
Funct Integr Genomics. 2024 Oct 25;24(6):201. doi: 10.1007/s10142-024-01474-0.
2
ASPTF: A computational tool to predict abiotic stress-responsive transcription factors in plants by employing machine learning algorithms.ASPTF:一种通过机器学习算法预测植物非生物胁迫响应转录因子的计算工具。
Biochim Biophys Acta Gen Subj. 2024 Jun;1868(6):130597. doi: 10.1016/j.bbagen.2024.130597. Epub 2024 Mar 14.
3
ASRpro: A machine-learning computational model for identifying proteins associated with multiple abiotic stress in plants.ASRpro:一种用于识别植物中与多种非生物胁迫相关的蛋白质的机器学习计算模型。
Plant Genome. 2024 Mar;17(1):e20259. doi: 10.1002/tpg2.20259. Epub 2022 Sep 13.
4
RNAmining: A machine learning stand-alone and web server tool for RNA coding potential prediction.RNAmining:一个用于 RNA 编码潜力预测的机器学习独立和网络服务器工具。
F1000Res. 2021 Apr 26;10:323. doi: 10.12688/f1000research.52350.2. eCollection 2021.
5
PlDBPred: a novel computational model for discovery of DNA binding proteins in plants.PlDBPred:一种用于发现植物中 DNA 结合蛋白的新型计算模型。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac483.
6
Predictive etiological classification of acute ischemic stroke through interpretable machine learning algorithms: a multicenter, prospective cohort study.通过可解释的机器学习算法对急性缺血性脑卒中进行预测病因分类:一项多中心前瞻性队列研究。
BMC Med Res Methodol. 2024 Sep 10;24(1):199. doi: 10.1186/s12874-024-02331-1.
7
Prediction of Acute Respiratory Distress Syndrome in Traumatic Brain Injury Patients Based on Machine Learning Algorithms.基于机器学习算法预测创伤性脑损伤患者的急性呼吸窘迫综合征。
Medicina (Kaunas). 2023 Jan 15;59(1):171. doi: 10.3390/medicina59010171.
8
PredPSP: a novel computational tool to discover pathway-specific photosynthetic proteins in plants.PredPSP:一种新型计算工具,用于发现植物中途径特异性的光合蛋白。
Plant Mol Biol. 2024 Sep 24;114(5):106. doi: 10.1007/s11103-024-01500-6.
9
RBPLight: a computational tool for discovery of plant-specific RNA-binding proteins using light gradient boosting machine and ensemble of evolutionary features.RBPLight:一种使用光梯度提升机和进化特征集的计算工具,用于发现植物特异性 RNA 结合蛋白。
Brief Funct Genomics. 2023 Nov 10;22(5):401-410. doi: 10.1093/bfgp/elad016.
10
ASRmiRNA: Abiotic Stress-Responsive miRNA Prediction in Plants by Using Machine Learning Algorithms with Pseudo -Tuple Nucleotide Compositional Features.ASRmiRNA:基于伪元组核苷酸组成特征的机器学习算法预测植物非生物胁迫响应 miRNA
Int J Mol Sci. 2022 Jan 30;23(3):1612. doi: 10.3390/ijms23031612.

引用本文的文献

1
Harnessing Thalassochemicals: Marine Saponins as Bioactive Agents in Nutraceuticals and Food Technologies.利用海洋化学成分:海洋皂苷作为营养保健品和食品技术中的生物活性剂
Mar Drugs. 2025 May 26;23(6):227. doi: 10.3390/md23060227.

本文引用的文献

1
The influence of dataset homology and a rigorous evaluation strategy on protein secondary structure prediction.数据集同源性和严格评估策略对蛋白质二级结构预测的影响。
PLoS One. 2021 Jul 14;16(7):e0254555. doi: 10.1371/journal.pone.0254555. eCollection 2021.
2
Area under precision-recall curves for weighted and unweighted data.加权和未加权数据的精确率-召回率曲线下面积。
PLoS One. 2014 Mar 20;9(3):e92209. doi: 10.1371/journal.pone.0092209. eCollection 2014.
3
Identification of DNA-binding proteins using support vector machines and evolutionary profiles.
利用支持向量机和进化谱鉴定DNA结合蛋白。
BMC Bioinformatics. 2007 Nov 27;8:463. doi: 10.1186/1471-2105-8-463.