• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于质谱指纹图谱的基因本体论全基因组规模预测揭示了新的代谢基因功能。

Genome-scale prediction of gene ontology from mass fingerprints reveals new metabolic gene functions.

作者信息

Vavricka Christopher J, Mochizuki Masao, Yuzawa Satoshi, Murata Masahiro, Yoshida Takanobu, Watanabe Naoki, Nakatsui Masahiko, Ishii Jun, Hara Kiyotaka Y, Alper Hal S, Hasunuma Tomohisa, Kondo Akihiko, Araki Michihiro

机构信息

Department of Biotechnology and Life Science, Tokyo University of Agriculture and Technology, Koganei, Japan.

Bacchus Bio Innovation, Kobe, Japan.

出版信息

Life Sci Alliance. 2025 Sep 10;8(11). doi: 10.26508/lsa.202403154. Print 2025 Nov.

DOI:10.26508/lsa.202403154
PMID:40930839
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12423557/
Abstract

Mass-based fingerprinting can characterize microorganisms; however, expansion of these methods to predict specific gene functions is lacking. Therefore, mass fingerprinting was developed to functionally profile a yeast knockout library. Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) fingerprints of 3,238 knockouts were digitized for correlation with gene ontology (GO). Random forests and support vector machine (SVM) algorithms assigned GO terms with average AUC values of 0.994 and 0.980, respectively. SVM was the best predictor with average true-positive and true-negative rates of 0.983 and 0.993, respectively. To test predictions of unknown gene functions, the dataset of uncharacterized yeast gene knockouts was evaluated based on SVM scores, and new functions were suggested for 28 corresponding genes. Metabolomics analysis of two knockouts (YDR215C and YLR122C) of uncharacterized genes predicted to be involved in methylation-related metabolism showed altered intracellular contents of methionine-related metabolites. Increased S-adenosylmethionine in YDR215C indicated that this strain shows potential as a chassis for bioproduction of methylated compounds. This study demonstrates that fingerprinting can generate large functional datasets for improved machine learning-based gene function prediction.

摘要

基于质量的指纹图谱可对微生物进行表征;然而,这些方法在预测特定基因功能方面的扩展尚显不足。因此,开发了质量指纹图谱来对酵母基因敲除文库进行功能分析。对3238个基因敲除株的基质辅助激光解吸/电离飞行时间(MALDI-TOF)指纹图谱进行数字化处理,以与基因本体论(GO)进行关联。随机森林算法和支持向量机(SVM)算法分别以平均0.994和0.980的曲线下面积(AUC)值分配GO术语。SVM是最佳预测器,平均真阳性率和真阴性率分别为0.983和0.993。为了测试对未知基因功能的预测,基于SVM分数对未表征的酵母基因敲除数据集进行了评估,并为28个相应基因提出了新功能。对预测参与甲基化相关代谢的两个未表征基因的基因敲除株(YDR215C和YLR122C)进行代谢组学分析,结果显示蛋氨酸相关代谢物的细胞内含量发生了变化。YDR215C中S-腺苷甲硫氨酸的增加表明该菌株具有作为甲基化化合物生物生产底盘的潜力。本研究表明,指纹图谱可生成大型功能数据集,以改进基于机器学习的基因功能预测。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/624e3e7618b3/LSA-2024-03154_Fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/f7320202c7cf/LSA-2024-03154_Fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/ba5a4d95835e/LSA-2024-03154_Fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/9c42c99170a0/LSA-2024-03154_Fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/95e68de108f8/LSA-2024-03154_FigS1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/7082d653e428/LSA-2024-03154_Fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/9a81d72d2a1e/LSA-2024-03154_Fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/e4d8f8e9f2f2/LSA-2024-03154_Fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/8587b78aaf1f/LSA-2024-03154_Fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/4762571a35c7/LSA-2024-03154_FigS2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/ca3fa173ff0f/LSA-2024-03154_FigS3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/624e3e7618b3/LSA-2024-03154_Fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/f7320202c7cf/LSA-2024-03154_Fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/ba5a4d95835e/LSA-2024-03154_Fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/9c42c99170a0/LSA-2024-03154_Fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/95e68de108f8/LSA-2024-03154_FigS1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/7082d653e428/LSA-2024-03154_Fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/9a81d72d2a1e/LSA-2024-03154_Fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/e4d8f8e9f2f2/LSA-2024-03154_Fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/8587b78aaf1f/LSA-2024-03154_Fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/4762571a35c7/LSA-2024-03154_FigS2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/ca3fa173ff0f/LSA-2024-03154_FigS3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/788d/12423557/624e3e7618b3/LSA-2024-03154_Fig8.jpg

相似文献

1
Genome-scale prediction of gene ontology from mass fingerprints reveals new metabolic gene functions.基于质谱指纹图谱的基因本体论全基因组规模预测揭示了新的代谢基因功能。
Life Sci Alliance. 2025 Sep 10;8(11). doi: 10.26508/lsa.202403154. Print 2025 Nov.
2
Identification of Southeast Asian Anopheles mosquito species with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry using a cross-correlation approach.使用互相关方法通过基质辅助激光解吸/电离飞行时间质谱法鉴定东南亚按蚊种类
Parasit Vectors. 2025 Jan 16;18(1):8. doi: 10.1186/s13071-024-06655-1.
3
Taxonomic identification of Morocco scorpions using MALDI-MS fingerprints of venom proteomes and computational modeling.利用 MALDI-MS 毒液蛋白质组指纹图谱和计算建模对摩洛哥蝎子进行分类鉴定。
J Proteomics. 2025 Jan 6;310:105321. doi: 10.1016/j.jprot.2024.105321. Epub 2024 Sep 19.
4
Machine learning based screening of biomarkers associated with cell death and immunosuppression of multiple life stages sepsis populations.基于机器学习对与多生命阶段脓毒症人群细胞死亡和免疫抑制相关生物标志物的筛选。
Sci Rep. 2025 Aug 19;15(1):30302. doi: 10.1038/s41598-025-14600-0.
5
Automated identification of serotype using MALDI-TOF mass spectrometry and machine learning techniques.使用基质辅助激光解吸电离飞行时间质谱(MALDI-TOF)和机器学习技术自动鉴定血清型。
J Clin Microbiol. 2025 Jul 9;63(7):e0003725. doi: 10.1128/jcm.00037-25. Epub 2025 Jun 11.
6
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
7
Serum peptide biomarkers by MALDI-TOF MS coupled with machine learning for diagnosis and classification of hepato-pancreato-biliary cancers.通过基质辅助激光解吸电离飞行时间质谱结合机器学习检测血清肽生物标志物用于肝胰胆管癌的诊断和分类
Sci Rep. 2025 Aug 9;15(1):29169. doi: 10.1038/s41598-025-14451-9.
8
Machine learning and matrix-assisted laser desorption/ionization time-of-flight mass spectra for antimicrobial resistance prediction: A systematic review of recent advancements and future development.用于抗菌药物耐药性预测的机器学习与基质辅助激光解吸/电离飞行时间质谱:近期进展与未来发展的系统评价
J Chromatogr A. 2024 Oct 11;1734:465262. doi: 10.1016/j.chroma.2024.465262. Epub 2024 Aug 23.
9
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
10
How to use learning curves to evaluate the sample size for malaria prediction models developed using machine learning algorithms.如何使用学习曲线评估利用机器学习算法开发的疟疾预测模型的样本量。
Malar J. 2025 Jul 24;24(1):242. doi: 10.1186/s12936-025-05479-3.

本文引用的文献

1
EnzymeNet: residual neural networks model for Enzyme Commission number prediction.酶网络:用于预测酶委员会编号的残差神经网络模型。
Bioinform Adv. 2023 Nov 24;3(1):vbad173. doi: 10.1093/bioadv/vbad173. eCollection 2023.
2
The proteomic landscape of genome-wide genetic perturbations.全基因组遗传扰动的蛋白质组学全景。
Cell. 2023 Apr 27;186(9):2018-2034.e21. doi: 10.1016/j.cell.2023.03.026. Epub 2023 Apr 19.
3
High-throughput metabolomics for the design and validation of a diauxic shift model.高通量代谢组学在双相转变模型设计和验证中的应用。
NPJ Syst Biol Appl. 2023 Apr 7;9(1):11. doi: 10.1038/s41540-023-00274-9.
4
A roadmap to establish a comprehensive platform for sustainable manufacturing of natural products in yeast.建立酵母中天然产物可持续制造综合平台的路线图。
Nat Commun. 2023 Apr 6;14(1):1916. doi: 10.1038/s41467-023-37627-1.
5
Enzyme function prediction using contrastive learning.使用对比学习进行酶功能预测。
Science. 2023 Mar 31;379(6639):1358-1363. doi: 10.1126/science.adf2465. Epub 2023 Mar 30.
6
Comprehensive Machine Learning Prediction of Extensive Enzymatic Reactions.全面的机器学习预测广泛的酶反应。
J Phys Chem B. 2022 Sep 15;126(36):6762-6770. doi: 10.1021/acs.jpcb.2c03287. Epub 2022 Sep 2.
7
Machine learning discovery of missing links that mediate alternative branches to plant alkaloids.机器学习发现介导植物生物碱替代分支的缺失环节。
Nat Commun. 2022 Mar 16;13(1):1405. doi: 10.1038/s41467-022-28883-8.
8
Predictive Potential of MALDI-TOF Analyses for Wine and Brewing Yeast.基质辅助激光解吸电离飞行时间分析对葡萄酒和酿造酵母的预测潜力
Microorganisms. 2022 Jan 24;10(2):265. doi: 10.3390/microorganisms10020265.
9
Bacterial species identification using MALDI-TOF mass spectrometry and machine learning techniques: A large-scale benchmarking study.使用基质辅助激光解吸电离飞行时间质谱和机器学习技术进行细菌物种鉴定:一项大规模基准研究
Comput Struct Biotechnol J. 2021 Nov 9;19:6157-6168. doi: 10.1016/j.csbj.2021.11.004. eCollection 2021.
10
Model-guided development of an evolutionarily stable yeast chassis.基于模型的进化稳定酵母底盘的开发。
Mol Syst Biol. 2021 Jul;17(7):e10253. doi: 10.15252/msb.202110253.