• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《机器学习轻松入门》(MLme):用于机器学习驱动数据分析的综合工具包。

Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis.

作者信息

Akshay Akshay, Katoch Mitali, Shekarchizadeh Navid, Abedi Masoud, Sharma Ankush, Burkhard Fiona C, Adam Rosalyn M, Monastyrskaya Katia, Gheinani Ali Hashemi

机构信息

Functional Urology Research Group, Department for BioMedical Research DBMR, University of Bern, Switzerland.

Graduate School for Cellular and Biomedical Sciences, University of Bern, Switzerland.

出版信息

bioRxiv. 2023 Jul 4:2023.07.04.546825. doi: 10.1101/2023.07.04.546825.

DOI:10.1101/2023.07.04.546825
PMID:37461685
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10349995/
Abstract

BACKGROUND

Machine learning (ML) has emerged as a vital asset for researchers to analyze and extract valuable information from complex datasets. However, developing an effective and robust ML pipeline can present a real challenge, demanding considerable time and effort, thereby impeding research progress. Existing tools in this landscape require a profound understanding of ML principles and programming skills. Furthermore, users are required to engage in the comprehensive configuration of their ML pipeline to obtain optimal performance.

RESULTS

To address these challenges, we have developed a novel tool called (MLme) that streamlines the use of ML in research, specifically focusing on classification problems at present. By integrating four essential functionalities, namely Data Exploration, AutoML, CustomML, and Visualization, MLme fulfills the diverse requirements of researchers while eliminating the need for extensive coding efforts. To demonstrate the applicability of MLme, we conducted rigorous testing on six distinct datasets, each presenting unique characteristics and challenges. Our results consistently showed promising performance across different datasets, reaffirming the versatility and effectiveness of the tool. Additionally, by utilizing MLme's feature selection functionality, we successfully identified significant markers for CD8+ naive (BACH2), CD16+ (CD16), and CD14+ (VCAN) cell populations.

CONCLUSION

MLme serves as a valuable resource for leveraging machine learning (ML) to facilitate insightful data analysis and enhance research outcomes, while alleviating concerns related to complex coding scripts. The source code and a detailed tutorial for MLme are available at https://github.com/FunctionalUrology/MLme.

摘要

背景

机器学习(ML)已成为研究人员从复杂数据集中分析和提取有价值信息的重要工具。然而,开发一个有效且强大的机器学习流程可能是一项真正的挑战,需要大量的时间和精力,从而阻碍研究进展。这一领域现有的工具需要对机器学习原理有深入理解以及具备编程技能。此外,用户需要对其机器学习流程进行全面配置以获得最佳性能。

结果

为应对这些挑战,我们开发了一种名为(MLme)的新型工具,它简化了研究中机器学习的使用,目前特别关注分类问题。通过整合数据探索、自动机器学习、自定义机器学习和可视化这四个基本功能,MLme满足了研究人员的各种需求,同时无需进行大量编码工作。为证明MLme的适用性,我们在六个不同的数据集上进行了严格测试,每个数据集都有独特的特征和挑战。我们的结果始终显示在不同数据集上都有良好的性能,再次证明了该工具的通用性和有效性。此外,通过利用MLme的特征选择功能,我们成功识别出了CD8 + 初始(BACH2)、CD16 +(CD16)和CD14 +(VCAN)细胞群体的重要标志物。

结论

MLme是利用机器学习(ML)促进深入数据分析和提高研究成果的宝贵资源,同时减轻了与复杂编码脚本相关的担忧。MLme的源代码和详细教程可在https://github.com/FunctionalUrology/MLme获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/b90746dd9054/nihpp-2023.07.04.546825v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/718fe863b22e/nihpp-2023.07.04.546825v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/b43dcf844b17/nihpp-2023.07.04.546825v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/b90746dd9054/nihpp-2023.07.04.546825v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/718fe863b22e/nihpp-2023.07.04.546825v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/b43dcf844b17/nihpp-2023.07.04.546825v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9ec/10349995/b90746dd9054/nihpp-2023.07.04.546825v1-f0003.jpg

相似文献

1
Machine Learning Made Easy (MLme): A Comprehensive Toolkit for Machine Learning-Driven Data Analysis.《机器学习轻松入门》(MLme):用于机器学习驱动数据分析的综合工具包。
bioRxiv. 2023 Jul 4:2023.07.04.546825. doi: 10.1101/2023.07.04.546825.
2
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis.机器学习入门(MLme):一个用于机器学习驱动数据分析的全面工具包。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giad111.
3
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
4
RatInABox, a toolkit for modelling locomotion and neuronal activity in continuous environments.盒子里的老鼠,一个用于模拟连续环境中运动和神经元活动的工具包。
Elife. 2024 Feb 9;13:e85274. doi: 10.7554/eLife.85274.
5
Biomedical image classification made easier thanks to transfer and semi-supervised learning.得益于迁移学习和半监督学习,生物医学图像分类变得更加容易。
Comput Methods Programs Biomed. 2021 Jan;198:105782. doi: 10.1016/j.cmpb.2020.105782. Epub 2020 Oct 3.
6
Human behavior in image-based Road Health Inspection Systems despite the emerging AutoML.尽管出现了自动化机器学习,但基于图像的道路健康检测系统中的人类行为。
J Big Data. 2022;9(1):96. doi: 10.1186/s40537-022-00646-8. Epub 2022 Jul 20.
7
ChampKit: A framework for rapid evaluation of deep neural networks for patch-based histopathology classification.ChampKit:一种基于补丁的组织病理学分类的深度神经网络快速评估框架。
Comput Methods Programs Biomed. 2023 Sep;239:107631. doi: 10.1016/j.cmpb.2023.107631. Epub 2023 May 30.
8
AutoDC: an automatic machine learning framework for disease classification.AutoDC:一种用于疾病分类的自动化机器学习框架。
Bioinformatics. 2022 Jun 27;38(13):3415-3421. doi: 10.1093/bioinformatics/btac334.
9
ShinyLearner: A containerized benchmarking tool for machine-learning classification of tabular data.ShinyLearner:一个用于表格数据机器学习分类的容器化基准测试工具。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa026.
10
Robust biomarker screening from gene expression data by stable machine learning-recursive feature elimination methods.基于稳健机器学习-递归特征消除方法的基因表达数据的稳健生物标志物筛选。
Comput Biol Chem. 2022 Oct;100:107747. doi: 10.1016/j.compbiolchem.2022.107747. Epub 2022 Jul 29.

本文引用的文献

1
Machine Learning Made Easy (MLme): a comprehensive toolkit for machine learning-driven data analysis.机器学习入门(MLme):一个用于机器学习驱动数据分析的全面工具包。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giad111.
2
Bcl11b sustains multipotency and restricts effector programs of intestinal-resident memory CD8 T cells.Bcl11b 维持多能性并限制肠道驻留记忆 CD8 T 细胞的效应器程序。
Sci Immunol. 2023 Apr 28;8(82):eabn0484. doi: 10.1126/sciimmunol.abn0484.
3
Using machine learning to estimate the incidence rate of intimate partner violence.
利用机器学习估计亲密伴侣暴力的发生率。
Sci Rep. 2023 Apr 4;13(1):5533. doi: 10.1038/s41598-023-31846-8.
4
Themis suppresses the effector function of CD8 T cells in acute viral infection.Themis 抑制急性病毒感染中 CD8 T 细胞的效应功能。
Cell Mol Immunol. 2023 May;20(5):512-524. doi: 10.1038/s41423-023-00997-z. Epub 2023 Mar 28.
5
Faecal microbiome-based machine learning for multi-class disease diagnosis.基于粪便微生物组的机器学习用于多类疾病诊断。
Nat Commun. 2022 Nov 10;13(1):6818. doi: 10.1038/s41467-022-34405-3.
6
CSF1R defines the mononuclear phagocyte system lineage in human blood in health and COVID-19.CSF1R在健康状态和新冠疫情下定义了人类血液中的单核吞噬细胞系统谱系。
Immunother Adv. 2021 Feb 17;1(1):ltab003. doi: 10.1093/immadv/ltab003. eCollection 2021 Jan.
7
Machine learning reveals distinct gene signature profiles in lesional and nonlesional regions of inflammatory skin diseases.机器学习揭示了炎症性皮肤病皮损和非皮损区域的独特基因特征谱。
Sci Adv. 2022 Apr 29;8(17):eabn4776. doi: 10.1126/sciadv.abn4776.
8
Unexplored Antarctic meteorite collection sites revealed through machine learning.通过机器学习揭示未被探索的南极陨石收集地点。
Sci Adv. 2022 Jan 28;8(4):eabj8138. doi: 10.1126/sciadv.abj8138. Epub 2022 Jan 26.
9
CD169 Defines Activated CD14 Monocytes With Enhanced CD8 T Cell Activation Capacity.CD169 定义了具有增强 CD8 T 细胞激活能力的活化 CD14 单核细胞。
Front Immunol. 2021 Jul 28;12:697840. doi: 10.3389/fimmu.2021.697840. eCollection 2021.
10
Integration of machine learning and genome-scale metabolic modeling identifies multi-omics biomarkers for radiation resistance.机器学习和基因组代谢建模的整合确定了辐射抗性的多组学生物标志物。
Nat Commun. 2021 May 11;12(1):2700. doi: 10.1038/s41467-021-22989-1.