• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一门面向生物学家的、易于理解、灵活且实用的机器学习工作坊。

An approachable, flexible and practical machine learning workshop for biologists.

机构信息

Morgridge Institute for Research, Madison, WI 53715, USA.

Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA.

出版信息

Bioinformatics. 2022 Jun 24;38(Suppl 1):i10-i18. doi: 10.1093/bioinformatics/btac233.

DOI:10.1093/bioinformatics/btac233
PMID:35758797
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9236579/
Abstract

SUMMARY

The increasing prevalence and importance of machine learning in biological research have created a need for machine learning training resources tailored towards biological researchers. However, existing resources are often inaccessible, infeasible or inappropriate for biologists because they require significant computational and mathematical knowledge, demand an unrealistic time-investment or teach skills primarily for computational researchers. We created the Machine Learning for Biologists (ML4Bio) workshop, a short, intensive workshop that empowers biological researchers to comprehend machine learning applications and pursue machine learning collaborations in their own research. The ML4Bio workshop focuses on classification and was designed around three principles: (i) emphasizing preparedness over fluency or expertise, (ii) necessitating minimal coding and mathematical background and (iii) requiring low time investment. It incorporates active learning methods and custom open-source software that allows participants to explore machine learning workflows. After multiple sessions to improve workshop design, we performed a study on three workshop sessions. Despite some confusion around identifying subtle methodological flaws in machine learning workflows, participants generally reported that the workshop met their goals, provided them with valuable skills and knowledge and greatly increased their beliefs that they could engage in research that uses machine learning. ML4Bio is an educational tool for biological researchers, and its creation and evaluation provide valuable insight into tailoring educational resources for active researchers in different domains.

AVAILABILITY AND IMPLEMENTATION

Workshop materials are available at https://github.com/carpentries-incubator/ml4bio-workshop and the ml4bio software is available at https://github.com/gitter-lab/ml4bio.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

机器学习在生物研究中的日益普及和重要性,使得针对生物研究人员的机器学习培训资源成为一种需求。然而,现有的资源对于生物学家来说通常是无法访问、不可行或不适用的,因为它们需要大量的计算和数学知识,要求不切实际的时间投入,或者教授的技能主要是针对计算研究人员的。我们创建了机器学习生物学家研讨会(ML4Bio),这是一个短期、密集的研讨会,使生物研究人员能够理解机器学习应用,并在自己的研究中追求机器学习合作。ML4Bio 研讨会侧重于分类,其设计围绕三个原则:(i)强调准备而不是流利或专业,(ii)需要最小的编码和数学背景,(iii)要求低时间投入。它结合了主动学习方法和定制的开源软件,允许参与者探索机器学习工作流程。在多次会议以改进研讨会设计之后,我们对三个研讨会进行了研究。尽管在识别机器学习工作流程中的微妙方法缺陷方面存在一些混淆,但参与者普遍报告说,研讨会达到了他们的目标,为他们提供了有价值的技能和知识,并大大增加了他们对使用机器学习进行研究的信心。ML4Bio 是生物研究人员的教育工具,它的创建和评估为针对不同领域的活跃研究人员定制教育资源提供了有价值的见解。

可用性和实施情况

研讨会材料可在 https://github.com/carpentries-incubator/ml4bio-workshop 获得,ml4bio 软件可在 https://github.com/gitter-lab/ml4bio 获得。

补充信息

补充数据可在《生物信息学》在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/f4c1b30bd6e2/btac233f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/e26d0133a67e/btac233f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/cb45c4bad132/btac233f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/4d05368f63fd/btac233f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/8d51c72d68e6/btac233f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/f4c1b30bd6e2/btac233f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/e26d0133a67e/btac233f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/cb45c4bad132/btac233f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/4d05368f63fd/btac233f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/8d51c72d68e6/btac233f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206b/9236579/f4c1b30bd6e2/btac233f5.jpg

相似文献

1
An approachable, flexible and practical machine learning workshop for biologists.一门面向生物学家的、易于理解、灵活且实用的机器学习工作坊。
Bioinformatics. 2022 Jun 24;38(Suppl 1):i10-i18. doi: 10.1093/bioinformatics/btac233.
2
BioKEEN: a library for learning and evaluating biological knowledge graph embeddings.BioKEEN:用于学习和评估生物知识图嵌入的库。
Bioinformatics. 2019 Sep 15;35(18):3538-3540. doi: 10.1093/bioinformatics/btz117.
3
Container-based bioinformatics with Pachyderm.基于容器的生物信息学与 Pachyderm。
Bioinformatics. 2019 Mar 1;35(5):839-846. doi: 10.1093/bioinformatics/bty699.
4
PyMethylProcess-convenient high-throughput preprocessing workflow for DNA methylation data.PyMethylProcess-适用于 DNA 甲基化数据的高通量预处理工作流。
Bioinformatics. 2019 Dec 15;35(24):5379-5381. doi: 10.1093/bioinformatics/btz594.
5
Automated workflow composition in mass spectrometry-based proteomics.基于质谱的蛋白质组学中的自动化工作流组合。
Bioinformatics. 2019 Feb 15;35(4):656-664. doi: 10.1093/bioinformatics/bty646.
6
Aliro: an automated machine learning tool leveraging large language models.阿里罗:一个利用大型语言模型的自动化机器学习工具。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad606.
7
RMTL: an R library for multi-task learning.RMTL:一个用于多任务学习的 R 库。
Bioinformatics. 2019 May 15;35(10):1797-1798. doi: 10.1093/bioinformatics/bty831.
8
PeakBot: machine-learning-based chromatographic peak picking.PeakBot:基于机器学习的色谱峰提取。
Bioinformatics. 2022 Jun 27;38(13):3422-3428. doi: 10.1093/bioinformatics/btac344.
9
PCirc: random forest-based plant circRNA identification software.PCirc:基于随机森林的植物 circRNA 鉴定软件。
BMC Bioinformatics. 2021 Jan 6;22(1):10. doi: 10.1186/s12859-020-03944-1.
10
PHOTONAI-A Python API for rapid machine learning model development.PHOTONAI-用于快速机器学习模型开发的 Python API。
PLoS One. 2021 Jul 21;16(7):e0254062. doi: 10.1371/journal.pone.0254062. eCollection 2021.

引用本文的文献

1
Meeting the challenge of genomic analysis: a collaboratively developed workshop for pangenomics and topological data analysis.迎接基因组分析的挑战:一个关于泛基因组学和拓扑数据分析的合作开发研讨会。
Bioinform Adv. 2024 Sep 27;4(1):vbae139. doi: 10.1093/bioadv/vbae139. eCollection 2024.
2
Machine Learning Methods in Protein-Protein Docking.机器学习方法在蛋白质-蛋白质对接中的应用。
Methods Mol Biol. 2024;2780:107-126. doi: 10.1007/978-1-0716-3985-6_7.

本文引用的文献

1
Navigating the pitfalls of applying machine learning in genomics.在基因组学中应用机器学习的陷阱。
Nat Rev Genet. 2022 Mar;23(3):169-181. doi: 10.1038/s41576-021-00434-9. Epub 2021 Nov 26.
2
A guide to machine learning for biologists.生物学机器学习指南。
Nat Rev Mol Cell Biol. 2022 Jan;23(1):40-55. doi: 10.1038/s41580-021-00407-0. Epub 2021 Sep 13.
3
DOME: recommendations for supervised machine learning validation in biology.DOME:生物学中监督式机器学习验证的建议
Nat Methods. 2021 Oct;18(10):1122-1127. doi: 10.1038/s41592-021-01205-4.
4
Supervised machine learning tools: a tutorial for clinicians.监督机器学习工具:临床医生教程。
J Neural Eng. 2020 Nov 19;17(6). doi: 10.1088/1741-2552/abbff2.
5
How to Read Articles That Use Machine Learning: Users' Guides to the Medical Literature.如何阅读使用机器学习的文章:医学文献的用户指南。
JAMA. 2019 Nov 12;322(18):1806-1816. doi: 10.1001/jama.2019.16489.
6
Classifying T cell activity in autofluorescence intensity images with convolutional neural networks.使用卷积神经网络对自发荧光强度图像中的T细胞活性进行分类。
J Biophotonics. 2020 Mar;13(3):e201960050. doi: 10.1002/jbio.201960050. Epub 2019 Dec 15.
7
Setting the standards for machine learning in biology.设定生物学中机器学习的标准。
Nat Rev Mol Cell Biol. 2019 Nov;20(11):659-660. doi: 10.1038/s41580-019-0176-5.
8
The development and application of bioinformatics core competencies to improve bioinformatics training and education.生物信息学核心能力的发展与应用,以提高生物信息学培训和教育。
PLoS Comput Biol. 2018 Feb 1;14(2):e1005772. doi: 10.1371/journal.pcbi.1005772. eCollection 2018 Feb.
9
Perceptions of scientific research literature and strategies for reading papers depend on academic career stage.对科研文献的认知以及阅读论文的策略取决于学术生涯阶段。
PLoS One. 2017 Dec 28;12(12):e0189753. doi: 10.1371/journal.pone.0189753. eCollection 2017.
10
Unmet needs for analyzing biological big data: A survey of 704 NSF principal investigators.分析生物大数据的未满足需求:对704名美国国家科学基金会首席研究员的调查
PLoS Comput Biol. 2017 Oct 19;13(10):e1005755. doi: 10.1371/journal.pcbi.1005755. eCollection 2017 Oct.