Suppr超能文献

集体知识:将研究项目组织为可重复使用组件和具有通用接口的可移植工作流程的数据库。

Collective knowledge: organizing research projects as a database of reusable components and portable workflows with common interfaces.

机构信息

cTuning foundation and cKnowledge SAS.

出版信息

Philos Trans A Math Phys Eng Sci. 2021 May 17;379(2197):20200211. doi: 10.1098/rsta.2020.0211. Epub 2021 Mar 29.

Abstract

This article provides the motivation and overview of the Collective Knowledge Framework (CK or cKnowledge). The CK concept is to decompose research projects into reusable components that encapsulate research artifacts and provide unified application programming interfaces (APIs), command-line interfaces (CLIs), meta descriptions and common automation actions for related artifacts. The CK framework is used to organize and manage research projects as a database of such components. Inspired by the USB 'plug and play' approach for hardware, CK also helps to assemble portable workflows that can automatically plug in compatible components from different users and vendors (models, datasets, frameworks, compilers, tools). Such workflows can build and run algorithms on different platforms and environments in a unified way using the customizable CK program pipeline with software detection plugins and the automatic installation of missing packages. This article presents a number of industrial projects in which the modular CK approach was successfully validated in order to automate benchmarking, auto-tuning and co-design of efficient software and hardware for machine learning and artificial intelligence in terms of speed, accuracy, energy, size and various costs. The CK framework also helped to automate the artifact evaluation process at several computer science conferences as well as to make it easier to reproduce, compare and reuse research techniques from published papers, deploy them in production, and automatically adapt them to continuously changing datasets, models and systems. The long-term goal is to accelerate innovation by connecting researchers and practitioners to share and reuse all their knowledge, best practices, artifacts, workflows and experimental results in a common, portable and reproducible format at https://cKnowledge.io/. This article is part of the theme issue 'Reliability and reproducibility in computational science: implementing verification, validation and uncertainty quantification '.

摘要

本文提供了 Collective Knowledge Framework(CK 或 cKnowledge)的动机和概述。CK 的概念是将研究项目分解为可重用的组件,这些组件封装了研究成果,并提供了统一的应用程序编程接口(API)、命令行接口(CLI)、元描述和相关成果的常用自动化操作。CK 框架用于组织和管理研究项目,将这些组件作为数据库进行管理。受硬件的 USB“即插即用”方法的启发,CK 还有助于组装可移植的工作流程,这些工作流程可以自动插入来自不同用户和供应商的兼容组件(模型、数据集、框架、编译器、工具)。这样的工作流程可以使用可定制的 CK 程序管道和软件检测插件,以及自动安装缺少的软件包,以统一的方式在不同的平台和环境中构建和运行算法。本文介绍了一些工业项目,其中模块化的 CK 方法在自动化基准测试、自动调优和协同设计方面得到了成功验证,从而实现了机器学习和人工智能在速度、准确性、能源、大小和各种成本方面的高效软件和硬件。CK 框架还帮助在多个计算机科学会议上自动化了成果评估过程,使人们更容易从已发表的论文中复制、比较和重用研究技术,将它们部署到生产中,并自动适应不断变化的数据集、模型和系统。长期目标是通过将研究人员和实践者联系起来,共享和重用他们所有的知识、最佳实践、成果、工作流程和实验结果,以加速创新,以一种通用、可移植和可重复的格式在 https://cKnowledge.io/ 上进行。本文是“计算科学中的可靠性和可重复性:实施验证、确认和不确定性量化”主题特刊的一部分。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验