Harish Keerthi B, Price W Nicholson, Aphinyanaphongs Yindalon
Grossman School of Medicine, New York University, New York, NY, United States.
Law School, University of Michigan, Ann Arbor, MI, United States.
JMIR Form Res. 2022 Apr 11;6(4):e33970. doi: 10.2196/33970.
Machine learning applications promise to augment clinical capabilities and at least 64 models have already been approved by the US Food and Drug Administration. These tools are developed, shared, and used in an environment in which regulations and market forces remain immature. An important consideration when evaluating this environment is the introduction of open-source solutions in which innovations are freely shared; such solutions have long been a facet of digital culture. We discuss the feasibility and implications of open-source machine learning in a health care infrastructure built upon proprietary information. The decreased cost of development as compared to drugs and devices, a longstanding culture of open-source products in other industries, and the beginnings of machine learning-friendly regulatory pathways together allow for the development and deployment of open-source machine learning models. Such tools have distinct advantages including enhanced product integrity, customizability, and lower cost, leading to increased access. However, significant questions regarding engineering concerns about implementation infrastructure and model safety, a lack of incentives from intellectual property protection, and nebulous liability rules significantly complicate the ability to develop such open-source models. Ultimately, the reconciliation of open-source machine learning and the proprietary information-driven health care environment requires that policymakers, regulators, and health care organizations actively craft a conducive market in which innovative developers will continue to both work and collaborate.
机器学习应用有望增强临床能力,并且至少已有64种模型获得了美国食品药品监督管理局的批准。这些工具是在监管和市场力量仍不成熟的环境中开发、共享和使用的。评估这种环境时的一个重要考虑因素是引入开源解决方案,在这种方案中创新可以自由共享;此类解决方案长期以来一直是数字文化的一个方面。我们讨论了在基于专有信息构建的医疗保健基础设施中开源机器学习的可行性及其影响。与药品和设备相比,开发成本降低、其他行业长期存在的开源产品文化以及对机器学习友好的监管途径的开端,共同促成了开源机器学习模型的开发和部署。此类工具具有明显优势,包括增强产品完整性、可定制性和更低成本,从而增加了可及性。然而,关于实施基础设施和模型安全的工程问题、缺乏知识产权保护的激励措施以及模糊的责任规则等重大问题,显著增加了开发此类开源模型的难度。最终,要协调开源机器学习与专有信息驱动的医疗保健环境,政策制定者、监管机构和医疗保健组织需要积极营造一个有利的市场,让创新开发者能够继续开展工作并进行合作。