Ahn Sangzin
Department of Pharmacology and Pharmacogenomics Research Center, Inje University College of Medicine, Busan 47392, Korea.
Center for Personalized Precision Medicine of Tuberculosis, Inje University College of Medicine, Busan 47392, Korea.
Transl Clin Pharmacol. 2024 Jun;32(2):73-82. doi: 10.12793/tcp.2024.32.e8. Epub 2024 May 29.
Large language models (LLMs) have emerged as a powerful tool for biomedical researchers, demonstrating remarkable capabilities in understanding and generating human-like text. ChatGPT with its Code Interpreter functionality, an LLM connected with the ability to write and execute code, streamlines data analysis workflows by enabling natural language interactions. Using materials from a previously published tutorial, similar analyses can be performed through conversational interactions with the chatbot, covering data loading and exploration, model development and comparison, permutation feature importance, partial dependence plots, and additional analyses and recommendations. The findings highlight the significant potential of LLMs in assisting researchers with data analysis tasks, allowing them to focus on higher-level aspects of their work. However, there are limitations and potential concerns associated with the use of LLMs, such as the importance of critical thinking, privacy, security, and equitable access to these tools. As LLMs continue to improve and integrate with available tools, data science may experience a transformation similar to the shift from manual to automatic transmission in driving. The advancements in LLMs call for considering the future directions of data science and its education, ensuring that the benefits of these powerful tools are utilized with proper human supervision and responsibility.
大语言模型(LLMs)已成为生物医学研究人员的强大工具,在理解和生成类人文本方面展现出卓越能力。具有代码解释器功能的ChatGPT是一种与编写和执行代码能力相关联的大语言模型,它通过实现自然语言交互简化了数据分析工作流程。使用先前发布教程中的材料,通过与聊天机器人进行对话交互可以执行类似的分析,涵盖数据加载与探索、模型开发与比较、排列特征重要性、部分依赖图以及其他分析和建议。研究结果凸显了大语言模型在协助研究人员完成数据分析任务方面的巨大潜力,使他们能够专注于工作的更高层面。然而,使用大语言模型存在局限性和潜在问题,例如批判性思维的重要性、隐私、安全以及公平使用这些工具等。随着大语言模型不断改进并与现有工具集成,数据科学可能会经历一场类似于驾驶从手动挡向自动挡转变的变革。大语言模型的进步促使我们思考数据科学及其教育的未来方向,确保在适当的人工监督和责任下利用这些强大工具的益处。