Garcia-Milian Rolando, Hersey Denise, Vukmirovic Milica, Duprilot Fanny
Bioinformatics Support Program, Research and Education Services, Cushing/Whitney Medical Library, Yale University, New Haven, CT, United States of America.
Science Libraries, Lewis Science Library, Princeton University, Princeton, NJ, United States of America.
PeerJ. 2018 Sep 11;6:e5553. doi: 10.7717/peerj.5553. eCollection 2018.
High-throughput technologies are rapidly generating large amounts of diverse omics data. Although this offers a great opportunity, it also poses great challenges as data analysis becomes more complex. The purpose of this study was to identify the main challenges researchers face in analyzing data, and how academic libraries can support them in this endeavor.
A multimodal needs assessment analysis combined an online survey sent to 860 Yale-affiliated researchers (176 responded) and 15 in-depth one-on-one semi-structured interviews. Interviews were recorded, transcribed, and analyzed using NVivo 10 software according to the thematic analysis approach.
The survey response rate was 20%. Most respondents (78%) identified lack of adequate data analysis training (e.g., R, Python) as a main challenge, in addition to not having the proper database or software (54%) to expedite analysis. Two main themes emerged from the interviews: personnel and training needs. Researchers feel they could improve data analyses practices by having better access to the appropriate bioinformatics expertise, and/or training in data analyses tools. They also reported lack of time to acquire expertise in using bioinformatics tools and poor understanding of the resources available to facilitate analysis.
The main challenges identified by our study are: lack of adequate training for data analysis (including need to learn scripting language), need for more personnel at the University to provide data analysis and training, and inadequate communication between bioinformaticians and researchers. The authors identified the positive impact of medical and/or science libraries by establishing bioinformatics support to researchers.
高通量技术正在迅速产生大量多样的组学数据。尽管这提供了巨大的机遇,但随着数据分析变得更加复杂,也带来了巨大挑战。本研究的目的是确定研究人员在数据分析中面临的主要挑战,以及学术图书馆如何在这一努力中支持他们。
多模式需求评估分析结合了向860名耶鲁大学附属研究人员发送的在线调查(176人回复)和15次深入的一对一的半结构化访谈。访谈进行了录音、转录,并使用NVivo 10软件根据主题分析方法进行了分析。
调查回复率为20%。大多数受访者(78%)认为缺乏足够的数据分析培训(如R、Python)是主要挑战,此外还没有合适的数据库或软件(54%)来加快分析。访谈中出现了两个主要主题:人员和培训需求。研究人员认为,通过更好地获得适当的生物信息学专业知识和/或数据分析工具培训,他们可以改进数据分析实践。他们还报告说,缺乏时间来掌握使用生物信息学工具的专业知识,并且对有助于分析的可用资源了解不足。
我们的研究确定的主要挑战是:缺乏足够的数据分析培训(包括需要学习脚本语言),大学需要更多人员提供数据分析和培训,以及生物信息学家与研究人员之间沟通不足。作者确定了医学和/或科学图书馆通过为研究人员建立生物信息学支持所产生的积极影响。