Ferrell Brian J
Center for Community Engagement and Impact, Virginia Commonwealth University, Richmond, VA, United States.
JMIR Form Res. 2023 Feb 7;7:e41137. doi: 10.2196/41137.
Community-engaged research (CEnR) involves institutions of higher education collaborating with organizations in their communities to exchange resources and knowledge to benefit a community's well-being. While community engagement is a critical aspect of a university's mission, tracking and reporting CEnR metrics can be challenging, particularly in terms of external community relations and federally funded research programs. In this study, we aimed to develop a method for classifying CEnR studies that have been submitted to our university's institutional review board (IRB) to capture the level of community involvement in research studies. Tracking studies in which communities are "highly engaged" enables institutions to obtain a more comprehensive understanding of the prevalence of CEnR.
We aimed to develop an updated experiment to classify CEnR and capture the distinct levels of involvement that a community partner has in the direction of a research study. To achieve this goal, we used a deep learning-based approach and evaluated the effectiveness of fine-tuning strategies on transformer-based models.
In this study, we used fine-tuning techniques such as discriminative learning rates and freezing layers to train and test 135 slightly modified classification models based on 3 transformer-based architectures: BERT (Bidirectional Encoder Representations from Transformers), Bio+ClinicalBERT, and XLM-RoBERTa. For the discriminative learning rate technique, we applied different learning rates to different layers of the model, with the aim of providing higher learning rates to layers that are more specialized to the task at hand. For the freezing layers technique, we compared models with different levels of layer freezing, starting with all layers frozen and gradually unfreezing different layer groups. We evaluated the performance of the trained models using a holdout data set to assess their generalizability.
Of the models evaluated, Bio+ClinicalBERT performed particularly well, achieving an accuracy of 73.08% and an F-score of 62.94% on the holdout data set. All the models trained in this study outperformed our previous models by 10%-23% in terms of both F-score and accuracy.
Our findings suggest that transfer learning is a viable method for tracking CEnR studies and provide evidence that the use of fine-tuning strategies significantly improves transformer-based models. Our study also presents a tool for categorizing the type and volume of community engagement in research, which may be useful in addressing the challenges associated with reporting CEnR metrics.
社区参与研究(CEnR)涉及高等教育机构与所在社区的组织合作,以交换资源和知识,从而促进社区福祉。虽然社区参与是大学使命的关键方面,但跟踪和报告CEnR指标可能具有挑战性,特别是在外部社区关系和联邦资助的研究项目方面。在本研究中,我们旨在开发一种方法,用于对提交给我校机构审查委员会(IRB)的CEnR研究进行分类,以了解社区在研究中的参与程度。跟踪社区“高度参与”的研究能够使机构更全面地了解CEnR的普及情况。
我们旨在开发一个更新的实验,对CEnR进行分类,并了解社区合作伙伴在研究方向上的不同参与程度。为实现这一目标,我们采用了基于深度学习的方法,并评估了基于Transformer的模型的微调策略的有效性。
在本研究中,我们使用了判别学习率和冻结层等微调技术,基于3种基于Transformer的架构(BERT(来自Transformer的双向编码器表示)、Bio+ClinicalBERT和XLM-RoBERTa)训练和测试了135个略有修改的分类模型。对于判别学习率技术,我们对模型的不同层应用不同的学习率,目的是为更专门处理手头任务的层提供更高的学习率。对于冻结层技术,我们比较了不同层冻结程度的模型,从所有层冻结开始,逐渐解冻不同的层组。我们使用保留数据集评估训练模型的性能,以评估其泛化能力。
在评估的模型中,Bio+ClinicalBERT表现特别出色,在保留数据集上的准确率达到73.08%,F值达到62.94%。本研究中训练的所有模型在F值和准确率方面均比我们之前的模型高出10%-23%。
我们的研究结果表明,迁移学习是跟踪CEnR研究的可行方法,并提供了证据表明使用微调策略可显著改进基于Transformer的模型。我们的研究还提出了一种工具,用于对研究中社区参与的类型和程度进行分类,这可能有助于应对与报告CEnR指标相关的挑战。