School of Medical Humanities, Xinxiang Medical University, Xinxiang, 453000, China.
PLoS One. 2023 Oct 5;18(10):e0285331. doi: 10.1371/journal.pone.0285331. eCollection 2023.
Virtual Reality (VR) technology uses computers to simulate the real world comprehensively. VR has been widely used in college teaching and has a huge application prospect. To better apply computer-aided instruction technology in music teaching, a music teaching system based on VR technology is proposed. First, a virtual piano is developed using the HTC Vive kit and the Leap Motion sensor fixed on the helmet as the hardware platform, and using Unity3D, related SteamVR plug-ins, and Leap Motion plug-ins as software platforms. Then, a gesture recognition algorithm is proposed and implemented. Specifically, the Dual Channel Convolutional Neural Network (DCCNN) is adopted to collect the user's gesture command data. The dual-size convolution kernel is applied to extract the feature information in the image and the gesture command in the video, and then the DCCNN recognizes it. After the spatial and temporal information is extracted, Red-Green-Blue (RGB) color pattern images and optical flow images are input into the DCCNN. The prediction results are merged to obtain the final recognition result. The experimental results reveal that the recognition accuracy of DCCNN for the Curwen gesture is as high as 96%, and the recognition accuracy varies with different convolution kernels. By comparison, it is found that the recognition effect of DCCNN is affected by the size of the convolution kernel. Combining convolution kernels of size 5×5 and 7×7 can improve the recognition accuracy to 98%. The research results of this study can be used for music teaching piano and other VR products, with extensive popularization and application value.
虚拟现实(VR)技术使用计算机全面模拟真实世界。VR 已广泛应用于高校教学,并具有巨大的应用前景。为了更好地将计算机辅助教学技术应用于音乐教学,提出了一种基于 VR 技术的音乐教学系统。首先,使用 HTC Vive 套件和固定在头盔上的 Leap Motion 传感器作为硬件平台,开发了虚拟钢琴,使用 Unity3D、相关 SteamVR 插件和 Leap Motion 插件作为软件平台。然后,提出并实现了一种手势识别算法。具体来说,采用双通道卷积神经网络(DCCNN)采集用户的手势命令数据。双尺寸卷积核用于提取图像中的特征信息和视频中的手势命令,然后由 DCCNN 进行识别。提取时空信息后,将 RGB 彩色模式图像和光流图像输入到 DCCNN 中。合并预测结果,得到最终的识别结果。实验结果表明,DCCNN 对手势的识别准确率高达 96%,且识别准确率随不同卷积核而变化。通过比较发现,DCCNN 的识别效果受卷积核大小的影响。结合大小为 5×5 和 7×7 的卷积核可以将识别准确率提高到 98%。本研究的结果可用于音乐教学钢琴和其他 VR 产品,具有广泛的推广和应用价值。