School of Information and Communication Engineering, Communication University of China, Beijing 100024, China.
Sensors (Basel). 2024 Aug 21;24(16):5401. doi: 10.3390/s24165401.
Traditional broadcasting methods often result in fatigue and decision-making errors when dealing with complex and diverse live content. Current research on intelligent broadcasting primarily relies on preset rules and model-based decisions, which have limited capabilities for understanding emotional dynamics. To address these issues, this study proposed and developed an emotion-driven intelligent broadcasting system, EmotionCast, to enhance the efficiency of camera switching during live broadcasts through decisions based on multimodal emotion recognition technology. Initially, the system employs sensing technologies to collect real-time video and audio data from multiple cameras, utilizing deep learning algorithms to analyze facial expressions and vocal tone cues for emotion detection. Subsequently, the visual, audio, and textual analyses were integrated to generate an emotional score for each camera. Finally, the score for each camera shot at the current time point was calculated by combining the current emotion score with the optimal scores from the preceding time window. This approach ensured optimal camera switching, thereby enabling swift responses to emotional changes. EmotionCast can be applied in various sensing environments such as sports events, concerts, and large-scale performances. The experimental results demonstrate that EmotionCast excels in switching accuracy, emotional resonance, and audience satisfaction, significantly enhancing emotional engagement compared to traditional broadcasting methods.
传统的广播方法在处理复杂多样的实时内容时,往往会导致疲劳和决策错误。目前,智能广播的研究主要依赖于预设规则和基于模型的决策,这些方法在理解情感动态方面的能力有限。为了解决这些问题,本研究提出并开发了一种情感驱动的智能广播系统 EmotionCast,通过基于多模态情感识别技术的决策,提高直播期间摄像机切换的效率。首先,该系统采用传感技术从多个摄像机收集实时视频和音频数据,利用深度学习算法分析面部表情和语调线索进行情感检测。然后,将视觉、音频和文本分析进行集成,为每个摄像机生成一个情感得分。最后,通过将当前情感得分与前一时间窗口的最佳得分相结合,计算当前时刻每个摄像机拍摄的得分。这种方法确保了最佳的摄像机切换,从而能够快速响应情感变化。EmotionCast 可以应用于各种传感环境,如体育赛事、音乐会和大型表演。实验结果表明,EmotionCast 在切换准确性、情感共鸣和观众满意度方面表现出色,与传统广播方法相比,显著提高了情感参与度。