A Study on Real-time Early Warning and Adaptive Intervention for Online Learning Burnout Using Multimodal Temporal Feature Fusion

Authors

DOI:

https://doi.org/10.4108/eetsis.10698

Keywords:

Online Learning Burnout, Multi-modal Fusion, Affective Computing, Real-time Early Warning, Adaptive Intervention

Abstract

 INTRODUCTION: The lack of physical presence in online learning makes it difficult for teachers to perceive students' cognitive and emotional states in real time, with learning burnout being a particularly prominent issue. Existing research primarily relies on single-modality data or lagged learning analysis, making it difficult to achieve precise and timely burnout early warning and intervention.

OBJECTIVES: This paper proposes an online learning burnout early warning model based on multi-modal temporal feature fusion and designs a hierarchical adaptive intervention mechanism.

METHODS: First, utilizing lightweight convolutional neural networks and temporal encoders, facial expression features and body posture features are extracted in real-time from the video stream, respectively. Second, a multi-modal temporal fusion module based on attention mechanisms is designed to model the coordinated temporal evolution of facial expressions and body postures, enabling precise identification of fatigue states such as "confusion", "fatigued" and "bored".  Finally, a hybrid decision model combining rule-based and reinforcement learning approaches is developed. This model dynamically triggers personalized intervention strategies—such as content adjustment, suggested breaks, and interactive Q&A sessions—based on the type, intensity, and duration of fatigue.

RESULTS: Experiments conducted on a self-built authentic online learning dataset demonstrate that this system achieves an F1 score of 0.91 in burnout state recognition, which significantly outperforms unimodal approaches. Furthermore, user studies validate the effectiveness and high acceptance of its intervention mechanism.

CONCLUSION: This study offers a viable technical and practical approach for enabling precise, timely detection and adaptive intervention of burnout in online learning environments.

 

References

[1] Fida A, Umer M, Saidani O, Hamdi M, Alnowaiseret K, Bisogni C, et al. Real time emotions recognition through facial expressions. Multimed Tools Appl. 2025;84(29):34753-34780.

[2] Dubbaka A, Gopalan A. Detecting learner engagement in MOOCs using automatic facial expression recognition. In: IEEE Global Engineering Education Conference; 2020; virtual. Piscataway: IEEE; 2020. p.447-456

[3] Mukhopadhyay M, Pal S, Nayyar A, Pramanik PKD, Dasgupta N, Choudhury P. Facial emotion detection to assess learner's state of mind in an online learning system. In: 5th International Conference on Intelligent Information Technology; 2020; Hanoi, Vietnam. New York: ACM; 2020. p.107-115.

[4] Gupta S, Kumar P, Tekchandani RK. Facial emotion recognition based real-time learner engagement detection system in online learning context using deep learning models. Multimed Tools Appl. 2023;82(8):11365-11394.

[5] Long DT. A facial expressions recognition method using residual network architecture for online learning evaluation. J Adv Comput Intell Intell Inform. 2021;25(6):953-962.

[6] Mohamad Nezami O, Dras M, Hamey L, Richards D, Wan S, Paris C. Automatic recognition of student engagement using deep learning and facial expression. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases; 2019; Cham, Switzerland. Cham: Springer International Publishing; 2019. p.273-289

[7] Wang TC, Qiu Q, Wang CT, Chen FL. LD-identify: network learning state recognition based on passive RFID. Control Decis. 2024;39(1):219-226.

[8] Cao XM, Zhang YH, Pan M, Zhu S, Yan HL. Research on learning engagement recognition method from the perspective of artificial intelligence: deep learning experimental analysis based on multimodal data fusion. Mod Educ Technol. 2025;35(3):5-12.

[9] Jiang J, Yu WT, Wang HY. Research on students' learning behaviors in smart classroom based on multimodal data. China Educ Inf. 2024;30(4):107-117.

[10] Wang YY, Zheng YH. Multimodal data fusion: core driving force to solve key problems in intelligent education. Mod Dist Educ Res. 2022;34(2):93-102.

[11] Wang LY, He YF, Tian JH. Construction and empirical study of multimodal data fusion model for online learning behavior. Chin J Dist Educ. 2020;6:23-51.

[12] Xue YF, Chen Z, Qiu YS, Zhu FQ. Intelligent recognition of cognitive styles in online learning based on multimodal data. Open Educ Res. 2024;30(5):112-120.

[13] Zhang S, Yin CY. Sequential multimodal sentiment analysis model based on multi-task learning. J Comput Appl. 2021;41(6):1631-1639.

[14] Jagadeesh M, Baranidharan B. Facial expression recognition of online learners from real-time videos using a novel deep learning model. Multimed Syst. 2022;28(6):2285-2305.

[15] Savchenko AV, Savchenko LV, Makarov I. Classifying emotions and engagement in online learning based on a single facial expression recognition neural network. IEEE Trans Affect Comput. 2022;13(4):2132-2143.

[16] Olivetti EC, Violante MG, Vezzetti E, Marcolin F, Eynard B. Engagement evaluation in a virtual learning environment via facial expression recognition and self-reports: A preliminary approach. Appl Sci. 2019;10(1):314.

[17] Aly M. Revolutionizing online education: Advanced facial expression recognition for real-time student progress tracking via deep learning model. Multimed Tools Appl. 2025;84(13):12575-12614.

[18] Ngo D, Nguyen A, Dang B, Ngo H. Facial expression recognition for examining emotional regulation in synchronous online collaborative learning. Int J Artif Intell Educ. 2024;34(3):650-669.

[19] Pabba C, Kumar P. An intelligent system for monitoring students' engagement in large classroom teaching through facial expression recognition. Expert Syst. 2022;39(1):e12839.

[20] Sumalakshmi CH, Vasuki P. Fused deep learning based facial expression recognition of students in online learning mode. Concurr Comput Pract Exp. 2022;34(21):e7137.

[21] Gupta S, Kumar P, Tekchandani RK. Facial emotion recognition based real-time learner engagement detection system in online learning context using deep learning models. Multimed Tools Appl. 2023;82(8):11365-11394.

Downloads

Published

26-05-2026

Issue

Section

Scheduling optimization and load balancing in scalable distributed systems

How to Cite

1.
Liu K, Gu X, Chen J, Shao W. A Study on Real-time Early Warning and Adaptive Intervention for Online Learning Burnout Using Multimodal Temporal Feature Fusion. EAI Endorsed Scal Inf Syst [Internet]. 2026 May 26 [cited 2026 May 26];12(10). Available from: https://publications.eai.eu/index.php/sis/article/view/10698