Heterogeneous Distributed Computing-Based AI Video Generation: Real-Time Load Balancing and Intelligent Scheduling in New Media Art

Authors

DOI:

https://doi.org/10.4108/eetsis.10614

Keywords:

Heterogeneous distributed computing, AI video generation, reinforcement learning scheduling, multimodal co-optimizaiton, real-time new media art

Abstract

INTRODUCTION: The rapid proliferation of Generative AI (AIGC) in new media art has intensified the need for real-time, distributed video generation with stable performance and low latency. Conventional centralized rendering and static scheduling frameworks often encounter load imbalance and communication bottlenecks in heterogeneous environments, resulting in degraded visual coherence and responsiveness. To address these challenges, this study develops a unified and adaptive distributed framework, termed H-RLSCO (Heterogeneity-aware Reinforcement Learning and Scheduling Co-Optimization), designed to enhance both computational efficiency and artistic consistency in large-scale AI video generation. The framework integrates three complementary modules: a Heterogeneity Perception Module (HPM) for node profiling and adaptive task partitioning, a Reinforcement Learning Scheduling Controller (RLSC) for dynamic task migration, and a Generation-Scheduling Co-Optimization (GSCO) mechanism that incorporates content-complexity feedback into scheduling decisions to maintain multimodal synchronization. Experiments on the ArtScene-4K and StageSyn-Real datasets demonstrate that H-RLSCO reduces average latency by 14.4% and decreases Fréchet Video Distance by approximately 12.5% compared with the RL-Scheduler baseline, while limiting performance fluctuation to within 3% under multi-noise conditions (p < 0.01). These gains remain consistent across varying bandwidths and node capabilities on a five-node heterogeneous cluster, confirming robust real-time behavior and balanced utilization. Nevertheless, the scalability of H-RLSCO remains constrained when applied to large-scale node clusters, suggesting future work should explore multi-agent reinforcement learning and lightweight diffusion-Transformer architectures to enhance efficiency and expand applicability.

References

[1] Bianchini S, Muller M, Pelletier P. Drivers and barriers of AI adoption and use in scientific research. Technol Forecast Soc Change. 2025;220: 124303.

[2] Choudhury S, Kurkure P, Banerjee B. Improving visual grounding in remote sensing images with adaptive modality guidance. ISPRS J Photogramm Remote Sens. 2025; 224:42–58.

[3] Wei SN, Liu YY, Yang Y. Architectural framework for multi-modal video generation via fine-tuned stable diffusion models. In: Proceedings of the 2024 3rd International Conference on Artificial Intelligence and Intelligent Information Processing; 2024 Oct. p. 211–6.

[4] Leria E, Makitalo M, Jaaskelainen P, Sjostrom M, Zhang T. Interactive multi-GPU light field path tracing using multi-source spatial reprojection. In: Proceedings of the 30th ACM Symposium on Virtual Reality Software and Technology; 2024 Oct. p. 1–11.

[5] Mohammadabadi SMS, Entezami M, Moghaddam AK, Orangian M, Nejadshamsi S. Generative artificial intelligence for distributed learning to enhance smart grid communication. Int J Intell Netw. 2024;5: 267–74.

[6] Vadisetty R, Polamarasetti A. Generative AI-driven distributed cybersecurity frameworks for AI-integrated global big data systems. In: Proceedings of the 2024 International Conference on Emerging Technologies for Innovation and Sustainability (EmergIN); 2024 Dec. p. 595–600. IEEE.

[7] Bu T, Huang Z, Zhang K, Wang Y, Song H, Zhou J, et al. Task scheduling in the Internet of Things: challenges, solutions, and future trends. Clust Comput. 2024;27(1):1017–46.

[8] Acheampong A, Zhang Y, Xu X, Kumah D. A review of the current task offloading algorithms, strategies and approaches in edge computing systems. Comput Model Eng Sci. 2023;134(1):35.

[9] Hilal W, Gadsden SA, Yawney J. Cognitive dynamic systems: a review of theory, applications, and recent advances. Proc IEEE. 2023;111(6):575–622.

[10] El Saddik A, Ahmad J, Khan M, Abouzahir S, Gueaieb W. Unleashing creativity in the metaverse: generative AI and multimodal content. ACM Trans Multimedia Comput Commun Appl. 2025;21(7):1–43.

[11] Peng Z, Ye X, Zhao W, Liu T, Sun H, Li B, et al. 3D multi-frame fusion for video stabilization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2024. p. 7507–16.

[12] Xu P. Research on video fusion algorithm of rail transit station based on two-stream network. In: Proceedings of the 2023 International Conference on Electronic Devices and Computer Science (ICEDCS); 2023 Sep. p. 287–91. IEEE.

[13] Dasan A, Darshan R, Kumar A, Minu MS, Arthy J. Android-based action recognition with 3D CNN and UCF101 dataset. EPJ Web Conf. 2025; 328:01001.

[14] Sun M. Research on real-scene video face restoration methods based on time consistency and multimodal fusion. J Comput Electron Inf Manag. 2025;18(1):40–6.

[15] An K, Zhang J. Intelligent optimization using multi-objective genetic algorithms in new media art design. Comput Aided Des Appl. 2024; 21:249–63.

[16] Hatami M, Qu Q, Chen Y, Kholidy H, Blasch E, Ardiles-Cruz E. A survey of the real-time metaverse: challenges and opportunities. Future Internet. 2024;16(10):379.

[17] Tang X, Yang K, Wang H, Wu J, Qin Y, Yu W, et al. Prediction uncertainty-aware decision-making for autonomous vehicles. IEEE Trans Intell Veh. 2022;7(4):849–62.

[18] Wang J, Rao S, Liu Y, Sharma PK, Hu J. Load balancing for heterogeneous traffic in datacenter networks. J Netw Comput Appl. 2023;217: 103692.

[19] Le DPC, Wang D, Le VT. A comprehensive survey of recent transformers in image, video and diffusion models. Comput Mater Contin. 2024;80(1).

[20] Chen S, Xu M, Ren J, Cong Y, He S, Xie Y, et al. Gentron: diffusion transformers for image and video generation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2024. p. 6441–51.

[21] Xu Z, Zhang J, Liew JH, Yan H, Liu JW, Zhang C, et al. Magicanimate: temporally consistent human image animation using diffusion model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); 2024. p. 1481–90.

[22] Pozveh AJ, Shahhoseini HS, Khabareh E. Resource management in edge clouds: latency-aware approaches for big data analysis. In: Resource Management in Distributed Systems. Singapore: Springer Nature Singapore; 2024. p. 107–32.

[23] Paulachan J, Onwuchekwa D, Obermaisser R. Task scheduling in multi-cloud environments: a graph partitioning approach enhanced by nested genetic algorithms. In: Proceedings of the 2024 11th International Conference on Internet of Things, Systems, Management and Security (IOTSMS); 2024 Sep. p. 177–84. IEEE.

[24] Sharma G, Khare R, Kulkarni N, Pagare S, Tiwari V. Design of an iterative AI-driven latency prediction and QoS-aware task scheduling in mobile edge computing: a federated and reinforcement learning process. EPJ Web Conf. 2025;328:01071.

[25] Yin W, Yin H, Baraka K, Kragic D, Bjorkman M. Multimodal dance style transfer. Mach Vis Appl. 2023;34(4):48.

[26] Xu Z, Zhang Y, Yang S, Li R, Li X. Chain of generation: multi-modal gesture synthesis via cascaded conditional control. In: Proceedings of the AAAI Conference on Artificial Intelligence; 2024 Mar. Vol. 38(6). p. 6387–95.

[27] Cheng Y, Cao Z, Zhang X, Cao Q, Zhang D. Multi-objective dynamic task scheduling optimization algorithm based on deep reinforcement learning. J Supercomput. 2024;80(5).

[28] Berahmand K, Bahadori S, Abadeh MN, Li Y, Xu Y. SDAC-DA: semi-supervised deep attributed clustering using dual autoencoder. IEEE Trans Knowl Data Eng. 2024;36(11):6989–7002.

[29] Tang KHPY, Ghanem MC, Gasiorowski P, Vassilev V, Ouazzane K. Synchronisation, optimisation, and adaptation of machine learning techniques for computer vision in cyber-physical systems: a comprehensive analysis. IET Cyber-Phys Syst Theory Appl. 2025;1–43.

[30] Zhang Y, Zhao K, Yang Y, Zhou Z. Real-time service migration in edge networks: a survey. J Sens Actuator Netw. 2025;14(4):79.

Downloads

Published

02-12-2025

Issue

Section

Scheduling optimization and load balancing in scalable distributed systems

How to Cite

1.
Fu Q. Heterogeneous Distributed Computing-Based AI Video Generation: Real-Time Load Balancing and Intelligent Scheduling in New Media Art. EAI Endorsed Scal Inf Syst [Internet]. 2025 Dec. 2 [cited 2025 Dec. 4];12(5). Available from: https://publications.eai.eu/index.php/sis/article/view/10614