Generative AI for Immersive Video: Recent Advances and Future Opportunities
Hu, Kaiyuan ; Jin, Yili ; Zhou, Hao ; Du, Linfeng ; Liu, Jiangchuan ; Liu, Xue
Department
Machine Learning
Type
Conference proceeding
Date
2025
Language
English
Abstract
Immersive video is a key component of eXtended Reality (XR), which aims to create simulated virtual or hybrid environments and let users interact with them. This technology allows users to experience immersive sensations that transcend time and space, while continuously providing training data for emerging technologies such as Embodied AI. Thanks to advances in sensing, computing, and display, recent years have witnessed many excellent works on XR and related hardware or software systems. However, challenges such as high creation cost, lack of immersion, and limited scalability hinder the practical application of immersive video services. Recently emerged generative artificial intelligence (GenAI) offers new insights for tackling these challenges. In this paper, we conduct a comprehensive survey of recent advances and future opportunities in how GenAI can benefit immersive video services. By introducing a systematic taxonomy, we meticulously classify the pertinent techniques and applications into three well-defined categories aligned with the pipeline of immersive video services: content creation, network delivery, and client-side display. This categorization enables a structured exploration of the diverse ways in which GenAI can benefit immersive video services, providing a framework for a more comprehensive understanding and evaluation of these technologies. To the best of our knowledge, this work is the first systematic survey of GenAI in XR settings, laying a foundation for future research in this interdisciplinary domain.
Citation
K. Hu, Y. Jin, H. Zhou, L. Du, J. Liu, and X. Liu, “Generative AI for Immersive Video: Recent Advances and Future Opportunities,” Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, vol. 2, pp. 10464–10472, Sep. 2025, doi: 10.24963/IJCAI.2025/1162
Source
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence
Conference
34th International Joint Conference on Artificial Intelligence (IJCAI-25)
Keywords
3D computer vision, Image and video synthesis and generation, Representation learning, Interactive entertainment
Publisher
International Joint Conferences on Artificial Intelligence
