Item

NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes Without References

Qu, Qiang
Shen, Yiran
Chen, Xiaoming
Chung, Yuk Ying
Cai, Weidong
Liu, Tongliang
Supervisor
Department
Machine Learning
Embargo End Date
Type
Journal article
Date
License
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Neural View Synthesis (NVS), such as NeRF and 3D Gaussian Splatting, effectively creates photorealistic scenes from sparse viewpoints, typically evaluated by quality assessment methods like PSNR, SSIM, and LPIPS. However, these full-reference methods, which compare synthesized views to reference views, may not fully capture the perceptual quality of neurally synthesized scenes (NSS), particularly due to the limited availability of dense reference views. Furthermore, the challenges in acquiring human perceptual labels hinder the creation of extensive labeled datasets, risking model overfitting and reduced generalizability. To address these issues, we propose NVS-SQA, a NSS quality assessment method to learn no-reference quality representations through self-supervision without reliance on human labels. Traditional self-supervised learning predominantly relies on the "same instance, similar representation" assumption and extensive datasets. However, given that these conditions do not apply in NSS quality assessment, we employ heuristic cues and quality scores as learning objectives, along with a specialized contrastive pair preparation process to improve the effectiveness and efficiency of learning. The results show that NVS-SQA outperforms 17 no-reference methods by a large margin (i.e., on average 109.5% in SRCC, 98.6% in PLCC, and 91.5% in KRCC over the second best) and even exceeds 16 full-reference methods across all evaluation metrics (i.e., 22.9% in SRCC, 19.1% in PLCC, and 18.6% in KRCC over the second best).
Citation
Q. Qu, Y. Shen, X. Chen, Y.Y. Chung, W. Cai, T. Liu, "NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes Without References," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 48, no. 3, pp. 2265-2281, 2025, https://doi.org/10.1109/tpami.2025.3626550.
Source
IEEE Transactions on Pattern Analysis and Machine Intelligence
Conference
Keywords
46 Information and Computing Sciences, 4603 Computer Vision and Multimedia Computation, 4611 Machine Learning, Algorithms, Humans, Image Processing, Computer-Assisted, Neural Networks, Computer, Supervised Machine Learning
Subjects
Source
Publisher
IEEE
Additional links
Full-text link