Perceptual QP optimization for VVC with dual hybrid neural
networks

Ruiz Atencia, Javier; Lopez Granado, Otoniel; Pérez Malumbres, Manuel; Martínez-Rach, Miguel

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/11000/36863

Registro completo de metadatos

Campo DC	Valor	Lengua/Idioma
dc.contributor.author	Ruiz Atencia, Javier	-
dc.contributor.author	Lopez Granado, Otoniel	-
dc.contributor.author	Pérez Malumbres, Manuel	-
dc.contributor.author	Martínez-Rach, Miguel	-
dc.contributor.other	Departamentos de la UMH::Ingeniería de Computadores	es_ES
dc.date.accessioned	2025-07-14T12:04:57Z	-
dc.date.available	2025-07-14T12:04:57Z	-
dc.date.created	2025	-
dc.identifier.citation	The Journal of Supercomputing (2025) 81:464	es_ES
dc.identifier.issn	0920-8542	-
dc.identifier.uri	https://hdl.handle.net/11000/36863	-
dc.description.abstract	This paper introduces a dual hybrid neural network model combining convolu- tional neural networks (CNNs) and artificial neural networks (ANNs) to optimize the quantization parameter (QP) for both 64 × 64 and 32 × 32 blocks in the versatile video coding (VVC) standard, enhancing video quality and compression efficiency. The model employs CNNs for spatial feature extraction and ANNs for structured data handling, addressing the limitations of current heuristic and just noticeable distortion (JND)-based methods. A dataset of luminance channel image blocks, encoded with various QP values, is generated and preprocessed, and the dual hybrid network structure is designed with convolutional and dense layers. The QP optimi- zation is applied at two levels: the 64 × 64 model provides a global QP offset, while the 32 × 32 model refines the QP for further partitioned blocks. Performance evalu- ations using model error metrics like mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), as well as perceptual metrics like weighted PSNR (WPSNR), MS-SSIM, PSNR-HVS-M, and VMAF, demonstrate the model’s effectiveness. While our approach performs competitively with state-of-the- art algorithms, it significantly outperforms in VMAF, the most advanced and widely adopted perceptual quality metric. Furthermore, the dual-model approach yields bet- ter results at lower resolutions, whereas the single-model approach is more effective at higher resolutions. These results highlight the adaptability of the proposed mod- els, offering improvements in both compression efficiency and perceptual quality, making them highly suitable for practical applications in modern video coding.	es_ES
dc.format	application/pdf	es_ES
dc.format.extent	21	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Springer	es_ES
dc.rights	info:eu-repo/semantics/openAccess	es_ES
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Hybrid network	es_ES
dc.subject	CNN	es_ES
dc.subject	Perceptual	es_ES
dc.subject	QP	es_ES
dc.subject	VVC	es_ES
dc.subject	AdaptiveQP	es_ES
dc.subject	QPA	es_ES
dc.subject	HVS	es_ES
dc.subject.other	CDU::6 - Ciencias aplicadas::62 - Ingeniería. Tecnología	es_ES
dc.title	Perceptual QP optimization for VVC with dual hybrid neural networks	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.relation.publisherversion	https://doi.org/10.1007/s11227-025-06954-z	es_ES
Aparece en las colecciones: Artículos Ingeniería de computadores

Ver/Abrir:
2025-Supercomputing-Perceptual QP optimization for VVC with dual hybrid neural networks.pdf

1,21 MB
Adobe PDF
Compartir:

Mostrar el registro sencillo del ítem Ver estadísticas

La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.