Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding

Kessler Martín, Jorge; Fernández Lagos, Pablo; García Lucas, David; Cebrián Márquez, Gabriel; Ríos, Belén; Vigueras, Guillermo; Díaz Honrubia, Antonio Jesús

Please use this identifier to cite or link to this item: https://hdl.handle.net/11000/39591

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kessler Martín, Jorge	-
dc.contributor.author	Fernández Lagos, Pablo	-
dc.contributor.author	García Lucas, David	-
dc.contributor.author	Cebrián Márquez, Gabriel	-
dc.contributor.author	Ríos, Belén	-
dc.contributor.author	Vigueras, Guillermo	-
dc.contributor.author	Díaz Honrubia, Antonio Jesús	-
dc.contributor.other	Departamentos de la UMH::Ingeniería de Computadores	es_ES
dc.date.accessioned	2026-03-26T12:00:04Z	-
dc.date.available	2026-03-26T12:00:04Z	-
dc.date.created	2024	-
dc.identifier.citation	2024 IEEE International Conference on Multimedia and Expo (ICME)	es_ES
dc.identifier.uri	https://hdl.handle.net/11000/39591	-
dc.description.abstract	Video coding standardization organizations have invested significant efforts in achieving greater compression factors over the years. Approved in 2020, the Versatile Video Coding (VVC) standard reduces the bit rate needed to encode a sequence by half compared to its predecessor. However, users today have increasingly demanding requirements, leading to a significant rise in video traffic on the Internet. In this context, perceptual video coding aims to reduce video bit rate by decreasing the objective quality while maintaining the subjective quality. This work presents a novel dataset designed for training models to predict video saliency, i.e., areas in the video to which viewers are more likely to pay attention. The dataset is publicly available. Furthermore, this work also proposes a machine learning model that classifies each Coding Tree Unit (CTU) as salient or not, and adjusts its quality accordingly. The results show that this model has an accuracy of 95% and correctly classifies as salient 98% of the CTUs that are actually salient.	es_ES
dc.format	application/pdf	es_ES
dc.format.extent	6	es_ES
dc.language.iso	eng	es_ES
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	es_ES
dc.relation.ispartof	IEEE International Conference on Multimedia and Expo (ICME)	es_ES
dc.rights	info:eu-repo/semantics/closedAccess	es_ES
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	video coding	es_ES
dc.subject	training	es_ES
dc.subject	bit rate	es_ES
dc.subject	standards organizations	es_ES
dc.subject	organizations	es_ES
dc.subject	machine learning	es_ES
dc.subject	predictive models	es_ES
dc.subject.other	CDU::6 - Ciencias aplicadas::62 - Ingeniería. Tecnología	es_ES
dc.subject.other	CDU::0 - Generalidades.::04 - Ciencia y tecnología de los ordenadores. Informática.	es_ES
dc.subject.other	CDU::5 - Ciencias puras y naturales::51 - Matemáticas	es_ES
dc.title	Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding	es_ES
dc.type	info:eu-repo/semantics/article	es_ES
dc.relation.publisherversion	https://doi.org/10.1109/ICME57554.2024.10687868	es_ES
Appears in Collections: Artículos Ingeniería de computadores

View/Open:

SALIENCY DATASET AND PREDICTIVE MODEL.pdf

152,14 kB
Adobe PDF
Share:

Show simple item record View statistics

???jsp.display-item.text9???