Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/11000/39591
Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.contributor.authorKessler Martín, Jorge-
dc.contributor.authorFernández Lagos, Pablo-
dc.contributor.authorGarcía Lucas, David-
dc.contributor.authorCebrián Márquez, Gabriel-
dc.contributor.authorRíos, Belén-
dc.contributor.authorVigueras, Guillermo-
dc.contributor.authorDíaz Honrubia, Antonio Jesús-
dc.contributor.otherDepartamentos de la UMH::Ingeniería de Computadoreses_ES
dc.date.accessioned2026-03-26T12:00:04Z-
dc.date.available2026-03-26T12:00:04Z-
dc.date.created2024-
dc.identifier.citation2024 IEEE International Conference on Multimedia and Expo (ICME)es_ES
dc.identifier.urihttps://hdl.handle.net/11000/39591-
dc.description.abstractVideo coding standardization organizations have invested significant efforts in achieving greater compression factors over the years. Approved in 2020, the Versatile Video Coding (VVC) standard reduces the bit rate needed to encode a sequence by half compared to its predecessor. However, users today have increasingly demanding requirements, leading to a significant rise in video traffic on the Internet. In this context, perceptual video coding aims to reduce video bit rate by decreasing the objective quality while maintaining the subjective quality. This work presents a novel dataset designed for training models to predict video saliency, i.e., areas in the video to which viewers are more likely to pay attention. The dataset is publicly available. Furthermore, this work also proposes a machine learning model that classifies each Coding Tree Unit (CTU) as salient or not, and adjusts its quality accordingly. The results show that this model has an accuracy of 95% and correctly classifies as salient 98% of the CTUs that are actually salient.es_ES
dc.formatapplication/pdfes_ES
dc.format.extent6es_ES
dc.language.isoenges_ES
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)es_ES
dc.relation.ispartofIEEE International Conference on Multimedia and Expo (ICME)es_ES
dc.rightsinfo:eu-repo/semantics/closedAccesses_ES
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectvideo codinges_ES
dc.subjecttraininges_ES
dc.subjectbit ratees_ES
dc.subjectstandards organizationses_ES
dc.subjectorganizationses_ES
dc.subjectmachine learninges_ES
dc.subjectpredictive modelses_ES
dc.subject.otherCDU::6 - Ciencias aplicadas::62 - Ingeniería. Tecnologíaes_ES
dc.subject.otherCDU::0 - Generalidades.::04 - Ciencia y tecnología de los ordenadores. Informática.es_ES
dc.subject.otherCDU::5 - Ciencias puras y naturales::51 - Matemáticases_ES
dc.titleSaliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Codinges_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.relation.publisherversionhttps://doi.org/10.1109/ICME57554.2024.10687868es_ES
Aparece en las colecciones:
Artículos Ingeniería de computadores


no-thumbnailVer/Abrir:

 SALIENCY DATASET AND PREDICTIVE MODEL.pdf



152,14 kB
Adobe PDF
Compartir:


Creative Commons La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.