Por favor, use este identificador para citar o enlazar este ítem:
https://hdl.handle.net/11000/39591Registro completo de metadatos
| Campo DC | Valor | Lengua/Idioma |
|---|---|---|
| dc.contributor.author | Kessler Martín, Jorge | - |
| dc.contributor.author | Fernández Lagos, Pablo | - |
| dc.contributor.author | García Lucas, David | - |
| dc.contributor.author | Cebrián Márquez, Gabriel | - |
| dc.contributor.author | Ríos, Belén | - |
| dc.contributor.author | Vigueras, Guillermo | - |
| dc.contributor.author | Díaz Honrubia, Antonio Jesús | - |
| dc.contributor.other | Departamentos de la UMH::Ingeniería de Computadores | es_ES |
| dc.date.accessioned | 2026-03-26T12:00:04Z | - |
| dc.date.available | 2026-03-26T12:00:04Z | - |
| dc.date.created | 2024 | - |
| dc.identifier.citation | 2024 IEEE International Conference on Multimedia and Expo (ICME) | es_ES |
| dc.identifier.uri | https://hdl.handle.net/11000/39591 | - |
| dc.description.abstract | Video coding standardization organizations have invested significant efforts in achieving greater compression factors over the years. Approved in 2020, the Versatile Video Coding (VVC) standard reduces the bit rate needed to encode a sequence by half compared to its predecessor. However, users today have increasingly demanding requirements, leading to a significant rise in video traffic on the Internet. In this context, perceptual video coding aims to reduce video bit rate by decreasing the objective quality while maintaining the subjective quality. This work presents a novel dataset designed for training models to predict video saliency, i.e., areas in the video to which viewers are more likely to pay attention. The dataset is publicly available. Furthermore, this work also proposes a machine learning model that classifies each Coding Tree Unit (CTU) as salient or not, and adjusts its quality accordingly. The results show that this model has an accuracy of 95% and correctly classifies as salient 98% of the CTUs that are actually salient. | es_ES |
| dc.format | application/pdf | es_ES |
| dc.format.extent | 6 | es_ES |
| dc.language.iso | eng | es_ES |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | es_ES |
| dc.relation.ispartof | IEEE International Conference on Multimedia and Expo (ICME) | es_ES |
| dc.rights | info:eu-repo/semantics/closedAccess | es_ES |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 Internacional | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
| dc.subject | video coding | es_ES |
| dc.subject | training | es_ES |
| dc.subject | bit rate | es_ES |
| dc.subject | standards organizations | es_ES |
| dc.subject | organizations | es_ES |
| dc.subject | machine learning | es_ES |
| dc.subject | predictive models | es_ES |
| dc.subject.other | CDU::6 - Ciencias aplicadas::62 - Ingeniería. Tecnología | es_ES |
| dc.subject.other | CDU::0 - Generalidades.::04 - Ciencia y tecnología de los ordenadores. Informática. | es_ES |
| dc.subject.other | CDU::5 - Ciencias puras y naturales::51 - Matemáticas | es_ES |
| dc.title | Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding | es_ES |
| dc.type | info:eu-repo/semantics/article | es_ES |
| dc.relation.publisherversion | https://doi.org/10.1109/ICME57554.2024.10687868 | es_ES |
SALIENCY DATASET AND PREDICTIVE MODEL.pdf
152,14 kB
Adobe PDF
Compartir:
La licencia se describe como: Atribución-NonComercial-NoDerivada 4.0 Internacional.
.png)