Título : Saliency Dataset and Predictive Model for Areas of Interest in VVC Perceptual Coding |
Autor : Kessler Martín, Jorge Fernández Lagos, Pablo García Lucas, David Cebrián Márquez, Gabriel Ríos, Belén Vigueras, Guillermo Díaz Honrubia, Antonio Jesús |
Editor : Institute of Electrical and Electronics Engineers (IEEE) |
Departamento: Departamentos de la UMH::Ingeniería de Computadores |
Fecha de publicación: 2024 |
URI : https://hdl.handle.net/11000/39591 |
Resumen :
Video coding standardization organizations have invested significant efforts in achieving greater compression factors over the years. Approved in 2020, the Versatile Video Coding (VVC) standard reduces the bit rate needed to encode a sequence by half compared to its predecessor. However, users today have increasingly demanding requirements, leading to a significant rise in video traffic on the Internet. In this context, perceptual video coding aims to reduce video bit rate by decreasing the objective quality while maintaining the subjective quality. This work presents a novel dataset designed for training models to predict video saliency, i.e., areas in the video to which viewers are more likely to pay attention. The dataset is publicly available. Furthermore, this work also proposes a machine learning model that classifies each Coding Tree Unit (CTU) as salient or not, and adjusts its quality accordingly. The results show that this model has an accuracy of 95% and correctly classifies as salient 98% of the CTUs that are actually salient.
|
Palabras clave/Materias: video coding training bit rate standards organizations organizations machine learning predictive models |
Área de conocimiento : CDU: Ciencias aplicadas: Ingeniería. Tecnología CDU: Generalidades.: Ciencia y tecnología de los ordenadores. Informática. CDU: Ciencias puras y naturales: Matemáticas |
Tipo de documento : info:eu-repo/semantics/article |
Derechos de acceso: info:eu-repo/semantics/closedAccess Attribution-NonCommercial-NoDerivatives 4.0 Internacional |
DOI : https://doi.org/10.1109/ICME57554.2024.10687868 |
Publicado en: 2024 IEEE International Conference on Multimedia and Expo (ICME) |
Nombre Congreso: IEEE International Conference on Multimedia and Expo (ICME) |
Aparece en las colecciones: Artículos Ingeniería de computadores
|