Blanch, Marc Gorriz (2023) Colour technologies for content production and distribution of broadcast content. PhD thesis, Dublin City University.
Abstract
The requirement of colour reproduction has long been a priority driving the development of new colour imaging systems that maximise human perceptual plausibility. This thesis explores machine learning algorithms for colour processing to assist both content production and distribution. First, this research studies colourisation technologies with practical use cases in restoration and processing of archived content. The research targets practical deployable solutions, developing a cost-effective pipeline which integrates the activity of the producer into the processing workflow. In particular, a fully automatic image colourisation paradigm using Conditional GANs is proposed to improve content generalisation and colourfulness of existing baselines. Moreover, a more conservative solution is considered by providing references to guide the system towards more accurate colour predictions. A fast-end-to-end architecture is proposed to improve existing exemplar-based image colourisation methods while decreasing the complexity and runtime. Finally, the proposed image-based methods are integrated into a video colourisation pipeline. A general framework is proposed to reduce the generation of temporal flickering or propagation of errors when such methods are applied frame-to-frame. The proposed model is jointly trained to stabilise the input video and to cluster their frames with the aim of learning scene-specific modes. Second, this research explored colour processing technologies for content distribution with the aim to effectively deliver the processed content to the broad audience. In particular, video compression is tackled by introducing a novel methodology for chroma intra prediction based on attention models. Although the proposed architecture helped to gain control over the reference samples and better understand the prediction process, the complexity of the underlying neural network significantly increased the encoding and decoding time. Therefore, aiming at efficient deployment within the latest video coding standards, this work also focused on the simplification of the proposed architecture to obtain a more compact and explainable model.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | March 2023 |
Refereed: | No |
Supervisor(s): | O'Connor, Noel E. and Mrak, Marta |
Subjects: | Computer Science > Image processing Computer Science > Multimedia systems Computer Science > Information storage and retrieval systems Computer Science > Digital video Computer Science > Video compression |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Initiatives and Centres > INSIGHT Centre for Data Analytics Research Initiatives and Centres > FUJO. Institute for Future Media, Democracy and Society |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License |
Funders: | European Union’s Horizon 2020 Marie Sklodowska Curie JOLT project |
ID Code: | 27982 |
Deposited On: | 31 Mar 2023 09:27 by Noel Edward O'connor . Last Modified 08 Dec 2023 15:38 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial-No Derivative Works 4.0 25MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record