Chen, Shu (2016) Investigating multi-modal features in the design of a multi-media hyperlinking framework. PhD thesis, Dublin City University.
Abstract
Search, as a well-known information retrieval strategy, is widely researched and developed for academic and commercial usage. However, in the context of increasing amounts of multimedia data, search alone cannot satisfy user requirements for exploring multimedia resources. Therefore, preprocessing of multimedia resources is necessary to define potentially related documents to reduce retrieval time and improve the browsing efficiency. Using hyperlinks to connect relevant resources is widely used for multimedia collection. However, the definition of hyperlinks is usually based on textual information. For example, hyperlinks in Wikipedia link a term to relevant webpages. By contrast, content based multimedia retrieval provides the possibility of analysing multimedia materials on the actual content. The availability of these technologies for multimedia search suggests further investigation of content-based hyperlinking for multimedia collections.
This thesis is dedicated to a novel topic of automatically creating hyperlinks within TV data collections for content-based browsing and navigation. Hyperlinks are created between video segments determined to be related based on their multimodal features.
First, we detail the methodologies to create potentially relevant segments across the TV collection in terms of automatically detected spoken information. We present which of these approaches are more efficient to segment video streams.
Next, we involve both low-level and high-level visual features to improve the hyperlinking quality. We detail the implementation of data fusion schemes to combine multimodal features.
Finally, a novel hyperlinking framework associated with query enrichment, spoken data analysis, and multimodal fusion is proposed. The experiments show the effectiveness of this framework at satisfying user experience which is concluded in crowdsourcing study.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | November 2016 |
Refereed: | No |
Supervisor(s): | O'Connor, Noel E. and Jones, Gareth J.F. |
Uncontrolled Keywords: | video hyperlinking |
Subjects: | Computer Science > Machine learning Engineering > Electronic engineering Computer Science > Multimedia systems Computer Science > Information retrieval Computer Science > Digital video Computer Science > Image processing |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Initiatives and Centres > CLARITY: The Centre for Sensor Web Technologies Research Initiatives and Centres > ADAPT |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
Funders: | European Framework Programme 7, AXES (ICT-269980), Science Foundation Ireland |
ID Code: | 21321 |
Deposited On: | 21 Nov 2016 11:57 by Gareth Jones . Last Modified 25 Oct 2018 09:20 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
6MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record