Li, Na (2020) Learning and mining from personal digital archives. PhD thesis, Dublin City University.
Abstract
Given the explosion of new sensing technologies, data storage has become significantly cheaper and consequently, people increasingly rely on wearable devices to create personal digital archives. Lifelogging is the act of recording aspects of life in digital format for a variety of purposes such as aiding human memory, analysing human lifestyle and diet monitoring. In this dissertation we are concerned with Visual Lifelogging, a form of lifelogging based on the passive capture of photographs by a wearable camera. Cameras, such as Microsoft's SenseCam can record up to 4,000 images per day as well as logging data from several incorporated sensors. Considering the volume, complexity and heterogeneous nature of such data collections, it is a signifcant challenge to interpret and extract knowledge for the practical use of lifeloggers and others.
In this dissertation, time series analysis methods have been used to identify and extract useful information from temporal lifelogging images data, without benefit of prior knowledge. We focus, in particular, on three fundamental topics: noise reduction, structure and characterization of the raw data; the detection of multi-scale patterns; and the mining of important, previously unknown repeated patterns in the time series of lifelog image data.
Firstly, we show that Detrended Fluctuation Analysis (DFA) highlights the
feature of very high correlation in lifelogging image collections. Secondly, we show that study of equal-time Cross-Correlation Matrix demonstrates atypical or non-stationary characteristics in these images. Next, noise reduction in the Cross-Correlation Matrix is addressed by Random Matrix Theory (RMT) before Wavelet multiscaling is used to characterize the `most important' or `unusual' events through analysis of the associated dynamics of the eigenspectrum. A motif discovery technique is explored for detection of recurring and recognizable episodes of an individual's image data. Finally, we apply these motif discovery techniques to two known lifelog data collections, All I Have Seen (AIHS) and NTCIR-12 Lifelog, in order to examine multivariate recurrent patterns of multiple-lifelogging users.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | March 2020 |
Refereed: | No |
Supervisor(s): | Crane, Martin, Ruskin, Heather J. and Gurrin, Cathal |
Uncontrolled Keywords: | Motifs |
Subjects: | Computer Science > Computer simulation Computer Science > Machine learning Computer Science > Lifelog Physical Sciences > Statistical physics Mathematics > Numerical analysis |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Initiatives and Centres > Scientific Computing and Complex Systems Modelling (Sci-Sym) |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
ID Code: | 24113 |
Deposited On: | 27 Mar 2020 15:25 by Martin Crane . Last Modified 14 Aug 2020 11:26 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
5MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record