Guo, Jinlin and Gurrin, Cathal ORCID: 0000-0003-4395-7702 (2012) Short user-generated videos classification using accompanied audio categories. In: AMVA’12, 2 Nov. 2012, Nara, Japan. ISBN 978-1-4503-1585-2
Abstract
This paper investigates the classification of short user-generated videos (UGVs) using the accompanied audio data since short UGVs accounts for a great proportion of the Internet UGVs and many short UGVs are accompanied by singlecategory soundtracks. We define seven types of UGVs corresponding to seven audio categories respectively. We also investigate three modeling approaches for audio feature representation, namely, single Gaussian (1G), Gaussian mixture (GMM) and Bag-of-Audio-Word (BoAW) models. Then using Support Vector Machine (SVM) with three different distance measurements corresponding to three feature representations, classifiers are trained to categorize the UGVs. The accompanying evaluation results show that these approaches are effective for categorizing the short UGVs based on their audio track. Experimental results show that a GMM representation with approximated Bhattacharyya distance (ABD) measurement produces the best performance, and BoAW representation with chi-square kernel also reports comparable results.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | User-generated Video; MFCC; Video Classification |
Subjects: | Computer Science > Machine learning Computer Science > Algorithms Computer Science > Information retrieval Computer Science > Digital video |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Electronic Engineering Research Initiatives and Centres > CLARITY: The Centre for Sensor Web Technologies DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Published in: | Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis. . Association for Computing Machinery. ISBN 978-1-4503-1585-2 |
Publisher: | Association for Computing Machinery |
Official URL: | http://doi.acm.org/10.1145/2390214.2390220 |
Copyright Information: | © ACM, 2012. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in "Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis" http://doi.acm.org/10.1145/2390214.2390220 |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
ID Code: | 17529 |
Deposited On: | 06 Mar 2013 11:00 by Jinlin Guo . Last Modified 02 Nov 2018 15:33 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
217kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record