DORAS | DCU Research Repository

An experiment in audio classification from compressed data

Jarina, Roman, O'Connor, Noel E. (ORCID: 0000-0002-4033-9135), Murphy, Noel and Marlow, Seán (2004) An experiment in audio classification from compressed data. In: IWSSIP 2004 - International Workshop on Systems, Signals and Image Processing, 13-15 September 2004, Poznan, Poland.

Abstract
In this paper we present an algorithm for automatic classification of sound into speech, instrumental sound/music, and silence. The method is based on thresholding features derived from the modulation envelope of the frequency-limited audio signal. Four characteristics are examined for discrimination: the occurrence of energy peaks, the duration of energy peaks, rhythmic content, and the level of harmonic content. The proposed algorithm allows classification directly on MPEG-1 audio bitstreams. The performance of the classifier was evaluated on TRECVID test data. The test results are above average among all TREC participants. The approaches adopted by other research groups participating in TREC are also discussed.
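The thresholding idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it works on a plain list of samples rather than an MPEG-1 bitstream, uses per-frame RMS energy as a crude stand-in for the modulation envelope, and looks at only one of the four characteristics (how often energy peaks occur). The function names, the `silence_level` and `peak_rate_threshold` values, and the rule that frequent peaks suggest speech are all illustrative assumptions.

```python
import math

def frame_energy(samples, frame_len):
    """RMS energy per non-overlapping frame (assumed stand-in for the envelope)."""
    starts = range(0, len(samples) - frame_len + 1, frame_len)
    return [
        math.sqrt(sum(x * x for x in samples[i:i + frame_len]) / frame_len)
        for i in starts
    ]

def count_peaks(envelope, level):
    """Count runs of consecutive frames whose energy exceeds `level`."""
    peaks, above = 0, False
    for e in envelope:
        if e > level and not above:
            peaks += 1  # a new run above the level starts here
        above = e > level
    return peaks

def classify(envelope, frames_per_second,
             silence_level=0.01, peak_rate_threshold=2.0):
    """Label a segment by thresholding envelope features (hypothetical thresholds)."""
    if not envelope or max(envelope) < silence_level:
        return "silence"
    level = 0.5 * max(envelope)  # assumed relative peak level
    duration_s = len(envelope) / frames_per_second
    peak_rate = count_peaks(envelope, level) / duration_s
    # Assumption: frequent short energy peaks suggest speech,
    # a sustained envelope suggests music.
    return "speech" if peak_rate >= peak_rate_threshold else "music"
```

For example, `classify([1.0] * 100, frames_per_second=20)` sees a single sustained run of energy and labels the segment as music, while an envelope that switches on and off several times per second is labelled as speech. A fuller classifier would combine this with peak duration, rhythmic content, and harmonic content, as the abstract describes.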
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Uncontrolled Keywords: speech; music; MPEG; TREC; audio features
Subjects: Computer Science > Information retrieval
DCU Faculties and Centres: Research Initiatives and Centres > Centre for Digital Video Processing (CDVP)
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 395
Deposited On: 01 Apr 2008 by DORAS Administrator. Last Modified: 09 Nov 2018 09:56
Documents

Full text available as: iwssip_2004.pdf (PDF, 187kB)