Jarina, Roman, Murphy, Noel, O'Connor, Noel E. ORCID: 0000-0002-4033-9135 and Marlow, Seán (2001) Speech-music discrimination from MPEG-1 bitstream. In: SSIP 2001 - WSES International Conference on Speech, Signal and Image Processing, 1-6 September 2001, Malta.
Abstract
This paper describes a proposed algorithm for speech/music discrimination that works on data taken directly from an MPEG-encoded bitstream, thus avoiding the computationally expensive decoding-encoding process. The method is based on thresholding features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data containing clean and noisy speech from several speakers and a variety of music content. The discriminator is able to work in real time and, despite its simplicity, gives very promising results.
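As a rough illustration of the thresholding idea in the abstract, the following Python sketch classifies fixed-length windows of a per-frame envelope sequence. It is not the authors' algorithm: the abstract does not specify the features or the threshold values, so the `dip_ratio` feature, the function names, and the parameters `rel_level`, `window`, and `threshold` are all hypothetical. The envelope values are assumed to have been obtained already from the low-frequency subbands of the bitstream (for example from subband scalefactors), and that extraction step is not shown.

```python
from typing import List, Sequence


def dip_ratio(envelope: Sequence[float], rel_level: float = 0.5) -> float:
    """Fraction of frames whose envelope lies below rel_level * mean envelope.

    Hypothetical feature chosen for illustration; not taken from the paper.
    """
    if not envelope:
        raise ValueError("envelope must not be empty")
    mean = sum(envelope) / len(envelope)
    low = sum(1 for value in envelope if value < rel_level * mean)
    return low / len(envelope)


def classify(envelope: Sequence[float],
             window: int = 50,
             threshold: float = 0.3) -> List[str]:
    """Label each non-overlapping window as 'speech' or 'music'.

    A window is called speech when its dip ratio exceeds the threshold.
    Both window length and threshold are assumed values, not tuned ones.
    Trailing frames that do not fill a whole window are ignored.
    """
    labels = []
    for start in range(0, len(envelope) - window + 1, window):
        segment = envelope[start:start + window]
        labels.append("speech" if dip_ratio(segment) > threshold else "music")
    return labels


# Synthetic envelopes: bursts separated by pauses versus a steady level.
speech_like = ([1.0] * 5 + [0.05] * 5) * 5
music_like = [0.8, 0.9, 1.0, 0.9, 0.8] * 10
print(classify(speech_like + music_like))
```

Speech tends to alternate between syllable bursts and short pauses, so under these assumptions a window with many low-envelope frames is labelled speech. A practical system would need features and thresholds tuned on labelled data.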
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | audio; video; classification; speech; music; signal processing; MPEG |
Subjects: | Computer Science > Digital video; Computer Science > Information retrieval |
DCU Faculties and Centres: | Research Initiatives and Centres > Centre for Digital Video Processing (CDVP) |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
ID Code: | 332 |
Deposited On: | 13 Mar 2008 by DORAS Administrator. Last Modified 09 Nov 2018 10:02 |
Documents
Full text available as:
PDF (562kB)