O'Brien, Darragh (2000) Speech synthesis based on a harmonic model. PhD thesis, Dublin City University.
Abstract
The wide range of potential commercial applications for a com puter system capable of automatically converting text to speech (TTS) has stimulated decades of research.
One of the currently most successful approaches to synthesising speech, concatenative TTS synthesis, combines prerecorded speech units to build full utterances. However, th e prosody of the stored units is often not consistent with that of the target utterance and m ust be altered. Furthermore, several types of mismatch can occur at unit boundaries and must be smoothed. Thus, pitch and time-scale modification techniques as well as smoothing algorithms play a critical role in all concatenative-based systems.
This thesis presents the developm ent of a concatenative TTS system based on a harm onic model and incorporating new pitch and time-scaling as well as smoothing algorithms.
Experim ent has shown our system capable of both very high quality prosodic modification and synthesis. Results com pare very favourably with those of existing state-of-the-art systems.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | 2000 |
Refereed: | No |
Supervisor(s): | Monaghan, Alex |
Uncontrolled Keywords: | Speech synthesis; Automatic speech recognition |
Subjects: | Computer Science > Computational linguistics |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
ID Code: | 19144 |
Deposited On: | 04 Sep 2013 12:58 by Celine Campbell . Last Modified 30 Sep 2022 15:23 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
37MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record