Mellebeek, Bart, Owczarzak, Karolina, Groves, Declan, van Genabith, Josef and Way, Andy ORCID: 0000-0001-5736-5930 (2006) A syntactic skeleton for statistical machine translation. In: EAMT 2006 - 11th Annual conference of the European Association for Machine Translation, 19-20 June 2006, Oslo, Norway.
Abstract
We present a method for improving statistical machine translation performance by using linguistically motivated syntactic information. Our algorithm recursively decomposes source language sentences into syntactically simpler and shorter chunks, and recomposes their translation to form target language sentences. This improves both the word order and lexical selection of the translation. We report statistically significant relative improvementsof 3.3% BLEU score in an experiment (English!Spanish) carried out on
an 800-sentence test set extracted from the Europarl corpus.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | statistical machine translation; |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Initiatives and Centres > National Centre for Language Technology (NCLT) |
Official URL: | http://eamt.emmtee.net/index.php?page=1 |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
ID Code: | 15279 |
Deposited On: | 10 Mar 2010 16:25 by DORAS Administrator . Last Modified 16 Nov 2018 11:14 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
112kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record