Hearne, Mary and Way, Andy ORCID: 0000-0001-5736-5930 (2006) Disambiguation strategies for data-oriented translation. In: EAMT 2006 - 11th Annual conference of the European Association for Machine Translation, 19-20 June 2006, Oslo, Norway.
Abstract
The Data-Oriented Translation (DOT) model { originally proposed in (Poutsma, 1998, 2003) and based on Data-Oriented Parsing (DOP) (e.g. (Bod, Scha, & Sima'an, 2003)) { is best described as a hybrid model of
translation as it combines examples, linguistic information and a statistical translation model. Although theoretically interesting, it inherits the computational complexity associated with DOP. In this paper, we focus on
one computational challenge for this model: efficiently selecting the `best' translation to output. We present four different disambiguation strategies in terms of how they are implemented in our DOT system, along with experiments
which investigate how they compare in terms of accuracy and
efficiency.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | data-oriented translation; |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Initiatives and Centres > National Centre for Language Technology (NCLT) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Official URL: | http://eamt.emmtee.net/index.php?page=1 |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland, SFI 05/IN/1732 |
ID Code: | 15278 |
Deposited On: | 10 Mar 2010 16:20 by DORAS Administrator . Last Modified 16 Nov 2018 11:15 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
157kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record