Disambiguation strategies for data-oriented translation

Hearne, Mary and Way, Andy ORCID: 0000-0001-5736-5930 (2006) Disambiguation strategies for data-oriented translation. In: EAMT 2006 - 11th Annual conference of the European Association for Machine Translation, 19-20 June 2006, Oslo, Norway.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

The Data-Oriented Translation (DOT) model { originally proposed in (Poutsma, 1998, 2003) and based on Data-Oriented Parsing (DOP) (e.g. (Bod, Scha, & Sima'an, 2003)) { is best described as a hybrid model of translation as it combines examples, linguistic information and a statistical translation model. Although theoretically interesting, it inherits the computational complexity associated with DOP. In this paper, we focus on one computational challenge for this model: efficiently selecting the `best' translation to output. We present four different disambiguation strategies in terms of how they are implemented in our DOT system, along with experiments which investigate how they compare in terms of accuracy and efficiency.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Conference
Refereed:	Yes
Uncontrolled Keywords:	data-oriented translation;
Subjects:	Computer Science > Machine translating
DCU Faculties and Centres:	Research Initiatives and Centres > National Centre for Language Technology (NCLT) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:	http://eamt.emmtee.net/index.php?page=1
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:	Science Foundation Ireland, SFI 05/IN/1732
ID Code:	15278
Deposited On:	10 Mar 2010 16:20 by DORAS Administrator . Last Modified 16 Nov 2018 11:15

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
157kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Disambiguation strategies for data-oriented translation

Downloads