Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Tracking relevant alignment characteristics for machine translation

Lambert, Patrik, Ma, Yanjun, Ozdowska, Sylwia and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2009) Tracking relevant alignment characteristics for machine translation. In: MT Summit XII - The twelfth Machine Translation Summit, 26-30 August 2009, Ottawa, Canada.

Abstract
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word alignment. In this paper we compare alignments tuned directly according to alignment F-score and BLEU score in order to investigate the alignment characteristics that are helpful in translation. We report results for two different SMT systems (a phrase-based and an n-gram-based system) on Chinese to English IWSLT data, and Spanish to English European Parliament data. We give alignment hints to improve BLEU score, depending on the SMT system used and the type of corpus.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:statistical machine translation;
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Official URL:http://summitxii.amtaweb.org/
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:Science Foundation Ireland, SFI 05/IN/1732, SFI 07/CE/I1142
ID Code:15161
Deposited On:15 Feb 2010 11:56 by DORAS Administrator . Last Modified 14 Nov 2018 16:34
Documents

Full text available as:

[thumbnail of LambertEtAl_mts_09.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
453kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record