Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Statistical analysis of alignment characteristics for phrase-based machine translation

Lambert, Patrik, Petitrenaud, Simon, Ma, Yanjun and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2010) Statistical analysis of alignment characteristics for phrase-based machine translation. In: EAMT 2010 - 14th Annual Conference of the European Association for Machine Translation, 27-28 May 2010, Saint-Raphaël, France.

Abstract
In most statistical machine translation (SMT) systems, bilingual segments are extracted via word alignment. However, there lacks systematic study as to what alignment characteristics can benefit MT under specific experimental settings such as the language pair or the corpus size. In this paper we produce a set of alignments by directly tuning the alignment model according to alignment F-score and BLEU score in order to investigate the alignment characteristics that are helpful in translation. We report results for a phrasebased SMT system on Chinese-to-English IWSLT data, and Spanish-to-English European Parliament data. With a statistical analysis into alignment characteristics that are correlated with BLEU score, we give alignment hints to improve BLEU score using a phrase-based SMT system and different types of corpus.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
Published in: Proceedings of the 14th Annual Conference of the EAMT. . European Association for Machine Translation.
Publisher:European Association for Machine Translation
Official URL:http://www.mt-archive.info/EAMT-2010-TOC.htm
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:15790
Deposited On:09 Nov 2010 17:08 by Shane Harper . Last Modified 09 Nov 2018 15:44
Documents

Full text available as:

[thumbnail of Statistical_Analysis_of_Alignment_Characteristics_for_Phrase-based.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
144kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record