Mitchell, Linda (2015) Community post-editing of machine-translated user-generated content. PhD thesis, Dublin City University.
Abstract
With the constant growth of user-generated content (UGC) online, the demand for quick translations of large volumes of texts increases. This is often met with a combination of machine translation (MT) and post-editing (PE). Despite extensive research in post-editing with professional translators or translation students, there are few PE studies with lay post-editors, such as domain experts. This thesis explores lay post-editing as a feasible solution for UGC in a technology support forum, machine translated from English into German. This context of lay post-editing in an online community prompts for a redefinition of quality.
We adopt a mixed-methods approach, investigating PE quality quantitatively with an error annotation, a domain specialist evaluation and an end-user evaluation. We further explore post-editing behaviour, i.e. specific edits performed, from a qualitative perspective. With the involvement of community members, the need for a PE competence model becomes even more pressing. We investigate whether Gopferich’s translation competence (TC) model (2009) may serve as a basis for lay post-editing.
Our quantitative data proves with statistical significance that lay post-editing is a feasible concept, producing variable output, however. On a qualitative level, post-editing is successful for short segments requiring ~35% post-editing effort. No post-editing patterns were detected for segments requiring more PE effort. Lastly, our data suggests that PE quality is largely independent of the profile characteristics measured.
This thesis constitutes an important advance in lay post-editing and benchmarking the evaluation of its output, uncovering difficulties in pinpointing reasons for variance in the resulting quality.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | November 2015 |
Refereed: | No |
Supervisor(s): | O'Brien, Sharon, Roturier, Johann and Hollowood, Fred |
Uncontrolled Keywords: | Post-editing; Community |
Subjects: | Computer Science > Machine translating Humanities > Translating and interpreting |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Humanities and Social Science > School of Applied Language and Intercultural Studies Research Initiatives and Centres > Centre for Translation and Textual Studies (CTTS) |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License. View License |
Funders: | Symantec |
ID Code: | 20463 |
Deposited On: | 23 Nov 2015 14:11 by Sharon O'brien . Last Modified 19 Jul 2018 15:05 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record