Rossetti, Alessandra ORCID: 0000-0002-2162-9639, O'Brien, Sharon ORCID: 0000-0003-4864-5986 and Ried, Juliane (2018) Spanish translations of Cochrane plain language summaries: assessing the impact of a controlled language checker on machine translation quality. In: Cochrane Colloquium 2018, 16-18 Sept 2018, Edinburgh, Scotland.
Abstract
Background
Cochrane aims to make high-quality health information accessible for users worldwide (Chandler et al. 2017). To this end, contributors simplify Systematic Reviews into Plain Language Summaries (PLS), which are translated (Cochrane Community 2016). Simplification and translation may represent onerous and time-consuming tasks when conducted without assistance from technology. Contributors produce PLS by manually implementing a written set of guidelines (Cochrane Methods 2013) and translations are mainly produced by volunteers (Cochrane Community 2016). Cochrane is considering using machine translation (MT) systems and controlled language (CL) checkers (Birch 2017; Von Elm et al. 2013).
A CL is a set of rules adopted to make a text more comprehensible and translatable. A CL checker is the software that checks for adherence to those rules (O’Brien 2010). The adoption of a CL at the authoring stage can increase machine translatability and reduce the effort required to correct the MT outputs (Allen 2003).
Objectives
This study investigated if introducing a CL checker called Acrolinx in the authoring process of PLS increases the quality of the Spanish translations of PLS produced by Google Translate, a freely available MT system.
Method and Materials
The experimental materials were: a corpus of 12 PLS authored by manually implementing Cochrane guidelines and their Spanish MT translations; and a corpus of 12 PLS authored with assistance from Acrolinx and their Spanish MT translations.
41 Spanish native speakers conducted the MT evaluation task. Each evaluator was given two PLS (one per corpus) and their Spanish MT outputs. Evaluators assigned scores to the Spanish outputs on fluency and adequacy, commonly employed in MT quality evaluation (Castilho et al. 2017).
Results and Conclusions
There was little difference in fluency and adequacy scores between the two corpora of PLS, possibly because neither approach to simplification had been developed with MT quality in mind. Moreover, fluency and adequacy scores were relatively high, suggesting that the MT system produced reasonably good translations. While human validation/correction of the MT output is needed for accuracy, the high adequacy and fluency scores for Spanish indicate that integrating MT in the translation workflow might be a viable option. Further studies should be conducted on other language pairs.
Metadata
Item Type: | Conference or Workshop Item (Invited Talk) |
---|---|
Event Type: | Conference |
Refereed: | No |
Subjects: | Humanities > Linguistics Humanities > Translating and interpreting Humanities > Spanish language |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Humanities and Social Science > School of Applied Language and Intercultural Studies |
Official URL: | https://abstracts.cochrane.org/spanish-translation... |
Copyright Information: | © 2018 The Author |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Irish Research Council and European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 734211 |
ID Code: | 23696 |
Deposited On: | 13 Sep 2019 14:23 by Alessandra Rossetti . Last Modified 13 Sep 2019 14:23 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
402kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record