Spanish translations of Cochrane plain language summaries: assessing the impact of a controlled language checker on machine translation quality

Abstract

Background Cochrane aims to make high-quality health information accessible for users worldwide (Chandler et al. 2017). To this end, contributors simplify Systematic Reviews into Plain Language Summaries (PLS), which are translated (Cochrane Community 2016). Simplification and translation may represent onerous and time-consuming tasks when conducted without assistance from technology. Contributors produce PLS by manually implementing a written set of guidelines (Cochrane Methods 2013) and translations are mainly produced by volunteers (Cochrane Community 2016). Cochrane is considering using machine translation (MT) systems and controlled language (CL) checkers (Birch 2017; Von Elm et al. 2013). A CL is a set of rules adopted to make a text more comprehensible and translatable. A CL checker is the software that checks for adherence to those rules (O’Brien 2010). The adoption of a CL at the authoring stage can increase machine translatability and reduce the effort required to correct the MT outputs (Allen 2003). Objectives This study investigated if introducing a CL checker called Acrolinx in the authoring process of PLS increases the quality of the Spanish translations of PLS produced by Google Translate, a freely available MT system. Method and Materials The experimental materials were: a corpus of 12 PLS authored by manually implementing Cochrane guidelines and their Spanish MT translations; and a corpus of 12 PLS authored with assistance from Acrolinx and their Spanish MT translations. 41 Spanish native speakers conducted the MT evaluation task. Each evaluator was given two PLS (one per corpus) and their Spanish MT outputs. Evaluators assigned scores to the Spanish outputs on fluency and adequacy, commonly employed in MT quality evaluation (Castilho et al. 2017). Results and Conclusions There was little difference in fluency and adequacy scores between the two corpora of PLS, possibly because neither approach to simplification had been developed with MT quality in mind. Moreover, fluency and adequacy scores were relatively high, suggesting that the MT system produced reasonably good translations. While human validation/correction of the MT output is needed for accuracy, the high adequacy and fluency scores for Spanish indicate that integrating MT in the translation workflow might be a viable option. Further studies should be conducted on other language pairs.

Item Type:	Conference or Workshop Item (Invited Talk)
Event Type:	Conference
Refereed:	No
Subjects:	Humanities > Linguistics Humanities > Translating and interpreting Humanities > Spanish language
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Humanities and Social Science > School of Applied Language and Intercultural Studies
Official URL:	https://abstracts.cochrane.org/spanish-translation...
Copyright Information:	© 2018 The Author
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:	Irish Research Council and European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 734211
ID Code:	23696
Deposited On:	13 Sep 2019 14:23 by Alessandra Rossetti . Last Modified 13 Sep 2019 14:23

Item Type:

Conference or Workshop Item (Invited Talk)

Event Type:

Conference

Refereed:

Subjects:

Humanities > Linguistics
Humanities > Translating and interpreting
Humanities > Spanish language

DCU Faculties and Centres:

DCU Faculties and Schools > Faculty of Humanities and Social Science > School of Applied Language and Intercultural Studies

Official URL:

https://abstracts.cochrane.org/spanish-translation...

Use License: