Poncelas, Alberto ORCID: 0000-0002-5089-1687, Sarasola, Kepa ORCID: 0000-0003-4349-6088, Dowling, Meghan ORCID: 0000-0003-1637-4923, Way, Andy ORCID: 0000-0001-5736-5930, Labaka, Gorka ORCID: 0000-0003-4611-2502 and Alegria, Iñaki ORCID: 0000-0002-0272-1472 (2019) Adapting NMT to caption translation in Wikimedia Commons for low-resource languages. Procesamiento de Lenguaje Natural, 63, pp. 33-40. ISSN 1135-5948
Abstract
This paper presents a successful domain adaptation of a general neural machine translation (NMT) system, using a bilingual corpus built from image captions in Wikimedia Commons, for the Spanish-Basque and English-Irish language pairs.
Metadata
Item Type: | Article (Published) |
---|---|
Refereed: | Yes |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing; Research Initiatives and Centres > ADAPT |
Publisher: | Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN) |
Official URL: | http://journal.sepln.org/sepln/ojs/ojs/index.php/p... |
Copyright Information: | © 2019 Sociedad Española para el Procesamiento del Lenguaje Natural |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
Funders: | TADEEP project (Spanish Ministry of Economy and Competitiveness TIN2015-70214-P, with FEDER funding), ADAPT Centre for Digital Content Technology under the SFI (Grant 13/RC/2106), European Regional Development Fund |
ID Code: | 24603 |
Deposited On: | 15 Jun 2020 13:43 by Vidatum Academic. Last Modified 25 Jun 2021 13:02 |
Documents
Full text available as: PDF (748kB)