Toral, Antonio ORCID: 0000-0003-2357-2960 and Way, Andy ORCID: 0000-0001-5736-5930 (2011) Automatic acquisition of named entities for rule-based machine translation. In: the Second International Workshop on Free/Open-Source Rule-Based Machine Translation., 20-21 Jan 2011, Barcelona (Spain).
Abstract
This paper proposes to enrich RBMT dictionaries with Named Entities (NEs) automatically acquired from Wikipedia. The method is applied to the Apertium English–Spanish system and its performance compared to that of Apertium with and without handtagged NEs. The system with automatic NEs outperforms the one without NEs, while results vary when
compared to a system with handtagged NEs (results are comparable for Spanish! English but slightly worst for English!Spanish). Apart from that, adding automatic NEs contributes to decreasing the amount of unknown terms by more than 10%.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | rule based machine translation; RBMT; named entities; NE |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
ID Code: | 16153 |
Deposited On: | 20 Jun 2011 14:01 by Shane Harper . Last Modified 16 May 2019 11:37 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
161kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record