Uí Dhonnchadha, Elaine and van Genabith, Josef (2006) A Part-of-Speech tagger for Irish using finite state morphology and constraint grammar disambiguation. In: LREC 2006, May 2006, Genoa.
Abstract
This paper describes the methodology used to develop a part-of-speech tagger for Irish, which is used to annotate a corpus of 30 million words of text with part-of-speech tags and lemmas. The tagger is evaluated using a manually disambiguated test corpus and it currently achieves 95% accuracy on unrestricted text. To our knowledge, this is the first part-of-speech tagger for Irish.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | E-Learning; finite state morphology; constraint grammar disambiguation |
Subjects: | Social Sciences > Educational technology |
DCU Faculties and Centres: | Research Initiatives and Centres > National Centre for Language Technology (NCLT) |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
ID Code: | 16367 |
Deposited On: | 02 Jun 2011 08:42 by Shane Harper . Last Modified 14 Oct 2016 10:00 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
347kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record