DORAS | DCU Research Repository

Automatic acquisition of Spanish LFG resources from the Cast3LB treebank

O'Donovan, Ruth, Cahill, Aoife (ORCID: 0000-0002-3519-7726), van Genabith, Josef and Way, Andy (ORCID: 0000-0001-5736-5930) (2005) Automatic acquisition of Spanish LFG resources from the Cast3LB treebank. In: the LFG05 Conference, 2005, University of Bergen.

Abstract
In this paper, we describe the automatic annotation of the Cast3LB Treebank with LFG f-structures for the subsequent extraction of Spanish probabilistic grammar and lexical resources. We adapt the approach and methodology developed for English by Cahill et al. (2004), O’Donovan et al. (2004) and others to Spanish and the Cast3LB treebank encoding. We report on the quality and coverage of the automatic f-structure annotation. Following the pipeline and integrated models of Cahill et al. (2004), we extract wide-coverage probabilistic LFG approximations and parse unseen Spanish text into f-structures. We also extend Bikel’s (2002) Multilingual Parse Engine to include a Spanish language module. Using the retrained Bikel parser in the pipeline model gives the best results against a manually constructed gold standard (73.20% preds-only f-score). We also extract Spanish lexical resources: 4090 semantic form types with 98 frame types. Subcategorised prepositions and particles are included in the frames.
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Conference
Refereed: Yes
Uncontrolled Keywords: LFG Treebanks
Subjects: Computer Science > Machine translating
DCU Faculties and Centres: DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of LFG '05. CSLI Publications.
Publisher: CSLI Publications
Copyright Information: © 2005 CSLI Publications
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 16174
Deposited On: 24 May 2011 14:33 by Shane Harper. Last Modified 25 Jan 2019 11:50
Documents

Full text available as:

AUTOMATIC_ACQUISITION_OF_SPANISH_LFG_RESOURCES_FROM_THE_CAST3LB_TREEBANK.pdf (PDF, 81kB)