Zhechev, Ventsislav and Way, Andy ORCID: 0000-0001-5736-5930 (2008) Automatic generation of parallel treebanks. In: COLING 2008 - 22nd International Conference on Computational Linguistics, 18-22 August 2008, Manchester, UK.
Abstract
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. In this paper we introduce a novel platform for fast and robust automatic generation of
parallel treebanks. The software we have developed based on this platform has been shown to handle large data sets. We also present evaluation results demonstrating the quality of the derived treebanks and discuss some possible modifications and improvements that can lead to even better
results. We expect the presented platform to help boost research in the field of dataoriented machine translation and lead to advancements in other fields where parallel
treebanks can be employed.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | Research Initiatives and Centres > National Centre for Language Technology (NCLT) DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Publisher: | Association for Computational Linguistics |
Official URL: | http://aclweb.org/anthology/C/C08/ |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland, SFI 05/RF/CMS064 |
ID Code: | 15197 |
Deposited On: | 16 Feb 2010 16:01 by DORAS Administrator . Last Modified 14 Nov 2018 16:40 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
232kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record