DORAS | DCU Research Repository

Automatic generation of parallel treebanks: an efficient unsupervised system

Zhechev, Ventsislav (2009) Automatic generation of parallel treebanks: an efficient unsupervised system. PhD thesis, Dublin City University.

Abstract
The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. In this work I introduce a novel open-source platform for the fast and robust automatic generation of parallel treebanks through sub-tree alignment, using a limited amount of external resources. The intrinsic and extrinsic evaluations that I undertook demonstrate that my system is a feasible alternative to the manual annotation of parallel treebanks. Therefore, I expect the presented platform to help boost research in the field of syntax-augmented machine translation and lead to advancements in other fields where parallel treebanks can be employed.
Metadata
Item Type: Thesis (PhD)
Date of Award: November 2009
Refereed: No
Additional Information: This work was carried out in the ATTEMPT project, funded by an award to Prof. Way by SFI under the Basic Research Grants scheme.
Supervisor(s): Way, Andy
Uncontrolled Keywords: parallel treebanks; machine translation; subtree alignment
Subjects: Computer Science > Machine translating; Computer Science > Computational linguistics
DCU Faculties and Centres: Research Initiatives and Centres > National Centre for Language Technology (NCLT); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 License.
Funders: Science Foundation Ireland
ID Code: 14909
Deposited On: 12 Nov 2009 11:37 by Andrew Way. Last Modified: 19 Jul 2018 14:48
Documents

Full text available as:

PhD_Thesis_Zhechev_Final.pdf (PDF, 10MB)