DORAS | DCU Research Repository

wEBMT: developing and validating an example-based machine translation system using the world wide web

Way, Andy (ORCID: 0000-0001-5736-5930) and Gough, Nano (2003) wEBMT: developing and validating an example-based machine translation system using the world wide web. Computational Linguistics, 29 (3).

Abstract
We have developed an example-based machine translation (EBMT) system that uses the World Wide Web for two different purposes: First, we populate the system’s memory with translations gathered from rule-based MT systems located on the Web. The source strings input to these systems were extracted automatically from an extremely small subset of the rule types in the Penn-II Treebank. In subsequent stages, the (source, target) translation pairs obtained are automatically transformed into a series of resources that render the translation process more successful. Despite the fact that the output from on-line MT systems is often faulty, we demonstrate in a number of experiments that when used to seed the memories of an EBMT system, they can in fact prove useful in generating translations of high quality in a robust fashion. In addition, we demonstrate the relative gain of EBMT in comparison to on-line systems. Second, despite the perception that the documents available on the Web are of questionable quality, we demonstrate in contrast that such resources are extremely useful in automatically postediting translation candidates proposed by our system.
Metadata
Item Type: Article (Published)
Refereed: Yes
Uncontrolled Keywords: example-based machine translation
Subjects: Computer Science > Machine translating
DCU Faculties and Centres: Research Initiatives and Centres > National Centre for Language Technology (NCLT); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher: Association for Computational Linguistics
Official URL: http://www.aclweb.org/anthology-new/J/J03/
Copyright Information: ©2003 Association for Computational Linguistics
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
ID Code: 15318
Deposited On: 16 Mar 2010 13:56 by DORAS Administrator. Last Modified: 16 Nov 2018 12:00
Documents

Full text available as:

J03-3004.pdf (PDF, 166kB)