Nayak, Prashanth ORCID: 0000-0003-1962-9135 (2024) A critical examination of document-level machine translation systems. PhD thesis, Dublin City University.
Abstract
The need for accurate and effective translation cannot be overstated in an increasingly globalised world where communication is paramount. Bridging language barriers is important for promoting understanding and cooperation among diverse individuals and communities, making translation an indispensable tool. Over the past two decades, Machine Translation (MT) has undergone remarkable advancements, with significant progress attributed to the emergence of Neural Machine Translation (NMT), primarily the groundbreaking Transformer models. This rapid development in MT, which started with a focus on sentence-level translation, has not only bridged communication gaps but also brought MT systems close to delivering human-like performance on various translation tasks. While these advances are significant, focusing mainly on sentence-level modeling and evaluation, they miss the valuable contextual information around each sentence. Contextual information in document-level translation helps resolve language ambiguities and ensures consistency and coherence in the translated text, making the translation more accurate and readable. While considerable efforts have been made to incorporate context into NMT systems, the community has not reached a consensus on the most effective methods and the types of context to be integrated. In this thesis, my primary focus is on understanding document-level systems. Specifically, I explore how these systems incorporate context into their translation processes and investigate the span of context utilised by these systems. I also investigate the terminology translation mechanisms within these systems. Furthermore, with the emergence of modern-day
powerful Large Language Models (LLMs), I examine their capabilities in terminology translation and propose new methodologies to improve terminology translation for these powerful models.
Metadata
Item Type: | Thesis (PhD) |
---|---|
Date of Award: | March 2024 |
Refereed: | No |
Supervisor(s): | Way, Andy, Haque, Rejwanul and Kelleher, John |
Uncontrolled Keywords: | Terminology |
Subjects: | Computer Science > Artificial intelligence Computer Science > Computational linguistics Computer Science > Information technology Computer Science > Machine learning Computer Science > Machine translating Humanities > Language Humanities > Linguistics Humanities > Translating and interpreting |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Initiatives and Centres > ADAPT Research Initiatives and Centres > d-real |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 License. View License |
Funders: | Science Foundation Ireland, Microsoft Research |
ID Code: | 29345 |
Deposited On: | 22 Mar 2024 13:42 by Andrew Way . Last Modified 22 Mar 2024 13:42 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution-Noncommercial 4.0 1MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record