Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Fast gated neural domain adaptation: language model as a case study

Zhang, Jian, Wu, Xiaofeng, Way, Andy orcid logoORCID: 0000-0001-5736-5930 and Liu, Qun orcid logoORCID: 0000-0002-7000-1792 (2017) Fast gated neural domain adaptation: language model as a case study. In: Proceedings of FETLT 2016: Future and Emerging Trends in Language Technologies, Machine Learning and Big Data, 30 Nov- 2 Dec 2016, Seville, Spain.

Abstract
Neural network training has been shown to be advantageous in many natural language processing applications, such as language modelling or machine translation. In this paper, we describe in detail a novel domain adaptation mechanism in neural network training. Instead of learning and adapting the neural network on millions of training sentences – which can be very timeconsuming or even infeasible in some cases – we design a domain adaptation gating mechanism which can be used in recurrent neural networks and quickly learn the out-of-domain knowledge directly from the word vector representations with little speed overhead. In our experiments, we use the recurrent neural network language model (LM) as a case study. We show that the neural LM perplexity can be reduced by 7.395 and 12.011 using the proposed domain adaptation mechanism on the Penn Treebank and News data, respectively. Furthermore, we show that using the domain-adapted neural LM to re-rank the statistical machine translation n-best list on the French-to-English language pair can significantly improve translation quality
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Subjects:Computer Science > Machine learning
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Initiatives and Centres > ADAPT
Published in: Proceedings of FETLT 2016: Future and Emerging Trends in Language Technologies, Machine Learning and Big Data. . Association for Computational Linguistics.
Publisher:Association for Computational Linguistics
Official URL:https://aclweb.org/anthology/C16-1131
Copyright Information:© 2016 ACL
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:ADAPT Centre for Digital Content Technology (www.adaptcentre.ie) at Dublin City University is funded under the Science Foundation Ireland Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.
ID Code:23233
Deposited On:02 May 2019 12:28 by Thomas Murtagh . Last Modified 27 Apr 2023 11:28
Documents

Full text available as:

[thumbnail of Fast_Gated_Neural_Domain_Adaptation.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
237kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record