Lohar, Pintu ORCID: 0000-0002-5328-1585, Afli, Haithem ORCID: 0000-0002-7449-4707 and Way, Andy ORCID: 0000-0001-5736-5930 (2018) Balancing translation quality and sentiment preservation. In: AMTA 2018, 17-21 Mar 2018, Boston, MA. USA.
Abstract
Social media platforms such as Twitter and Facebook are hugely popular websites through
which Internet users can communicate and spread information worldwide. On Twitter, messages (tweets) are generated by users from all over the world in many different languages.
Tweets about different events almost always encode some degree of sentiment. As is often the
case in the field of language processing, sentiment analysis tools exist primarily in English, so
if we want to understand the sentiment of the original tweets, we are forced to translate them
from the source language into English and pushing the English translations through a sentiment
analysis tool.
However, Lohar et al. (2017) demonstrated that using freely available translation tools often
caused the sentiment encoded in the original tweet to be altered. As a consequence, they built
a series of sentiment-specific translation engines and pushed tweets containing either positive,
neutral or negative sentiment through the appropriate engine to improve sentiment preservation
in the target language. For certain tasks, maintaining sentiment polarity in the target language
during the translation process is arguably more important than the absolute translation quality
obtained. In the work of Lohar et al. (2017), a small drop off in translation quality per se was
deemed tolerable. In this work, we focus on maintaining the level of sentiment preservation
while trying to improve translation quality still further. We propose a nearest sentiment classcombination method to extend the existing sentiment-specific translation systems by adding
training data from the nearest-sentiment class. Our experimental results on German-to-English
reveal that our approach is capable of achieving a proper balance between translation quality
and sentiment preservation.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Conference |
Refereed: | Yes |
Uncontrolled Keywords: | social media; sentiment polarity; German-to-English |
Subjects: | Computer Science > Machine translating |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Initiatives and Centres > ADAPT |
Published in: | Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (Volume 1: Research Papers). . Association for Machine Translation in the Americas. |
Publisher: | Association for Machine Translation in the Americas |
Official URL: | https://www.aclweb.org/anthology/W18-1808 |
Copyright Information: | © 2018 |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | ADAPT Centre for Digital Content Technology at Dublin City University is funded under the Science Foundation Ireland Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund. |
ID Code: | 23206 |
Deposited On: | 25 Apr 2019 11:42 by Thomas Murtagh . Last Modified 05 May 2023 16:31 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
221kB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record