Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Demographic word embeddings for racism detection on Twitter

Hasanuzzaman, Mohammed orcid logoORCID: 0000-0003-1838-0091, Dias, Gaël and Way, Andy orcid logoORCID: 0000-0001-5736-5930 (2017) Demographic word embeddings for racism detection on Twitter. In: 8th International Joint Conference on Natural Language Processing, 27 Nov- 1 Dec 2017, Taipei, Taiwan.

Abstract
Most social media platforms grant users freedom of speech by allowing them to freely express their thoughts, beliefs, and opinions. Although this represents incredible and unique communication opportunities, it also presents important challenges. Online racism is such an example. In this study, we present a supervised learning strategy to detect racist language on Twitter based on word embedding that incorporate demographic (Age, Gender, and Location) information. Our methodology achieves reasonable classification accuracy over a gold standard dataset (F1=76.3%) and significantly improves over the classification performance of demographic-agnostic models.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Research Initiatives and Centres > ADAPT
Published in: Proceedings of the The 8th International Joint Conference on Natural Language Processing. . Asian Federation of Natural Language Processing.
Publisher:Asian Federation of Natural Language Processing
Official URL:https://www.aclweb.org/anthology/I17-1093
Copyright Information:© 2017 AFNLP
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
Funders:ADAPT Centre for Digital Content Technology is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.
ID Code:23334
Deposited On:21 May 2019 15:44 by Thomas Murtagh . Last Modified 04 Jan 2021 16:57
Documents

Full text available as:

[thumbnail of Demographic_Word_Embeddings_for_Racism_Detection_on_Twitter[1].pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
205kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record