Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

"cba to check the spelling" investigating parser performance on discussion forum posts

Foster, Jennifer orcid logoORCID: 0000-0002-7789-4853 (2010) "cba to check the spelling" investigating parser performance on discussion forum posts. In: NAACL HLT 2010 - Human Language Technologies: 2010 Annual Conference of the North American Chapter of the ACL, 2-4 June 2010, Los Angeles, CA, USA.

Abstract
We evaluate the Berkeley parser on text from an online discussion forum. We evaluate the parser output with and without gold tokens and spellings (using Sparseval and Parseval), and we compile a list of problematic phenomena for this domain. The Parseval f-score for a small development set is 77.56. This increases to 80.27 when we apply a set of simple transformations to the input sentences and to the Wall Street Journal (WSJ) training sections.
Metadata
Item Type:Conference or Workshop Item (Paper)
Event Type:Conference
Refereed:Yes
Subjects:Computer Science > Machine translating
DCU Faculties and Centres:Research Initiatives and Centres > National Centre for Language Technology (NCLT)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Publisher:Association for Computational Linguistics
Official URL:http://www.aclweb.org/anthology/N/N10/
Copyright Information:© 2010 Association for Computational Linguistics
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:15984
Deposited On:08 Dec 2010 14:14 by Shane Harper . Last Modified 10 Oct 2018 15:11
Documents

Full text available as:

[thumbnail of cba_to_check_the_spelling._Investigating_Parser_Performance_on_Discussion_Forum_Posts.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
36kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record