Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

An investigation of decompounding for cross-language patent search

Leveling, Johannes orcid logoORCID: 0000-0003-0603-4191, Magdy, Walid and Jones, Gareth J.F. orcid logoORCID: 0000-0003-2923-8365 (2011) An investigation of decompounding for cross-language patent search. In: The 34th Annual ACM SIGIR Conference, 24-28 Jul 2011, Beijing, China.

Abstract
Decompounding has been found to improve information retrieval (IR) effectiveness in general domains for languages such as German or Dutch. We investigate if cross-language patent retrieval can profit from decompounding. This poses two challenges: i) There may be few resources such as parallel corpora available for training an machine translation system for a compounding language. ii) Patents have a specific writing style and vocabulary (“patentese”), which may affect the performance of decompounding and translation methods. Experiments on data from the CLEF-IP 2010 task show that decompounding patents for translation can overcome out-of-vocabulary problems (OOV) and that decompounding improves IR performance significantly for small training corpora.
Metadata
Item Type:Conference or Workshop Item (Poster)
Event Type:Conference
Refereed:Yes
Uncontrolled Keywords:Experimentation; Performance; Measurement; Patent Retrieval; Decompounding
Subjects:Computer Science > Information retrieval
DCU Faculties and Centres:Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL)
DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:16447
Deposited On:25 Jul 2011 13:10 by Shane Harper . Last Modified 25 Oct 2018 10:18
Documents

Full text available as:

[thumbnail of An_Investigation_of_Decompounding_for_Cross-Language_Patent_Search.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
102kB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record