DORAS | DCU Research Repository

DCU at the NTCIR-9 spokendoc passage retrieval task

Eskevich, Maria (ORCID: 0000-0002-1242-0753) and Jones, Gareth J.F. (ORCID: 0000-0003-2923-8365) (2011) DCU at the NTCIR-9 spokendoc passage retrieval task. In: The 9th NTCIR Workshop Meeting, 6-9 Dec 2011, Tokyo, Japan. ISBN 978-4-86049-056-0

Abstract
We describe our runs and the results obtained for the "IR for Spoken Documents (SpokenDoc) Task" at NTCIR-9. Our participation in this task focused on investigating the use of segmentation methods to divide the manual and ASR transcripts into topically coherent segments. The underlying assumption of this approach is that these segments will capture passages in the transcript that are relevant to the query. Our experiments investigate the use of two lexical-coherence-based segmentation algorithms (TextTiling and C99). These are run on the provided manual and ASR transcripts, and on the ASR transcript with stop words removed. Evaluation of the results shows that TextTiling consistently performs better than C99, both in segmenting the data into retrieval units, as evaluated using the centre located relevant information metric, and in having higher content precision in each automatically created segment.
Metadata
Item Type: Conference or Workshop Item (Paper)
Event Type: Workshop
Refereed: Yes
Uncontrolled Keywords: Speech search; passage retrieval; automatic segmentation
Subjects: Computer Science > Multimedia systems; Computer Science > Information retrieval
DCU Faculties and Centres: Research Initiatives and Centres > Centre for Digital Video Processing (CDVP); Research Initiatives and Centres > Centre for Next Generation Localisation (CNGL); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing
Published in: Proceedings of the NTCIR 9 Workshop. National Institute of Informatics. ISBN 978-4-86049-056-0
Publisher: National Institute of Informatics
Official URL: http://research.nii.ac.jp/ntcir/workshop/OnlinePro...
Use License: This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders: Science Foundation Ireland
ID Code: 16894
Deposited On: 16 Apr 2012 10:44 by Gareth Jones. Last Modified 10 Oct 2018 09:20
Documents

Full text available as:

PDF (DCU submission to NTCIR 9 SpokenDoc task), 332kB