Awad, George M., Snoek, Cees G.M., Smeaton, Alan F. ORCID: 0000-0003-1028-8389 and Quénot, Georges (2016) TRECVid semantic indexing of video: a 6-year retrospective. ITE Transactions on Media Technology and Applications, 4 (3). pp. 187-208.
Abstract
Semantic indexing, or assigning semantic tags to video samples, is a key component for content-based access to video documents and collections. The Semantic Indexing task has been run at TRECVid from 2010 to 2015 with the support of NIST and the Quaero project. As with the previous High-Level Feature detection task which ran from 2002 to 2009, the semantic indexing task aims at evaluating methods and systems for detecting visual, auditory or multi-modal concepts in video shots. In addition to the main semantic indexing task, four secondary tasks were proposed namely the “localization” task, the “concept pair” task, the “no annotation” task, and the “progress” task. It attracted over 40 research teams during its running period.
The task was conducted using a total of 1,400 hours of video data drawn from Internet Archive videos with Creative Commons licenses gathered by NIST. 200 hours of new test data was made available each year plus 200 more as development data in 2010. The number of target concepts to be detected started from 130 in 2010 and was extended to 346 in 2011. Both the increase in the volume of video data and in the number of target concepts favored the development of generic and scalable methods. Over 8 millions shots×concepts direct annotations plus over 20 millions indirect ones were produced by the participants and the Quaero project on a total of 800 hours of development data.
Significant progress was accomplished during the period as this was accurately measured in the context of the progress task but also from some of the participants’ contrast experiments. This paper describes the data, protocol and metrics used for the main and the secondary tasks, the results obtained and the main approaches used by participants.
Metadata
Item Type: | Article (Published) |
---|---|
Refereed: | Yes |
Uncontrolled Keywords: | TRECVid; video; semantic indexing; concept detection; benchmark |
Subjects: | Computer Science > Multimedia systems |
DCU Faculties and Centres: | Research Initiatives and Centres > INSIGHT Centre for Data Analytics DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing |
Publisher: | The Institute of Image Information and Television Engineers |
Official URL: | http://www.ite.or.jp/content/mta/ |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland |
ID Code: | 21279 |
Deposited On: | 15 Jul 2016 14:54 by Alan Smeaton . Last Modified 31 Oct 2018 11:37 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
2MB |
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record