Proceedings chapter
Open access

Semantic segmentation of video collections using boosted random fields

Presented at Edinburgh (UK)
Publication date2005

Multimedia documentalists need effective tools to organize and search into large video collections. Semantic video structuring consists in automatically extracting from the raw data the inner structure of a video collection. This high-level information if automatically extracted would provide important meta information enabling the development of an important new range of applications to browse and search video collections. In this paper, we present the feature extraction process providing a compact description of the audio, visual and text modalities. To reach the semantic level required, a contextual model is then proposed: it is a complex model which takes into account not only the link between features and labels but also the compatibility between labels associated with different modalities for improved consistency of the results. Boosted Random Fields are used to learn these relationships. It provides an iterative optimization framework to learn the model parameters and uses the abilities of boosting to reduce classification errors, to avoid over-fitting and to achieve the task of feature selection. We experiment using the TRECvid corpus and show results that validate the approach over existing studies.

Citation (ISO format)
JANVIER, Bruno et al. Semantic segmentation of video collections using boosted random fields. In: Proceedings of the 2005 second Workshop on Machine Learning and Multimodal Interaction, MLMI′05. Edinburgh (UK). [s.l.] : [s.n.], 2005.
Main files (1)
Proceedings chapter (Accepted version)
  • PID : unige:47682

Technical informations

Creation03/06/2015 5:12:07 PM
First validation03/06/2015 5:12:07 PM
Update time03/14/2023 10:58:36 PM
Status update03/14/2023 10:58:36 PM
Last indexation01/16/2024 5:08:21 PM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack