UNIGE document Chapitre d'actes
previous document  unige:47729  next document
add to browser collection
Title

Interactive Video Retrieval based on Multimodal Dissimilarity Representation

Authors
Published in Proceedings of the Machine Learning Techniques for Processing Multimedia Content- ICML workshop, MLMM'05. Bonn (Germany). 2005
Abstract We present an approach to learn user semantic queries from dissimilarity representations of video audio-visual content. When dealing with large corpora of videos documents, using a feature-based representation calls for the online computation of distances between all documents and the query. Hence, a dissimilarity representation may be preferred because its offline computation speeds up the retrieval process. We show how distances related to visual and audio video features can directly be used to learn complex concepts from a set of positive and negative examples provided by the user. Based on the idea of dissimilarity spaces, we derive a lowdimensional multimodal representation space where an on-line and real-time classification is performed to learn user queries. The classification consists in maximizing a non-linear Fisher criterion to separate positive from negative examples. The evaluation, performed on the complete annotated TRECVid corpus, shows that our technique enables us to improve the precision of retrieval results.
Keywords multimodal dissimilarity representationinteractive video retrieval
Full text
Structures
Research groups Viper group
Computer Vision and Multimedia Laboratory
Citation
(ISO format)
BRUNO, Eric, MOENNE-LOCCOZ, Nicolas, MARCHAND-MAILLET, Stéphane. Interactive Video Retrieval based on Multimodal Dissimilarity Representation. In: Proceedings of the Machine Learning Techniques for Processing Multimedia Content- ICML workshop, MLMM'05. Bonn (Germany). [s.l.] : [s.n.], 2005. https://archive-ouverte.unige.ch/unige:47729

77 hits

30 downloads

Update

Deposited on : 2015-03-06

Export document
Format :
Citation style :