Learning to compress and search visual data in large-scale systems

Ferdowsi, Sohrab

doi:10.13097/archive-ouverte/unige:114990

Doctoral thesis

English

Contributors: Ferdowsi, Sohrab

Directors: Voloshynovskyy, Svyatoslav

Defense date: 2018-12-11

Abstract

The problem of high-dimensional and large-scale representation of visual data is addressed from an unsupervised learning perspective, with an emphasis on discrete representations. The algorithmic infrastructure is developed based on the synthesis and analysis prior models, whose rate-distortion properties are carefully optimized. These models are then extended to multi-layer architectures, namely the RRQ and the ML-STC frameworks, where the latter is further developed into a powerful deep neural network architecture with fast and sample-efficient training. For these frameworks, three important applications are considered. First, large-scale similarity search in retrieval systems is addressed, where a double-stage solution is proposed that leads to faster query times and smaller storage. Second, the problem of learned image compression is targeted, where the proposed models can capture more redundancies within images than conventional compression codecs. Finally, the proposed algorithms are used to solve ill-posed inverse problems, with promising results in image denoising and compressive sensing.

Keywords

Unsupervised learning
Representation learning
Learned compression
Similarity search
Approximate nearest neighbor search
Rate-distortion theory
Ill-posed inverse problems
Image processing

Affiliation entities

Research groups

Stochastic Information Processing Group

Citation (ISO format)

FERDOWSI, Sohrab. Learning to compress and search visual data in large-scale systems. Doctoral Thesis, 2018. doi: 10.13097/archive-ouverte/unige:114990

Thesis

Identifiers

PID: unige:114990
DOI: 10.13097/archive-ouverte/unige:114990
URN: urn:nbn:ch:unige-1149905
Thesis number: Sc. 5295

536 views

219 downloads

Creation: 12/03/2019 15:14:00

First validation: 12/03/2019 15:14:00

Update time: 08/02/2024 13:40:24

Status update: 08/02/2024 13:40:24

Last indexation: 31/10/2024 13:56:08

Archive ouverte UNIGE
