Master
OA Policy
English

Automated Image Captioning: Exploring the Potential of Microsoft Computer Vision for English and Spanish

DirectorsDe Wilde, Max
Master program titleMaîtrise universitaire en traitement informatique multilingue
Defense date2019
Abstract

With the rise of deep learning, reflected by the creation of architectures such as Convolutional Neural Networks (CNNs), researchers are becoming increasingly interested in the utility and relevance of machines that can properly generate information about images in different languages. This thesis focuses on Microsoft Azure's Computer Vision API, specifically its functionality for image description in both English and Spanish applied to a corpus of flora pictures. To assess the accuracy of the API's captions, a combination of human and machine evaluation was used. Although the initial hypothesis was that the CNNs of the API were robust enough to generate pertinent captions in both languages, the evaluations seemed to indicate that the technology is not yet mature enough to accomplish this task. This exploratory study therefore serves as a reflection on the use of automated image captioning for multilingual purposes and of the potential and limits of this technology.

Keywords
  • Automatic Image Captioning
  • Deep Learning
  • Convolutional Neural Networks
  • CNNs
  • Computer Vision
  • Microsoft Azure
  • Flora
  • Machine Learning
  • Machine-generated captions
Citation (ISO format)
MARTINEZ GUTIERREZ, Maria Fernanda. Automated Image Captioning: Exploring the Potential of Microsoft Computer Vision for English and Spanish. Master, 2019.
Main files (1)
Master thesis
accessLevelPublic
Identifiers
  • PID : unige:132748
378views
799downloads

Technical informations

Creation03/23/2020 10:48:00 AM
First validation03/23/2020 10:48:00 AM
Update time03/15/2023 9:18:36 PM
Status update03/15/2023 9:18:35 PM
Last indexation10/31/2024 6:04:49 PM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack