en
Proceedings chapter
Open access
English

Evaluating a Multilingual Pre-trained Model for the Automatic Standard German Captioning of Swiss German TV

Published inProceedings of the 8th edition of the Swiss Text Analytics Conference, Editors Hatem Ghorbel, Maria Sokhn, Mark Cieliebak, Manuela Hürlimann, Emmanuel de Salis, Jonathan Guerne, p. 14-22
Presented at Neuchâtel, June 2023
PublisherNeuchâtel : Association for Computational Linguistics
Publication date2023
Abstract

In Switzerland, two thirds of the population speak Swiss German, a primarily spoken language with no standardised written form. It is widely used on Swiss TV, for example in news reports, interviews and talk shows, and captions are required for people who do not understand this spoken language. This paper focuses on the second part of a cascade approach for the automatic Standard German captioning of spoken Swiss German. We apply a multilingual pre-trained model to translate automatic speech recognition of Swiss German into Standard German suitable for captioning. Results of several evaluations, both human and automatic, show that the system succeeds in improving the content, but is currently not capable of producing entirely correct Standard German.

eng
Keywords
  • Low-resource language
  • Captioning
  • Swiss German
  • Neural machine translation
NoteFunded by the Initiative for Media Innovation based at the EPFL’s Media Center in Lausanne, Switzerland
Research group
Citation (ISO format)
GERLACH, Johanna et al. Evaluating a Multilingual Pre-trained Model for the Automatic Standard German Captioning of Swiss German TV. In: Proceedings of the 8th edition of the Swiss Text Analytics Conference. Neuchâtel. Neuchâtel : Association for Computational Linguistics, 2023. p. 14–22.
Main files (1)
Proceedings chapter (Published version)
Identifiers
  • PID : unige:174988
49views
7downloads

Technical informations

Creation02/20/2024 8:40:03 AM
First validation02/20/2024 4:33:07 PM
Update time02/26/2024 9:06:37 AM
Status update02/26/2024 9:06:37 AM
Last indexation02/26/2024 9:06:41 AM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack