en
Proceedings chapter
Open access
English

English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling

Published inProceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), Editors European Language Resources Association (ELRA)
Presented at Reykjavik, Iceland, 26-31 mai
PublisherNicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis
Publication date2014
Abstract

This paper presents a method for verb phrase (VP) alignment in an English-French parallel corpus and its use for improving statistical machine translation (SMT) of verb tenses. The method starts from automatic word alignment performed with GIZA++, and relies on a POS tagger and a parser, in combination with several heuristics, in order to identify non-contiguous components of VPs, and to label the aligned VPs with their tense and voice on each side. This procedure is applied to the Europarl corpus, leading to the creation of a smaller, high-precision parallel corpus with about 320,000 pairs of finite VPs, which is made publicly available. This resource is used to train a tense predictor for translation from English into French, based on a large number of surface features. Three MT systems are compared: (1) a baseline phrase-based SMT; (2) a tense-aware SMT system using the above predictions within a factored translation model; and (3) a system using oracle predictions from the aligned VPs. For several tenses, such as the French ""imparfait"", the tense-aware SMT system improves significantly over the baseline and is closer to the oracle system

Keywords
  • Multilinguality
  • Machine Translation
  • Tense Translation
Citation (ISO format)
LOAICIGA SANCHEZ, Sharid, MEYER, Thomas, POPESCU-BELIS, Andréi. English-French Verb Phrase Alignment in Europarl for Tense Translation Modeling. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC′14). Reykjavik, Iceland. [s.l.] : Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis, 2014.
Main files (1)
Proceedings chapter (Published version)
accessLevelPublic
Identifiers
  • PID : unige:40625
ISBN978-2-9517408-8-4
905views
271downloads

Technical informations

Creation09/10/2014 2:22:00 PM
First validation09/10/2014 2:22:00 PM
Update time03/14/2023 9:48:43 PM
Status update03/14/2023 9:48:43 PM
Last indexation01/16/2024 11:59:22 AM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack