Proceedings chapter
Open access

Discontinuous Verb Phrases in Parsing and Machine Translation of English and German

Presented at Portorož, Slovenia, 23-28 May
Publication date2016

In this paper, we focus on the verb-particle (V-Prt) split construction in English and German and its difficulty for parsing and Machine Translation (MT). For German, we use an existing test suite of V-Prt split constructions, while for English, we build a new and comparable test suite from raw data. These two data sets are then used to perform an analysis of errors in dependency parsing, word-level alignment and MT, which arise from the discontinuous order in V-Prt split constructions. In the automatic alignments of parallel corpora, most of the particles align to NULL. These mis-alignments and the inability of phrase-based MT system to recover discontinuous phrases result in low quality translations of V-Prt split constructions both in English and German. However, our results show that the V-Prt split phrases are correctly parsed in 90% of cases, suggesting that syntactic-based MT should perform better on these constructions. We evaluate a syntactic-based MT system on German and compare its performance to the phrase-based system.

  • Verb-particle split
  • Discontinuous phrases
  • Test suite
  • Parsing
  • Machine translation
Citation (ISO format)
LOAICIGA SANCHEZ, Sharid, GULORDAVA, Kristina. Discontinuous Verb Phrases in Parsing and Machine Translation of English and German. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia. [s.l.] : [s.n.], 2016.
Main files (1)
Proceedings chapter (Accepted version)
  • PID : unige:84454

Technical informations

Creation05/31/2016 3:58:00 PM
First validation05/31/2016 3:58:00 PM
Update time03/15/2023 12:27:09 AM
Status update03/15/2023 12:27:08 AM
Last indexation01/16/2024 9:03:06 PM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack