UNIGE document Chapitre d'actes
previous document  unige:84454  next document
add to browser collection
Title

Discontinuous Verb Phrases in Parsing and Machine Translation of English and German

Authors
Published in Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia - 23-28 May - . 2016
Abstract In this paper, we focus on the verb-particle (V-Prt) split construction in English and German and its difficulty for parsing and Machine Translation (MT). For German, we use an existing test suite of V-Prt split constructions, while for English, we build a new and comparable test suite from raw data. These two data sets are then used to perform an analysis of errors in dependency parsing, word-level alignment and MT, which arise from the discontinuous order in V-Prt split constructions. In the automatic alignments of parallel corpora, most of the particles align to NULL. These mis-alignments and the inability of phrase-based MT system to recover discontinuous phrases result in low quality translations of V-Prt split constructions both in English and German. However, our results show that the V-Prt split phrases are correctly parsed in 90% of cases, suggesting that syntactic-based MT should perform better on these constructions. We evaluate a syntactic-based MT system on German and compare its performance to the phrase-based system.
Keywords Verb-particle splitDiscontinuous phrasesTest suiteParsingMachine translation
Full text
Structures
Research group Laboratoire d'Analyse et de Traitement du Langage (LATL)
Citation
(ISO format)
LOAICIGA SANCHEZ, Sharid, GULORDAVA, Kristina. Discontinuous Verb Phrases in Parsing and Machine Translation of English and German. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia. [s.l.] : [s.n.], 2016. https://archive-ouverte.unige.ch/unige:84454

138 hits

41 downloads

Update

Deposited on : 2016-06-13

Export document
Format :
Citation style :