Proceedings chapter
English

Post-editors as Gatekeepers of Lexical and Syntactic Diversity: Comparative Analysis of Human Translation and Post-editing in Professional Settings

Presented at: Sheffield, June 24-27
Published in: Xingyi Song, Edward Gow-Smith, Carolina Scarton, Vera Cabarrão, Konstantinos Chatzitheodorou, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Rachel Bawden, Víctor M. Sánchez-Cartagena, Barry Haddow, Diptesh Kanojia, Mary Nurminen, Helena Moniz, Mikel Forcada, Chris Oakley (Ed.), Proceedings of the 25th Annual Conference of the European Association for Machine Translation : Volume 1: Research And Implementations & Case Studies, p. 387-395
Publisher: European Association for Machine Translation
Publication date: 2024
Abstract

This paper presents a comparative analysis between human translation (HT) and post-edited machine translation (PEMT) from a lexical and syntactic perspective to verify whether the tendency of neural machine translation (NMT) systems to produce lexically and syntactically poorer translations shines through after post-editing (PE). The analysis focuses on three datasets collected in professional contexts containing translations from English into French and German into French. Through a comparison of word translation entropy (HTRa) scores, we observe a lower degree of lexical diversity in PEMT compared to HT. Additionally, metrics of syntactic equivalence indicate that PEMT is more likely to mirror the syntactic structure of the source text in contrast to HT. By incorporating raw machine translation (MT) output into our analysis, we underline the important role post-editors play in adding lexical and syntactic diversity to MT output. Our findings provide relevant input for MT users and decision-makers in language services as well as for MT and PE trainers and advisers.
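
As an illustration of the kind of measure the abstract refers to, the sketch below computes a Shannon entropy over the translation choices observed for a single source word. It is a minimal, generic example and not the authors' HTRa implementation: the function name `word_translation_entropy` and the toy French renderings are assumptions made here for illustration, and a real analysis would derive the translation choices from word-aligned corpora.

```python
from collections import Counter
from math import log2


def word_translation_entropy(translations):
    """Return the Shannon entropy (in bits) of the translation choices
    observed for one source word.

    `translations` is a list with one target-side rendering per
    occurrence of the source word (hypothetical input format).
    """
    counts = Counter(translations)
    total = sum(counts.values())
    # Sum p * log2(1/p) over each distinct translation choice.
    return sum((c / total) * log2(total / c) for c in counts.values())


# Toy data (invented for this sketch): four renderings of the same word.
varied = ["maison", "maison", "domicile", "foyer"]
uniform = ["maison", "maison", "maison", "maison"]

print(word_translation_entropy(varied))   # higher: several competing choices
print(word_translation_entropy(uniform))  # zero: a single choice throughout
```

Under this reading, a set of translations that always renders a word the same way scores zero, while one that spreads its choices over several alternatives scores higher, which is the sense in which lower entropy points to lower lexical diversity.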

Keywords
  • Machine translation
  • Post-editing
  • Lexical variety
  • Lexical richness
  • Word translation entropy
  • HTRa
  • ASTrED
  • Lexical diversity
  • Translation
  • Syntactic variation
Citation (ISO format)
VOLKART, Lise, BOUILLON, Pierrette. Post-editors as Gatekeepers of Lexical and Syntactic Diversity: Comparative Analysis of Human Translation and Post-editing in Professional Settings. In: Proceedings of the 25th Annual Conference of the European Association for Machine Translation : Volume 1: Research And Implementations & Case Studies. Xingyi Song, Edward Gow-Smith, Carolina Scarton, Vera Cabarrão, Konstantinos Chatzitheodorou, Patrick Cadwell, Ekaterina Lapshinova-Koltunski, Rachel Bawden, Víctor M. Sánchez-Cartagena, Barry Haddow, Diptesh Kanojia, Mary Nurminen, Helena Moniz, Mikel Forcada, Chris Oakley (Ed.). Sheffield. [s.l.] : European Association for Machine Translation, 2024. p. 387–395.
Main files (1)
Proceedings chapter (Published version)
Identifiers
  • PID : unige:179439
  • ISBN : 978-1-0686907-0-9

Technical information

Creation: 22/08/2024 12:46:00
First validation: 27/08/2024 07:38:36
Update time: 27/08/2024 07:38:36
Status update: 27/08/2024 07:38:36
Last indexation: 17/12/2024 15:38:53
All rights reserved by Archive ouverte UNIGE and the University of Geneva