Proceedings chapter
Open access
English

Rule-based Automatic Post-processing of SMT Output to Reduce Human Post-editing Effort

Presented at London (United Kingdom), 27-28 November 2014
Publication date: 2014
Abstract

To enhance sharing of knowledge across the language barrier, the ACCEPT project focuses on improving machine translation of user-generated content by investigating pre- and post-editing strategies. Within this context, we have developed automatic monolingual post-editing rules for French, aimed at correcting frequent errors automatically. The rules were developed using the AcrolinxIQ technology, which relies on shallow linguistic analysis. In this paper, we present an evaluation of these rules, considering their impact on the readability of MT output and their usefulness for subsequent manual post-editing. Results show that the readability of a high proportion of the data is indeed improved when automatic post-editing rules are applied. Their usefulness is confirmed by the fact that a large share of the edits brought about by the rules are in fact kept by human post-editors. Moreover, results reveal that edits which improve readability are not necessarily the same as those preserved by post-editors in the final output, hence the importance of considering both readability and post-editing effort in the evaluation of post-editing strategies.

Keywords
  • Post-editing
  • Statistical machine translation
  • User-generated content
  • Language communities
Citation (ISO format)
PORRO RODRIGUEZ, Victoria et al. Rule-based Automatic Post-processing of SMT Output to Reduce Human Post-editing Effort. In: Translating and the Computer 36. London (United Kingdom). [s.l.] : [s.n.], 2014.
Main files (1)
Proceedings chapter (Accepted version)
Access level: Public
Identifiers
  • PID : unige:42657
786 views
274 downloads

Technical information

Creation: 12/04/2014 2:12:00 PM
First validation: 12/04/2014 2:12:00 PM
Update time: 03/14/2023 10:21:52 PM
Status update: 03/14/2023 10:21:52 PM
Last indexation: 05/02/2024 3:48:12 PM
All rights reserved by Archive ouverte UNIGE and the University of Geneva