UNIGE document Chapitre d'actes
previous document  unige:30952  next document
add to browser collection
Title

Combining pre-editing and post-editing to improve SMT of user-generated content

Authors
Lehmann, Sabine
Published in O’Brien, S., Simard, M. & Specia, L. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice. Nice (France) - 2 Sept. 2013 - . 2013, p. 45-53
Abstract The poor quality of user-generated content (UGC) found in forums hinders both readability and machine-translatability. To improve these two aspects, we have developed human- and machine-oriented pre-editing rules, which correct or reformulate this content. In this paper we pre-sent the results of a study which investigates whether pre-editing rules that improve the quality of statistical machine translation (SMT) output also have a positive impact on post-editing productivity. For this study, pre-editing rules were applied to a set of French sentences extracted from a technical forum. After SMT, the post-editing temporal effort and final quality are compared for translations of the raw source and its pre-edited version. Results obtained suggest that pre-editing speeds up post-editing and that the combination of the two processes is worthy of further investigation.
Keywords User-generated contentPre-editingPost-editingStatistical machine translation
Full text
Structures
Research group TIM/ISSCO
Citation
(ISO format)
GERLACH, Johanna et al. Combining pre-editing and post-editing to improve SMT of user-generated content. In: O’Brien, S., Simard, M. & Specia, L. (Ed.). Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice. Nice (France). [s.l.] : [s.n.], 2013. p. 45-53. https://archive-ouverte.unige.ch/unige:30952

552 hits

543 downloads

Update

Deposited on : 2013-11-05

Export document
Format :
Citation style :