Proceedings chapter
OA Policy
English

Two Approaches to Correcting Homophone Confusions in a Hybrid Machine Translation System

Presented atSofia (Bulgaria), 8 Aug. 2013
PublisherAssociation for Computational Linguistics
Publication date2013
Abstract

In the context of a hybrid French-to-English SMT system for translating online forum posts, we present two methods for addressing the common problem of homophone confusions in colloquial written language. The first is based on hand-coded rules; the second on weighted graphs derived from a large-scale pronunciation resource, with weights trained from a small bicorpus of domain language. With automatic evaluation, the weighted graph method yields an improvement of about +0.63 BLEU points, while the rulebased method scores about the same as the baseline. On contrastive manual evaluation, both methods give highly significant improvements (p < 0.0001) and score about equally when compared against each other.

Research groups
Citation (ISO format)
BOUILLON, Pierrette et al. Two Approaches to Correcting Homophone Confusions in a Hybrid Machine Translation System. In: Proceedings of the Second Workshop on Hybrid Approaches to Translation. Sofia (Bulgaria). [s.l.] : Association for Computational Linguistics, 2013. p. 109–116.
Main files (1)
Proceedings chapter (Published version)
accessLevelPublic
Identifiers
  • PID : unige:30954
720views
257downloads

Technical informations

Creation05/11/2013 12:50:00
First validation05/11/2013 12:50:00
Update time14/03/2023 21:36:03
Status update14/03/2023 21:36:02
Last indexation17/12/2024 16:35:28
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack