Book chapter
English

A Multilingual Integrated Framework for Processing Lexical Collocations

Contributors: Seretan, Violeta
Published in: Springer (Ed.), Computational Linguistics - Applications, p. 87-108
Collection
  • Studies in Computational Intelligence; 458
Publication date: 2013
Abstract

Lexical collocations are typical combinations of words, such as "heavy rain", "close collaboration", or "to meet a deadline". Pervasive in language, they are a key issue for NLP systems because, like other types of multi-word expressions such as idioms, they cannot be processed word by word. We present a multilingual framework that emphasizes the accurate acquisition of collocational knowledge from corpora and its exploitation in two large-scale applications (parsing and machine translation), as well as in lexicographic support and reading assistance. The underlying methodology departs from mainstream approaches by relying on deep parsing to cope with the high morphosyntactic flexibility of collocations. We review theoretical claims and contrast them with practical work, showing our efforts to model collocations in an adequate and comprehensive way. Experimental results show the efficiency of our approach and the impact of collocational knowledge on the performance of parsing and machine translation.

Citation (ISO format)
SERETAN, Violeta. A Multilingual Integrated Framework for Processing Lexical Collocations. In: Computational Linguistics - Applications. Springer (Ed.). [s.l.] : [s.n.], 2013. p. 87–108. (Studies in Computational Intelligence) doi: 10.1007/978-3-642-34399-5_5
Identifiers
ISBN: 978-3-642-34398-8

Technical information

Creation: 25/08/2014 17:03:00
First validation: 25/08/2014 17:03:00
Update time: 14/03/2023 21:33:51
Status update: 14/03/2023 21:33:51
Last indexation: 30/10/2024 19:53:14
All rights reserved by Archive ouverte UNIGE and the University of Geneva