en
Book chapter
English

A Multilingual Integrated Framework for Processing Lexical Collocations

ContributorsSeretan, Violeta
Published inComputational Linguistics - Applications, Editors Springer, p. 87-108
Collection
  • Studies in Computational Intelligence; 458
Publication date2013
Abstract

Lexical collocations are typical combinations of words, such as "heavy rain", "close collaboration", or "to meet a deadline". Pervasive in language, they are a key issue for NLP systems since, as other types of multi-word expressions like idioms, they do not allow for word-by-word processing. We present a multilingual framework that lays emphasis on the accurate acquisition of collocational knowledge from corpora and its exploitation in two large-scale applications (parsing and machine translation), as well as for lexicographic support and for reading assistance. The underlying methodology departs from mainstream approaches by relying on deep parsing to cope with the high morphosyntactic flexibility of collocations. We review theoretical claims and contrast them with practical work, showing our efforts to model collocations in an adequate and comprehensive way. Experimental results show the efficiency of our approach and the impact of collocational knowledge on the performance of parsing and machine translation.

Citation (ISO format)
SERETAN, Violeta. A Multilingual Integrated Framework for Processing Lexical Collocations. In: Computational Linguistics - Applications. [s.l.] : [s.n.], 2013. p. 87–108. (Studies in Computational Intelligence) doi: 10.1007/978-3-642-34399-5_5
Identifiers
ISBN978-3-642-34398-8
621views
0downloads

Technical informations

Creation08/25/2014 5:03:00 PM
First validation08/25/2014 5:03:00 PM
Update time03/14/2023 9:33:51 PM
Status update03/14/2023 9:33:51 PM
Last indexation01/16/2024 11:43:52 AM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack