Doctoral thesis
OA Policy
English

Collocation extraction based on syntactic parsing

ContributorsSeretan, Violeta
DirectorsWehrli, Eric
Defense date2008-06-06
Abstract

Collocations (typical word associations like "to meet a condition", "to believe firmly", "highly controversial", "a deep concern") are pervasive in language. Due to their encoding idiomaticity, collocations are of paramount importance for text production tasks, for Foreign Language Learning as well as for Natural Language Processing applications such as machine translation. This thesis tackles the problem of automatic acquisition of collocations from text corpora, and proposes a methodological framework for their identification based on syntactic criteria. It is shown that (1) the results obtained are more reliable than those of standard methods based on linear proximity constraints, and (2) the syntax-based approach enables a more advanced treatment of these expressions, which is now largely absent in the related computational work: extraction of n-ary collocations (n>2), induction of syntactic patterns for collocations, Web-based extraction of collocations, and translation of collocations.

Keywords
  • Linguistique informatique
  • Collocations
  • Analyse syntaxique
  • Acquisition lexicale
Citation (ISO format)
SERETAN, Violeta. Collocation extraction based on syntactic parsing. Doctoral Thesis, 2008. doi: 10.13097/archive-ouverte/unige:78
Main files (1)
Thesis
accessLevelPublic
Identifiers
1090views
1130downloads

Technical informations

Creation29/10/2008 00:00:00
First validation29/10/2008 00:00:00
Update time14/03/2023 14:56:48
Status update14/03/2023 14:56:48
Last indexation13/05/2025 15:15:49
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack