Doctoral thesis
OA Policy
English

Exploiting non-linear probabilistic models in natural language parsing and reranking

Contributors: Titov, Ivan
Defense date: 2008-01-28
Abstract

The thesis considers non-linear probabilistic models for natural language parsing, focusing primarily on the class of models that do not impose strict constraints on the structure of statistical dependencies. The main contribution is the demonstration that such models are well suited to natural language parsing tasks and offer advantages over standard 'linear' methods. We demonstrate this, first, by showing that although exact inference is intractable for the studied class of models, accurate and tractable approximations exist. Second, we show that non-linear representations yield powerful feature-induction methods, simplifying the construction of parsers for new problems and domains and leading to state-of-the-art performance. Finally, we demonstrate that the latent space induced by the model can be exploited in discriminative rerankers, resulting in significant improvements both in standard parsing settings and in domain adaptation.

Keywords
  • Natural language parsing
  • Syntax
  • Dependency parsing
  • Constituent parsing
  • Natural language learning
  • Bayes risk minimization
  • Dynamic Bayesian networks
  • Variational approximations
Citation (ISO format)
TITOV, Ivan. Exploiting non-linear probabilistic models in natural language parsing and reranking. Doctoral Thesis, 2008. doi: 10.13097/archive-ouverte/unige:8495
Main files (1)
Thesis
Access level: Public
Identifiers
729 views
336 downloads

Technical information

Creation: 06/29/2010 2:14:00 PM
First validation: 06/29/2010 2:14:00 PM
Update time: 03/14/2023 3:49:32 PM
Status update: 03/14/2023 3:49:32 PM
Last indexation: 10/29/2024 3:42:18 PM
All rights reserved by Archive ouverte UNIGE and the University of Geneva