en
Doctoral thesis
Open access
English

Automatic extraction of causal knowledge from natural language texts

ContributorsGrivaz, Cécile
Defense date2012-05-16
Abstract

This thesis studies the automatic recognition of implicit causal relations between clauses. Previous work, although more accurate than a random baseline, does not achieve sufficient accuracy for practical use. First we show, using annotation experiments, that recognising implicit causation is a subjective task, in spite of its association with several observable features. We then propose an evaluation protocol that takes the subjectivity of the task into account. Previous linguistics work uses the notion of world knowledge. We show that the most likely feature for representing this knowledge -verb pairs- is not predictive of causation in practice. We then show that the current state of the art is not sufficient to allow us to represent nor to acquire the world knowledge that is necessary for this task. We conclude that the field cannot make important progress without first solving the problem of abstract eventuality representation and clustering.

eng
Keywords
  • Natural language processing
  • Computational linguistics
  • Information extraction
  • Computational semantics
  • Computational pragmatics
  • Discoure relation classification
  • Causation
  • Causal relation recognition
Citation (ISO format)
GRIVAZ, Cécile. Automatic extraction of causal knowledge from natural language texts. 2012. doi: 10.13097/archive-ouverte/unige:24660
Main files (1)
Thesis
accessLevelPublic
Identifiers
859views
954downloads

Technical informations

Creation12/17/2012 1:14:00 PM
First validation12/17/2012 1:14:00 PM
Update time03/14/2023 5:47:29 PM
Status update03/14/2023 5:47:29 PM
Last indexation01/29/2024 7:37:21 PM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack