UNIGE document Doctoral Thesis
previous document  unige:24660  next document
add to browser collection

Automatic extraction of causal knowledge from natural language texts

Defense Thèse de doctorat : Univ. Genève, 2012 - L. 747 - 2012/05/16
Abstract This thesis studies the automatic recognition of implicit causal relations between clauses. Previous work, although more accurate than a random baseline, does not achieve sufficient accuracy for practical use. First we show, using annotation experiments, that recognising implicit causation is a subjective task, in spite of its association with several observable features. We then propose an evaluation protocol that takes the subjectivity of the task into account. Previous linguistics work uses the notion of world knowledge. We show that the most likely feature for representing this knowledge -verb pairs- is not predictive of causation in practice. We then show that the current state of the art is not sufficient to allow us to represent nor to acquire the world knowledge that is necessary for this task. We conclude that the field cannot make important progress without first solving the problem of abstract eventuality representation and clustering.
Keywords Natural language processingComputational linguisticsInformation extractionComputational semanticsComputational pragmaticsDiscoure relation classificationCausationCausal relation recognition
URN: urn:nbn:ch:unige-246603
Full text
Thesis (1.6 MB) - public document Free access
(ISO format)
GRIVAZ, Cécile. Automatic extraction of causal knowledge from natural language texts. Université de Genève. Thèse, 2012. https://archive-ouverte.unige.ch/unige:24660

517 hits



Deposited on : 2012-12-17

Export document
Format :
Citation style :