Towards Automatic Identification of Discourse Markers in Dialogs: The Case of 'Like'

Zufferey, Sandrine; Popescu-Belis, Andréi

Proceedings chapter

English

Contributors: Zufferey, Sandrine; Popescu-Belis, Andréi

Presented at: Cambridge (Mass., USA)

Published in: Michael Strube and Candy Sidner (Eds.), SIGdial 2004 (5th SIGdial Workshop on Discourse and Dialogue), p. 63-71

Publisher: ACL - Association for Computational Linguistics

Publication date: 2004

Abstract

This article discusses the detection of discourse markers (DMs) in dialog transcriptions, by human annotators and by automated means. After a theoretical discussion of the definition of DMs and their relevance to natural language processing, we focus on the role of 'like' as a DM. Results from experiments with human annotators show that detection of DMs is a difficult but reliable task, which requires prosodic information from soundtracks. Then, several types of features are defined for automatic disambiguation of 'like': collocations, part-of-speech tags and duration-based features. Decision-tree learning shows that for 'like', nearly 70% precision can be reached, with nearly 100% recall, mainly using collocation filters. Similar results hold for 'well', with about 91% precision at 100% recall.
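To make the idea of a collocation filter and its precision/recall trade-off concrete, here is a minimal Python sketch. It is not the authors' implementation: the exclusion list `EXCLUDED_PREVIOUS`, the helper names, and the five toy utterances with their labels are all hypothetical, chosen only to illustrate how a filter that rules out obvious non-DM contexts can keep recall high while leaving some false positives.

```python
from typing import List, Tuple

# Hypothetical exclusion collocations: if one of these words immediately
# precedes "like", the occurrence is assumed NOT to be a discourse marker.
# Illustrative only; this is not the list used in the paper.
EXCLUDED_PREVIOUS = {"would", "i", "you", "we", "they", "to",
                     "look", "looks", "feel", "feels", "sounds", "seems"}


def is_dm_candidate(tokens: List[str], index: int) -> bool:
    """Return True if tokens[index] is 'like' and survives the filter."""
    if tokens[index].lower() != "like":
        return False
    if index == 0:
        return True  # utterance-initial "like" has no excluding left context
    return tokens[index - 1].lower() not in EXCLUDED_PREVIOUS


def precision_recall(predicted: List[bool], gold: List[bool]) -> Tuple[float, float]:
    """Compute precision and recall for the positive (DM) class."""
    tp = sum(p and g for p, g in zip(predicted, gold))
    fp = sum(p and not g for p, g in zip(predicted, gold))
    fn = sum(g and not p for p, g in zip(predicted, gold))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy data (invented for this sketch): tokens, position of "like",
# and a made-up gold label saying whether it is a discourse marker.
EXAMPLES = [
    ("i would like a coffee".split(), 2, False),
    ("it was like really cold".split(), 2, True),
    ("he looks like his brother".split(), 2, False),
    ("she sings like a bird".split(), 2, False),
    ("and like we left early".split(), 1, True),
]

predicted = [is_dm_candidate(tokens, i) for tokens, i, _ in EXAMPLES]
gold = [label for _, _, label in EXAMPLES]
precision, recall = precision_recall(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

On this toy data the filter misses no discourse marker but lets through the prepositional use in "she sings like a bird", so recall is perfect and precision is lower. The figures come from the invented examples only and say nothing about the results reported in the paper.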

Affiliation entities

Faculté de traduction et d'interprétation / Département de traitement informatique multilingue

Research groups

TIM/ISSCO

Citation (ISO format)

ZUFFEREY, Sandrine, POPESCU-BELIS, Andréi. Towards Automatic Identification of Discourse Markers in Dialogs: The Case of “Like”. In: SIGdial 2004 (5th SIGdial Workshop on Discourse and Dialogue). Michael Strube and Candy Sidner (Ed.). Cambridge (Mass., USA). [s.l.] : ACL - Association for Computational Linguistics, 2004. p. 63–71.

Proceedings chapter (Published version)

Identifiers

PID : unige:2273

866 views

445 downloads

Creation: 29/07/2009 15:30:00

First validation: 29/07/2009 15:30:00

Update: 14/03/2023 15:09:46

Status update: 14/03/2023 15:09:46

Last indexation: 29/10/2024 11:49:33

Archive ouverte UNIGE
