Proceedings chapter
English

An Analysis of Quantitative Aspects in the Evaluation of Thematic Segmentation Algorithms

Presented at: Sydney (Australia), 15-16 July
Published in: ACL - Association for Computational Linguistics (Ed.), Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue, p. 144-151
Publication date: 2006
Abstract

We consider the task of linear thematic segmentation of text documents using features based on word distributions in the text. A typical and often implicit assumption in previous studies of this task is that a document has just one topic. Many algorithms have therefore been tested, with encouraging results, on artificial data sets generated by putting together parts of different documents. We show that evaluation on synthetic data is potentially misleading and fails to reflect performance on real data accurately. Moreover, we provide a critical review of the evaluation metrics in the literature and propose an improved evaluation metric.
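
The artificial data sets mentioned in the abstract, built by putting together parts of different documents, can be illustrated with a minimal sketch. It is not the authors' procedure: the function name `make_synthetic_document`, its parameters, and the toy corpus are hypothetical, and taking the leading sentences of each source document is an assumption made for simplicity.

```python
import random


def make_synthetic_document(documents, n_segments, min_len, max_len, seed=0):
    """Concatenate the leading sentences of randomly chosen documents.

    documents: list of documents, each a list of sentences (strings).
    n_segments must not exceed len(documents), because each source
    document is used at most once.
    Returns (sentences, boundaries), where boundaries holds the index of
    the first sentence of every segment except the first one.
    """
    rng = random.Random(seed)
    sentences, boundaries = [], []
    for source in rng.sample(documents, n_segments):
        # Segment length is drawn at random, capped by the source length.
        length = min(rng.randint(min_len, max_len), len(source))
        if sentences:
            boundaries.append(len(sentences))
        sentences.extend(source[:length])
    return sentences, boundaries


# Toy corpus (hypothetical): three "documents" of five sentences each.
corpus = [[f"doc{d} sentence {s}" for s in range(5)] for d in range(3)]
text, gold = make_synthetic_document(corpus, n_segments=3, min_len=2, max_len=4)
```

Every gold boundary produced this way coincides with a switch between unrelated source documents, so topic changes are sharper than those typically found inside a real document. That is one reason results on such data may not carry over to real data.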

Citation (ISO format)
GEORGESCUL, Maria, CLARK, Alexander, ARMSTRONG, Susan. An Analysis of Quantitative Aspects in the Evaluation of Thematic Segmentation Algorithms. In: Proceedings of the 7th SIGdial Workshop on Discourse and Dialogue. ACL - Association for Computational Linguistics (Ed.). Sydney (Australia). [s.l.] : [s.n.], 2006. p. 144–151. doi: 10.3115/1654595.1654622

Technical information

Creation: 02/10/2009 09:31:16
First validation: 02/10/2009 09:31:16
Update time: 29/01/2026 06:43:43
Status update: 29/01/2026 06:43:43
Last indexation: 29/01/2026 06:43:44
All rights reserved by Archive ouverte UNIGE and the University of Geneva