UNIGE document Chapitre d'actes
previous document  unige:22775  next document
add to browser collection
Title

Recovering dialect geography from an unaligned comparable corpus

Author
Published in Butt, M. ; Prokic, J. ; Mayer, T. & Cysouw, M. Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH (Visualization of Linguistic Patterns and Uncovering Language History from Multilingual Resources). Avignon (France) - 23-24 avril 2012 - Stroudsburg, PA (USA): Association for Computational Linguistics. 2012, p. 63-71
Abstract This paper proposes a simple metric of dialect distance, based on the ratio between identical word pairs and cognate word pairs occurring in two texts. Different variations of this metric are tested on a corpus containing comparable texts from different Swiss German dialects and evaluated on the basis of spatial autocorrelation measures. The visualization of the results as cluster dendrograms shows that closely related dialects are reliably clustered together, while multidimensional scaling produces graphs that show high agreement with the geographic localization of the original texts.
Identifiers
ISBN: 978-1-937284-19-0
Full text
Proceedings chapter (199 Kb) - public document Free access
Other version: http://aclweb.org/anthology-new/W/W12/
Structures
Research group Laboratoire d'Analyse et de Traitement du Langage (LATL)
Citation
(ISO format)
SCHERRER, Yves. Recovering dialect geography from an unaligned comparable corpus. In: Butt, M. ; Prokic, J. ; Mayer, T. & Cysouw, M. (Ed.). Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH (Visualization of Linguistic Patterns and Uncovering Language History from Multilingual Resources). Avignon (France). Stroudsburg, PA (USA) : Association for Computational Linguistics, 2012. p. 63-71. https://archive-ouverte.unige.ch/unige:22775

315 hits

332 downloads

Update

Deposited on : 2012-09-07

Export document
Format :
Citation style :