Master
English

A Glimpse into Terminology Research with R: Two Experiments Exploring Diastratic Variation in a Large Specialized Corpus

Master program title: Maîtrise universitaire en traduction et technologies, mention Terminologie
Defense date: 2021
Abstract

The increasing possibilities for the study of specialized discourse have seen terminologists dealing with larger and more heterogeneous corpus data, a context in which ready-made tools might fall short. This work investigates whether the R programming language can help researchers analyze complex specialized corpora and uncover clues of diastratic variation in language for specific purposes, a phenomenon understood as the coexistence of different language uses among groups of experts in the same field. Since one selling point of R is its capacity to stay up to date with techniques in both statistics and machine learning, an experiment is proposed in each of these areas: one dives into the exploratory analysis of categorical data, while the other relies on distributional semantics and deep-learning technology. Together, these series of tests make it possible to discuss key perspectives and limitations for R in terminology studies.

Keywords
  • R
  • specialized corpora
  • large corpora
  • diastratic variation
  • exploratory statistics
  • distributional analysis
Citation (ISO format)
GONZALEZ GRANADO, Nicolas. A Glimpse into Terminology Research with R: Two Experiments Exploring Diastratic Variation in a Large Specialized Corpus. Master, 2021.
Main files (1)
Master thesis
Access level: Private
Identifiers
  • PID : unige:153976

Technical information

Creation: 18/08/2021 13:47:00
First validation: 18/08/2021 13:47:00
Update time: 16/03/2023 01:04:41
Status update: 16/03/2023 01:04:41
Last indexation: 31/10/2024 22:47:59
All rights reserved by Archive ouverte UNIGE and the University of Geneva