Master
2025/2026
Linguistic Data: Quantitative Analysis and Visualisation
Type:
Compulsory course (Linguistic Theory and Language Description)
Delivered by:
School of Linguistics
When:
1 year, 3, 4 module
Online hours:
16
Open to:
students of all HSE University campuses
Language:
English
Contact hours:
64
Course Syllabus
Abstract
First year: The course is devoted to modern methods of data analysis, as applied to linguistic data, including methods of statistical inference and explanatory data analysis with visualizations. We begin with theoretical background in mathematical statistics and discuss limitations of statistical methods and their applicability to linguistical problems. From practical point of view, we use R system to do actual analysis with real datasets. We also discuss different visualization techniques using popular library ggplot2.Second year: Preprocessing of linguistic data in Python is designed to further the students’ knowledge of natural language processing and to polish their programming skills. The course aims to provide the students with the programming and natural language processing knowledge and competencies necessary to plan and conduct research projects of their own leading to the M.Sc. dissertation and scientific publications.