2025/2026




Введение в методы сбора и анализа больших данных
Статус:
Маго-лего
Кто читает:
Департамент социологии
Охват аудитории:
для своего кампуса
Преподаватели:
Михайлова Оксана Рудольфовна
Язык:
русский
Контактные часы:
36
Программа дисциплины
Аннотация
The growth of Internet penetration and the possibility of collecting and analyzing big data have produced new challenges and have offered new opportunities for researchers and official statistics. Within several years nonreactive and big data has become the main trend in the social sciences. Nonreactive methods include nonparticipant observation and analysis of digital fingerprints such as likes or shares, as well as private documents such as blogs, social media profiles and comments, or public online documents such as mass media materials. This course will give an introduction to key quantitative approaches to the collection of nonreactive data in social sciences. The course is taught in the form of lectures, seminars, and individual work using R studio. All teaching is conducted in English. The goal of the course is to introduce the opportunities of nonreactive and big data for social scientists and learn basic methods and tools to collect nonreactive data. Within the course some R studio packages will be used for data analysis. Basic knowledge of quantitative sociological methods is required. Familiarity with R studio is very helpful but not required. To run R studio, install it or use cloud version (freely available at: https://www.rstudio.com/products/rstudio/download/).
Цель освоения дисциплины
- Know basic methods of collecting nonreactive data in social sciences
- Know different types of big data in social sciences
- Use skills to collect online data (Wikipedia, YouTube, etc).
- Use skills to analyze textual data
Планируемые результаты обучения
- Have skills to analyze textual data
- Have skills to scrap online data through various APIs, automatization of actions in browser, and etc
- Have skills to write R code for basic data analysis tasks
- Know basic concepts of Big data, its opportunities, limitations, and relevance to social sciences
- Know basic concepts of reactive and nonreactive data, its opportunities, limitations, and applications in social sciences
- The student will know the fundamental concepts of big data, its opportunities, and limitations in the context of social research.
- The student will establish a comprehensive ethical framework for social media research.
- The student will master the basics of R and develop computational thinking skills for application in the social sciences.
- The student will perform comprehensive cleaning and preprocessing of consumer behavior data.
- The student will develop advanced skills in collecting data from web sources and APIs using R.
- The student will implement the YouTube API for academic research on video content and comments.
- The student will be able to apply computational methods for analyzing textual data and social networks.
- The student will master bibliometric methods and visualization tools for analyzing academic research.
- The student will develop expertise in bibliometric analysis using R tools and specialized software.
- The student will be able to integrate multiple methods and understand the full cycle from research to publication.
- The student will understand the entire publication process from manuscript preparation to final publication.
Содержание учебной дисциплины
- Topic 1: Introduction to Big Data in Social Sciences
- Topic 2: Introduction to R Programming for Social Scientists
- Topic 3: Data Scraping and Collection in R
- Topic 4: Text Mining and Network Analysis in R
- Topic 5: Bibliometric Analysis and Science Mapping
- Topic 6: Advanced Applications and Academic Publication Process
Элементы контроля
- Class attendance
- Participation in article discussion
- Essay
- Test on articles
- Laboratory work
Промежуточная аттестация
- 2025/2026 1st module0.05 * Class attendance + 0.4 * Essay + 0.3 * Laboratory work + 0.15 * Participation in article discussion + 0.1 * Test on articles