• A
  • A
  • A
  • АБB
  • АБB
  • АБB
  • А
  • А
  • А
  • А
  • А
Обычная версия сайта
Магистратура 2024/2025

Современные методы анализа данных

Статус: Курс по выбору (Науки о данных (Data Science))
Направление: 01.04.02. Прикладная математика и информатика
Когда читается: 1-й курс, 3, 4 модуль
Формат изучения: без онлайн-курса
Охват аудитории: для всех кампусов НИУ ВШЭ
Прогр. обучения: Науки о данных
Язык: английский
Кредиты: 6

Course Syllabus

Abstract

The course “Modern Methods of Data Analysis” introduces students to a new and actively evolving interdisciplinary field of modern data analysis. Started as a branch of Artificial Intelligence, it attracted attention not only from mathematicians and computer scientists, but also that of physicists, economists, computational biologists, linguists and others and became a truly interdisciplinary field of study. In spite of the variety of data sources that could be analyzed, objects and attributes that form a particular dataset possess common statistical and structural properties. The interplay between known data and unknown ones gives rise to complex pattern structures and machine learning methods that are the focus of the study. In the course we will consider methods of Machine Learning and Data Mining. Special attention will be given to the hands-on practical analysis of the real world datasets using available software tools and modern programming languages and libraries.
Learning Objectives

Learning Objectives

  • Learning objectives of the course “Modern Methods of Data Analysis” (MMDA) are to familiarize students with a new rapidly evolving field of machine learning and mining, and provide practical knowledge experience in analysis of real world data.
Expected Learning Outcomes

Expected Learning Outcomes

  • Know basic notions and terminology used in Machine Learning and Data Mining
  • Understand fundamental principles of modern data analysis
  • Learn to develop mathematical models of Machine Learning and Data Mining
  • Be capable of analyzing real world data
Course Contents

Course Contents

  • Introduction to Machine Learning and Data Mining
  • Clustering and its basic techniques
  • Classification and its basic techniques
  • Frequent Itemset Mining and Association Rules
  • Regression and its basic techniques
  • Feature Selection and Dimensionality Reduction. Outlier detection.
  • Recommender Systems and Algorithms
  • Ensemble Clustering and Classification
  • Multimodal relational clustering
Assessment Elements

Assessment Elements

  • non-blocking Homework 1. Classification.
  • non-blocking Homework 2. Clustering.
  • non-blocking Homework 3. Frequent Closed Itemsets and Association Rules.
  • non-blocking Homework 4. Recommender Systems.
  • non-blocking Data Science Project
Interim Assessment

Interim Assessment

  • 2024/2025 4th module
    0,8*(0,5 * homework + 0,5 * project report) + 0,2 * exam
Bibliography

Bibliography

Recommended Core Bibliography

  • The elements of statistical learning : data mining, inference, and prediction, Hastie, T., 2017

Recommended Additional Bibliography

  • Han, J., Kamber, M., Pei, J. Data Mining: Concepts and Techniques, Third Edition. – Morgan Kaufmann Publishers, 2011. – 740 pp.

Authors

  • Антропова Лариса Ивановна
  • IGNATOV Dmitrii IGOREVICH