91传媒在线

Navigated to
Syllabus:

Statistical learning with high-dimensional data, 7.5 credits

Swedish name: Statistisk inlärning med högdimensionella data
This syllabus is valid: 2026-08-31 and until further notice
Course code: 5MS084
Credit points: 7.5
Education level: Second cycle
Main Field of Study and progress level: Mathematical Statistics: Second cycle, has only first-cycle course/s as entry requirements
Grading scale: TH teknisk betygsskala
Responsible department: Department of Mathematics and Mathematical Statistics
Established by: Prefekt vid Institutionen för matematik och matematisk statistik, 2025-12-08,

Contents

This course provides comprehensive knowledge, both regarding breadth and depth, about data science and statistical learning. In the course, both traditional and state of the art methods and algorithms in these fields are discussed. The related fundamental theories are also covered. After passing the course, the students should have a strong ability to solve problems through data. Meanwhile, students are also expected to have a strong self-study ability for understanding and learning any newly developed methods and algorithms.

Module 1 (3hp): Theory
The course gives an overview of important techniques and concepts in statistical learning. Supervised learning techniques, both classical as well as more modern, are treated. In particular, ensembling (e.g. random forests), deep neural networks and regularized linear regression are treated. Additionally, techniques for unsupervised learning (clustering) as well as dimension reduction are treated. The course especially covers mathematical theory about machine learning in general as well as the presented methods in particular.

Module 2 (4.5hp): Computer labs
The module covers the analysis of several data sets, using the statistical methods that are included in the course. The analyses are conducted in one of the the programming languages R or Python. In the module, students write thorough reports of the analyses and the results from them.

Expected learning outcomes

For a passing grade, the student must be able to

Knowledge and understanding

  • describe in detail the basic ideas and formulations of different algorithms for dimensionality reduction, supervised learning, and unsupervised learning problems.
  • describe and derive the theoretical results

Skills

  • apply the basic ideas and common techniques for building statistical models and machine learning models
  • implement methods and algorithms with programming languages like R or Python
  • identify suitable analysis methods, suitable variable selection methods and dimension reduction methods for given classification and cluster analysis problems
  • apply validation methods in order to choose among suitable analysis, variable selection and dimension reduction methods, and pick the most suitable one for specific problems
  • present the results of the analyses in written form

Judgement and approach

  • critically evaluate classification methods and cluster analysis methods from a scientific point of view

Required Knowledge

The course requires 90 ECTS including 7,5 ECTS Computer Programming, 7,5 ECTS Multivariate Data Analysis and 12 ECTS Mathematical Statistics or equivalent. Proficiency in English and Swedish equivalent to the level required for basic eligibility for higher studies.

Form of instruction

The teaching in Module 1 takes the form of lectures and lessons. The teaching in Module 2 takes the form of supervised lab work.

Examination modes

The course is examined in form of a written exam (module 1) and written reports (module 2). Modules 1 and 2 are awarded either a failing (U) or a passing grade (G). For the course as a whole, one of the following grades is awarded: Fail (U), Pass (3), Pass with merit (4), and Pass with distinction (5). The grade for the whole course is determined through a weighted average of the performance in each of the two modules, where module 2 should be weighed higher.

Deviations from the syllabus examination form can be made for a student who has a decision on pedagogical support due to disability. Individual adaptation of the examination form shall be considered based on the student's needs. The examination form is adapted within the framework of the expected learning outcomes of the course syllabus. At the request of the student, the course coordinator, in consultation with the examiner, must promptly decide on the adapted examination form. The decision shall then be communicated to the student.

A student who has been awarded a passing grade for the course cannot be re-assessed for a higher grade. Students who do not pass a test or examination on the original date are given another date to retake the examination. A student who has sat two examinations for a course or a part of a course, without passing either examination, has the right to have another examiner appointed, provided there are no specific reasons for not doing so (Chapter 6, Section 22, HEO). The request for a new examiner is made to the Head of the Department of Mathematics and Mathematical Statistics. Examinations based on this course syllabus are guaranteed to be offered for two years after the date of the student's first registration for the course.

Credit transfer
All students have the right to have their previous education or equivalent, and their working life experience evaluated for possible consideration in the corresponding education at Ume氓 university. Application forms should be addressed to Student ser-vices/Degree evaluation office. More information regarding credit transfer can be found on the student web pages of Ume氓 university, http://www.student.91传媒在线, and in the Higher Education Ordinance (chapter 6). If denied, the application can be ap-pealed (as per the Higher Education Ordinance, chapter 12) to 脰verklaganden盲mnden f枚r h枚gskolan. This includes partially denied applications

Other regulations

This course can not be included in a degree together with another course with similar contents. When in doubt, the student should consult the director of study at the department of mathematics and mathematical statistics. The course can also be included in the subject area of computational science and engineering.

Transitional provisions

In the event that the syllabus ceases to apply or undergoes major changes, students are guaranteed at least three examinations (including the regular examination opportunity) according to the regulations in the syllabus that the student was originally registered on for a period of a maximum of two years from the time that the previous syllabus ceased to apply or that the course ended.

Literature

Valid from: 2024 week 35

An introduction to statistical learning : with applications in R
James Gareth, Witten Daniela, Hastie Trevor, Tibshirani Robert
Second edition. : New York, NY : Springer : [2021] : xv, 607 pages :
ISBN: 9781071614204
Mandatory