Unit name | Statistical Machine Learning |
---|---|
Unit code | MATH30028 |
Credit points | 20 |
Level of study | H/6 |
Teaching block(s) |
Teaching Block 2 (weeks 13 - 24) |
Unit director | Professor. Anthony Lee |
Open unit status | Not open |
Units you must take before you take this one (pre-requisite units) |
Statistics 2 or Econometrics 1 Students are expected to be familiar with programming, and assessed coursework will involve substantial amounts of programming in a high-level language such as R |
Units you must take alongside this one (co-requisite units) |
N/A |
Units you may not take alongside this one |
N/A |
School/department | School of Mathematics |
Faculty | Faculty of Science |
Unit Aims
Why is this unit important?
Statistical machine learning is an increasingly important approach to extracting valuable information from data. In particular, and when used appropriately, it allows for a data-driven approach to solving various problems that cannot be solved from first principles alone.
This unit will develop theoretically and practically a selection of fundamental machine learning problems and commonly used solutions.
How does this unit fit into your programme of study
Although there are several statistical units available to Mathematics students, there is not at present a statistical machine learning unit. We expect that such a unit would complement very well the other statistical units while also being of interest to some students who are less focused on statistics. This unit would also be very suitable for BSc Data Science students.
In contrast to statistical units that cover some similar ideas, there is a stronger emphasis on algorithms in this machine learning unit.
Machine learning is concerned with algorithms that process relevant data and then perform some task. Often, performance of machine learning algorithms is measured statistically, and the algorithms themselves are heavily influenced by statistical ideas. For example, after observing several (x,y) pairs an algorithm may be able to predict with high accuracy the corresponding value of y for an unseen x. When the data is complex and/or high-dimensional, a number of statistical and algorithmic issues arise: a sufficiently rich class of statistical models must be used effectively and irrelevant data should be identified and then discarded.
Students will understand the statistical approach to analyzing data, and how it can be used to effectively perform tasks under appropriate assumptions. This will then enable students to formulate various real-life problems as statistical learning tasks and use common techniques to develop solutions.
By the end of the course the students should be able to:
Students will be able to:
perform some unsupervised learning and dimension reduction tasks.
In addition to lectures introducing the concepts and various algorithms, students will learn:
interactively, if they choose to work in a pair on assessed coursework.
Students will be offered reassessment for the exam and for coursework.
If this unit has a Resource List, you will normally find a link to it in the Blackboard area for the unit. Sometimes there will be a separate link for each weekly topic.
If you are unable to access a list through Blackboard, you can also find it via the Resource Lists homepage. Search for the list by the unit name or code (e.g. MATH30028).
How much time the unit requires
Each credit equates to 10 hours of total student input. For example a 20 credit unit will take you 200 hours
of study to complete. Your total learning time is made up of contact time, directed learning tasks,
independent learning and assessment activity.
See the Faculty workload statement relating to this unit for more information.
Assessment
The Board of Examiners will consider all cases where students have failed or not completed the assessments required for credit.
The Board considers each student's outcomes across all the units which contribute to each year's programme of study. If you have self-certificated your absence from an
assessment, you will normally be required to complete it the next time it runs (this is usually in the next assessment period).
The Board of Examiners will take into account any extenuating circumstances and operates
within the Regulations and Code of Practice for Taught Programmes.