F71RA Machine Learning for Risk and Insurance 1

Course co-ordinator(s): Prof Gareth Peters (Edinburgh).

Aims:

The intention of this course is to introduce students to core mathematical and statistical components of modern machine learning methods that are directly of applicability in the risk, insurance and financial mathematics contexts. In addition, the applications presented, and R computer packages explored in the applications will be focussed primarily on this discipline specific context. This course will aim to cover unsupervised learning methods of relevance to insurance and risk management.

Detailed Information

Course Description: Link to Official Course Descriptor.

Pre-requisites: none.

Location: Edinburgh.

Semester: 1.

Syllabus:

Clustering Methods
• To understand the impossibility theorem of clustering and the axioms of Clustering methods: to gain an understanding of mathematical trade-offs for different classes of clustering method.
• Data preparation for clustering methods: transforms and features.
• Classes of clustering methods and algorithms.
• Breiman Bias and robustness.
• Cluster selection and assessment criteria.
• Kernel methods and clustering.

Projection Methods and Dimension Reduction for Feature Extraction
• Principle Component Analysis PCA
• Independent Component Analysis ICA
• Probabilistic PCA
• Expectation Maximisation Algorithm
• Treating missingness in data and robust variations
• Kernel PCA
• Kernel ICA

Johnson-Lindenstauss Lemma and Compressive Sampling
• Efficient Singular Value Decompositions

Kernel Methods in Hypothesis Testing and Inference Procedures
• Maximum Mean Discrepancy Hypothesis Tests for Goodness of Fit
• Kernel Two Sample Tests
• Kernel Independence Tests

Learning Outcomes: Subject Mastery

• An understanding of selected core fundamental concepts in data science, statistics and machine learning of unsupervised learning.
• An ability to apply unsupervised learning methods to problems arising in insurance and finance
• An understanding of the mathematics underpinning unsupervised machine learning techniques.
• Critical awareness of the appropriateness and performance of the different techniques, as well as the relationships between them.

Learning Outcomes: Personal Abilities

• Rational problem identification and definition.
• Proficiency in the implementation of machine learning methods in R software for Risk and Insurance applications and data analysis.
• Critical analysis and solution selection
• Demonstrate the ability to learn independently.
• Manage time, work to deadlines, and prioritise workloads.
• Use appropriate computer software to process data.
• Present results in a way that demonstrates a good understanding of the technical and broader issues of data earning.

Assessment Methods: Due to covid, assessment methods for Academic Year 2020-21 may vary from those noted on the official course descriptor. Please see the Computer Science Course Weightings and the Maths Course Weightings for 2020-21 Semester 1 assessment methods.

SCQF Level: 11.

Credits: 15.

Other Information

Help: If you have any problems or questions regarding the course, you are encouraged to contact the lecturer

VISION: further information and course materials are available on VISION