Here is the timetable of lectures and coursework information for 2012.
Courseworks A and B must be done and passed, but carry no marks. Courseworks 1 (30%), 2 (40%) and 3 (30%) are marked.
Your lecturer is:
Professor David Corne, http://www.macs.hw.ac.uk/~dwcorne/
I want to arrange my office hour in a way that suits the
majority of students I am teaching this semester, so please make your
preferences known at this doodle poll http://doodle.com/pe9p9e2f2d8bip7u
Lectures: Thursday 13:15 in EM183 and Friday 12:15 in WP108.

week | date | lecture / coursework handout / coursework handin
1 | w/b Mon 10th Sep | C/W A, C/W B
2 | w/b Mon 17th Sep | basics 3
3 | w/b Mon 24th Sep |
4 | w/b Mon 1st Oct | C/W A handin: Friday 5th, 23:59
5 | w/b Mon 8th Oct | C/W B handin: Friday 12th, 23:59
6 | w/b Mon 15th Oct | C/W 1 handin: Friday 19th, 23:59
7 | w/b Mon 22nd Oct |
8 | w/b Mon 29th Oct | C/W 2 handin: Sunday 4th November, 23:59
9 | w/b Mon 5th Nov |
10 | w/b Mon 12th Nov |
11 | w/b Mon 19th Nov | C/W 3
12 | w/b Mon 26th Nov |
OLD THINGS
I am revising the 2012 DM lectures and courseworks; the material below is what was used in 2011.
The updated items will be added to the timetable above as soon as they are done.
In the meantime, you can refer to the material below to get some idea of what is coming up.
thing | slides, coursework handouts, other things
DWC Lecture 1 | COURSEWORK A (in the slides); a paper describing some retail basket data
DWC Lecture 2 | Some basic issues: classification, 1-NN, normalisation, discretization; COURSEWORK B (in the slides)
DWC Lecture 3 | Some awk scripts: znorm.awk, cs2ss.awk, fiddlefield.awk, removefields.awk, fixcomdata.awk
DWC Lecture 4 | the A priori paper
DWC Lecture 5 | The naive Bayes awk program for use in CW2; a paper concerned with feature selection when you want to find a very small number of features (recommended read)
DWC Lecture 6 |
DM Lecture 7 | the original relief paper; a paper with many variants of the relief method; the Dash and Liu survey on FS; a couple of my papers with PhD students concerning feature selection: about text features, about genes
DM Lecture 8 |
DM Lecture 9 | TBA
The assessment of this module is entirely by coursework. My aim in setting the coursework is to get you to learn the essentials of data mining by working on real data and real issues that my colleagues and I are currently working on. You will implement and use techniques that are quite straightforward but nevertheless very important; data analysts in science and industry often misuse them, or fail to use them when they should.