Syllabus
by
Kuiyu Chang
—
last modified
Aug 25, 2008 10:46 PM
Tentative, subject to change.
- 08.05 Lecture 1
Introduction (chap 1)
- History
- Definitions
- Data Mining Tasks
- Data Mining standards
- 08.12 Lecture 2
Data (chap 2)
- Data Attributes
- Data Preprocessing
- Similarity Measures
- 08.19 Lecture 3
Exploring Data (chap 3)
- Summary Statistics
- Visualization
- OLAP and Multidimensional Data Analysis
- 08.26 Lecture 4
Overview of Predictive Learning & Function Approximation (tentative)
- Meet @ SPMS-LT5 (new School next to Canteen B, south spine)
- Based on Friedman's 1994 paper of the same name
- 09.02 Lecture 5
Classification 1 (chap 4)
- Decision Trees
- Model Overfitting
- Bias/Variance
- Comparing Classifiers
- 09.09 Lecture 6
Classification 2 (chap 5)
- Rule-Based Classifier
- Nearest Neighbour Classifier
- Bayesisan Classifiers
- Artificial Neural Network (ANN)
- Support Vector Machine (SVM)
- Ensemble Methods
- Class Imbalance Problem
- 09.16 Lab 1 @ N4-b3c-14
Lab 1 of 2 (self-paced)
- 2 registered students to share one machine
- Sorry, due to limited machines, we can only accomodate registered students
- Meet @ CR7 (optional for those who do not know the way)
- Walk to N4-b3c-14 (CAIS) @ 6:45 pm
- Getting familar with SAS Enterprise Miner
- Download (and optionally print & bind) from SAS the following 150+ page document:
- Also download the Lab1 dataset during the lab
- We will be following the tutorial in this guide.
- 2 registered students to share one machine
- 09.23 Term Break
No Lecture
- 09.30 Lecture 7
Cluster Analysis 1 (chap 8)
- K-Means
- Agglomerative Hierarchical Clustering
- DBSCAN
- Cluster Evaluation
Cluster Analysis 2 (chap 9)
- Prototype-Based Clustering
- Density-Based Clustering
- Graph-Based Clustering
- Scalable Clustering Algorithms
- 10.07 Lecture 8
Anomaly Detection (chap 10)
- Statistical Approaches
- Proximity-Based Outlier Detection
- Density-Based Outlier Detection
- Clustering-Based Techniques
- 10.14 Lecture 9
Association Analysis 1 (chap 6)
- Frequent Itemset Generation
- Rule Generation
- Compact Representation
- Evaluation
Association Analysis 2 (chap 7)
- Handling Categorical, Continous attributes
- Handling concept hierarchy
- Sequential patterns
- subgraph patterns
- infrequent patterns
- 10.21 Lab 2 @ N4-b3c-14 (CAIS)
Lab 2 of 2
- Cluster Analysis
- Association Rules
- Download 28-page Lab 2 manual
- Dataset will be loaded from SAS server during lab
- 10.28 Seminar Presentations
- 11.04 Seminar Presentations + Review Lecture

