Personal tools
You are here: Home Teaching CI6227 Data Mining Syllabus

Syllabus

by Kuiyu Chang last modified Aug 25, 2008 10:46 PM

Tentative, subject to change.

  1. 08.05 Lecture 1

    Introduction (chap 1)

    • History
    • Definitions
    • Data Mining Tasks
    • Data Mining standards
  2. 08.12 Lecture 2

    Data (chap 2)

    • Data Attributes
    • Data Preprocessing
    • Similarity Measures
  3. 08.19 Lecture 3

    Exploring Data (chap 3)

    • Summary Statistics
    • Visualization
    • OLAP and Multidimensional Data Analysis
  4. 08.26 Lecture 4

    Overview of Predictive Learning & Function Approximation (tentative)

    • Meet @ SPMS-LT5 (new School next to Canteen B, south spine)
    • Based on Friedman's 1994 paper of the same name
  5. 09.02 Lecture 5

    Classification 1 (chap 4)

    • Decision Trees
    • Model Overfitting
    • Bias/Variance
    • Comparing Classifiers
  6. 09.09 Lecture 6

    Classification 2 (chap 5)

    • Rule-Based Classifier
    • Nearest Neighbour Classifier
    • Bayesisan Classifiers
    • Artificial Neural Network (ANN)
    • Support Vector Machine (SVM)
    • Ensemble Methods
    • Class Imbalance Problem
  7. 09.16 Lab 1 @ N4-b3c-14

    Lab 1 of 2 (self-paced)

    • 2 registered students to share one machine
      • Sorry, due to limited machines, we can only accomodate registered students
    • Meet @ CR7 (optional for those who do not know the way)
    • Walk to N4-b3c-14 (CAIS) @ 6:45 pm
    • Getting familar with SAS Enterprise Miner
  8. 09.23 Term Break

    No Lecture

  9. 09.30 Lecture 7

    Cluster Analysis 1 (chap 8)

    • K-Means
    • Agglomerative Hierarchical Clustering
    • DBSCAN
    • Cluster Evaluation

    Cluster Analysis 2 (chap 9)

    • Prototype-Based Clustering
    • Density-Based Clustering
    • Graph-Based Clustering
    • Scalable Clustering Algorithms
  10. 10.07 Lecture 8

    Anomaly Detection (chap 10)

    • Statistical Approaches
    • Proximity-Based Outlier Detection
    • Density-Based Outlier Detection
    • Clustering-Based Techniques
  11. 10.14 Lecture 9

    Association Analysis 1 (chap 6)

    • Frequent Itemset Generation
    • Rule Generation
    • Compact Representation
    • Evaluation

    Association Analysis 2 (chap 7)

    • Handling Categorical, Continous attributes
    • Handling concept hierarchy
    • Sequential patterns
    • subgraph patterns
    • infrequent patterns
  12. 10.21 Lab 2 @ N4-b3c-14 (CAIS)

    Lab 2 of 2

    • Cluster Analysis
    • Association Rules
    • Download 28-page Lab 2 manual
    • Dataset will be loaded from SAS server during lab
  13. 10.28 Seminar Presentations
  14. 11.04 Seminar Presentations + Review Lecture
Document Actions
« January 2009 »
January
MoTuWeThFrSaSu
1234
567891011
12131415161718
19202122232425
262728293031