Basic principles and methods for data analysis and knowledge
discovery. Emphasizes developing basic skills for modeling and
prediction and performance evaluation. Topics include system design;
data quality, preprocessing, and association; event classification;
clustering; biometrics; business intelligence; and mining complex types
of data.
Monday/Wednesday, 12-1:15pm
Exploratory Hall L003
Dr. Jessica Lin
Office: Engineering Building 4419
Phone: 703-993-4693
Email: jessica [AT] gmu [DOT] edu
Office Hours: Monday 10:30-11:30am, Wednesday 1:30-2:30pm
Li Zhang
Office Hours: Wednesday 4-6pm
Office: Engineering Building 5321
There will be one exam covering lectures and readings (in class, closed book). The exam must be taken at the scheduled time and place, unless prior arrangement has been made with the instructor. Missed exam cannot be made up.
Required: Introduction
to Data Mining by Pang-Ning Tan, Michael Steinbach, and Vipin
Kumar (click on the link for the companion website)
The GMU Honor Code is in effect at all times. In addition, the CS Department has further honor code policies regarding programming projects, which are detailed here. Any deviation from the GMU or the CS department Honor Code is considered an Honor Code violation. All assignments for this class are individual unless otherwise specified.
If you have a documented learning disability or other condition which may affect academic performance, make sure this documentation is on file with the Office of Disability Services and then discuss with the professor about accommodations.
Week |
Date |
Topic |
Assigned |
Due |
Note / Reading |
1 |
1/21 1/23 |
MLK Day - No Class Introduction |
HW0 |
Ch. 1 |
|
2 |
1/28 1/30 |
Introduction (continued) HW1 background |
HW1 |
HW0 |
|
3 |
2/4 2/6 |
HW1 background HW1 background |
Ch. 4.3 (KNN) Ch. 3.6-3.8 |
||
4 |
2/11 2/13 |
Data 1 Data 2 |
Ch. 2 |
||
5 |
2/18 2/20 |
Data 3 / Classification Snow Day - No Class |
HW1 |
||
6 |
2/25 2/27 |
Classification (Decision trees) Classification (Decision trees) |
HW2 |
Ch. 3 |
|
7 |
3/4 3/6 |
Classification (Probability review; Naive Bayes Classifier) Classification (Naive Bayes Classifier) |
Ch. 4.1, 4.3, 4.4. Read 4.11.2 for HW2 |
||
8 |
3/11 3/13 |
Spring Break - No Class Spring Break - No Class |
|||
9 |
3/18 3/20 |
Classification (model evaluation, variance & bias) Classification (SVM; ensemble methods) |
HW2 (3/19) |
Ch. 4.10, 4.11 Ch. 4.9, 4.10 |
|
10 |
3/25 3/27 |
Classification (Neural Network) / Clustering (k-means) Clustering (k-means, hierarchical clustering) |
HW3 |
Ch. 4.7, Ch. 7.1-7.2 Ch. 7.3 |
|
11 |
4/1 4/3 |
Clustering (hierarchical clustering) Clustering (hierarchical clustering/density-based clustering) |
|||
12 |
4/8 4/10 |
Clustering (cluster validity) Review |
HW4 |
HW3 (4/11) |
|
13 |
4/15 4/17 |
Exam (new date) Post-exam review/Association Rule Mining |
|||
14 |
4/22 4/24 |
Association Rule Mining |
HW4 |
||
15 |
4/29 5/1 |
Association Rule Mining TBA: (possibly presentations) |
|||
16 |
5/6 5/7 (make-up) |
Presentations Presentations |