CS 780/INFS 780

Data Mining for Multimedia Databases

Dr. Jessica Lin

FALL 2009


 

HOME


News & Announcements

9/9 - The lecture slides will be posted on Blackboard: http://gmu.blackboard.com


Instructor:

Dr. Jessica Lin 

Office: Engineering Building, Room 4419

Phone: 703-993-4693

Email: jessica [AT] cs [DOT] gmu [DOT] edu

Office Hours: Wednesday 1-3pm

Classes

Thursday
4:30-7:10pm
School of Art Building 2003

Prerequisite:

INFS 755 or CS 750, or permission of instructor. Some programming skills required.

Textbooks

Required 

You will be given reading materials during the class.

Optional

Data Mining: Concepts and Techniques, 2nd Edition, Morgan Kauffmann Publishers, March 2006. ISBN 1-55860-901-6.

Course Description:

This course covers advanced techniques for data management, learning, and mining large multimedia databases. Issues related to handling such data including feature selection, compression, high dimensional indexing, interactive search and information retrieval, pattern discovery, and scalability to large datasets are discussed. Mining techniques and data types to be covered include texts/web, images, videos, DNA, temporal, spatial, spatiotemporal databases, stream mining and data visualization.

Grading

Grading will be based on assignments, midterm, final, and a project. 


Tentative Schedule


No Dates Topics Reading Assignment Notes
1 9/3 Introduction / Data Mining Review I    
2 9/10 Introduction / Data Mining Review II    
3 9/17 Indexing / Spatial Access Methods I #1
   
4 9/24 Spatial Access Methods II / Text Mining I  #2  
5 10/1 Text Mining II  #3  
6 10/8 Web   #4-5  HW1 due
7 10/15 Time Series I  #6  
8 10/22 Time Series II    HW2 due
9 10/29 Midterm    Midterm Topics
10 11/5 Spatial / Spatiotemporal Mining #7   
11 11/12 Image I
 
12 11/19 Image II #8    
13 11/26 Thanksgiving    
14 12/3 Video / Audio #9-10   
15 12/10 Project Presentation    
16 12/17 Final Exam    extra credit due


 


Reading Materials:


1. Christian Bohm, Stefan Berchtold, and Daniel A. Keim. 2001.
Searching in High-Dimensional Spaces - Index Structures for Improving the Performance of Multimedia Databases. ACM Comput. Surv. 33, 3 (September 2001), 322-373.

2. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze. 2008. Introduction to Information Retrieval. Chapter 13 & 14.

3. Scott Deerwester, Susan Dumais, Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American society for information science.

4. Brin, S. and L. Page (1998). Anatomy of a Large-Scale Hypertextual Web Search Engine. 7th Intl World Wide Web Conf.

5. Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze. 2008. Introduction to Information Retrieval. Chapter 19 & 21.

6. Chotirat Ann Ratanamahatana, Jessica Lin, Dimitrios Gunopulos, Eamonn Keogh, Michail Vlachos, and Gautam Das. 2005. Mining time series data. Data Mining and Knowledge Discovery Handbook 2005. Eds. Oded Maimon, Lior Rokach. Springer. Pages 1069-1103.

7. Pang-Ning Tan, Michael Steinback, Vipin Kumar, Christopher Potter, Steven Klooster, and Alicia Torregrosa. Finding Spatio-Temporal Patterns in Earth Science Data.

8. Yong Rui, Thomas S. Huang, and Shih-Fu Chang. Image retrieval: current techniques, promising directions and open issues. Journal of Visual Communication and Image Representation, Vol. 10, no. 4, pp. 39-62. 1999.

9. (Shazam) Avery Wang. 2003. An Industrial-Strength Audio Search Algorithm.

10. R. J. Demopoulos and M. J. Katchabaw. Music Information Retrieval: A Survey of Issues and Approaches. Technical Report #677, Department of Computer Science, The University of Western Ontario, London, Canada, January 2007, 72 pages.