CS 609: Data Management and Exploration on the Web

Prof. Wendy Hui Wang 
Fridays 1pm-3:30pm, Lieb 210

Office Hours: Mondays 3:30-4:30pm, Babbio 620. All other times by appointment.


Prerequisites

CS 442 (Database Management System) or CS 561

Textbook

Ramakrishnan and Gehrke. "Database Management Systems", 3rd Edition, McGraw-Hill, 2003. ISBN-10: 0072465638


Policy

Students are expected to attend all classes.

The use of cell phones, PDAs, etc. in class is not allowed. Devices must be turned off during class.

There are no make-up quizzes. The quizzes will be given at the end of classes.

Laptops may be used in class only if explicitly authorized by the instructor.

A make-up midterm/final will be granted only if the instructor is notified before the exam and there is a serious illness or a similarly important reason for missing the exam.

For fairness to all students, there will be no individual extra credit work.

It is cheating to collaboratively work out a detailed solution, to copy a solution/material from another student or some other resource without citing it, or to give away a solution/material. Self-plagiarism is also considered cheating.

ALL parties involved in a case of cheating get an automatic grade of zero (0) on the assignment/exam. Repeated cases get an F in the course. Any case of cheating will be reported to the honor board and/or the Dean of the Graduate School.

Any complaint regarding a grade must be presented no later than seven weekdays after the grade for the respective assignment, exam, or project is announced.

Grading Assignment Breakdown

Submission Policy

Assignments are to be submitted via Moodle.

Late Policy

10 points (out of 100) will be taken off for each day that an assignment is turned in late. In other words, 10 points will be taken off if the assignment is turned in before midnight of the day after the due date, 20 points if it is turned in before midnight of the following day, and so on. Assignments may be turned in late up to 5 days after the due date/time.


Syllabus

Warning: The following is a tentative schedule and is subject to change.

 

| Lecture | Date | Topic | Assignment |
|---|---|---|---|
| 1 | Jan 18 | Introduction. IR (I): Modeling | |
| 2 | Jan 25 | IR (II): indexing | |
| 3 | Feb 1 | IR (III): text categorization | |
| 4 | Feb 8 | IR (IV): evaluation, recommender system | Project team-up and sign-up of project |
| 5 | Feb 15 | IR (V): information extraction | Assignment 1 due |
| 5 | Feb 22 | XML (1) | 1-page project proposal due; Assignment 2 assigned |
| 6 | Mar 1 | XML (2) | |
| 7 | Mar 8 | Information integration (I). Midterm overview | Assignment 2 due. Assignment 3 assigned |
| 8 | Mar 15 | Spring break | |
| 9 | Mar 22 | Midterm | Project mid-term progress report |
| 10 | Mar 29 | Good Friday | |
| 11 | Apr 5 | Information integration (II) | |
| 12 | Apr 12 | In-class paper presentations (I) | Assignment 3 due. Project scheduling |
| 13 | Apr 19 | In-class paper presentations (II) | |
| 14 | Apr 26 | In-class project presentations | |
| 15 | May 3 | | Project final report due |