Mshabab Alrizah

data warehousing and data mining

“Without data, you’re just another person with an opinion.” Edwards Deming, Statistician

“If you torture the data long enough, it will confess.”  Ronald H. Coase

Announcements

Please make sure to check this section frequently!

Announcement #1 2/9/2021: This course consists of two parts: Data Warehousing and Data mining. Keep in your mind it is not a Data mining course (Advanced) 

About the course


This course provides advanced aspects of data warehousing and data mining (design, management, acquisition, analysis, query, mining, and visualization). The main objective is to focus on serving the informational and analytical needs of the academic and an enterprise. More precisely, the course consists of two main components. The first component is an introduction to data warehousing and the foundation of understating the subjects involved in building and designing a successful data warehouse. The second component covers data mining, which refers to the discovery of interesting and useful knowledge from the data associated with the usage, content, and structure of different data resources. This includes an overview of data mining and knowledge discovery, data mining pre-processes, and data mining tasks.

Instructor

Dr. Mshabab Alrizah

Location: Faculty office 6

Office hours: Monday 9AM-11AM  and Thursday 2PM-4PM

Lectures

Class Sunday: 7-8pm: online (Blackboard) 

Class Monday: 7-8pm: classroom 3

Class Thursday: 8-10am: lab 3

Teaching Assistants

TBA

Office Location: nan

Office hours: nan

Email: nan

Prerequisites

Database Management Systems 222-CCS-3

Programing Language: Java, R or Python

Textbook(s)

Data Mining: Concepts and Techniques  (3rd Edition) by Jiawei Han , Micheline Kamber , and Jian Pei.

Data Warehouse Design: Modern Principles and Methodologies (1st Edition) by Matteo Golfarelli and  Stefano Rizzi

(Extra and not a requirement )The Elements of Statistical Learning: Data Mining, Inference, and Prediction ( 2nd edition). 2009. by Trevor Hastie 

More resours

TBA

Homework and programming assignments

Assignments #1:

Assignment #2:

Assignment #3:

Assignment #4:

Assignment #5:

Evaluation and grade

Assignments: 5 %  (Assignement1, assignement2, …)

Lab assignments: 5% (Lab1, Lab2,….)

Quizzes: 5% (Quize1, Quize2,…)

Lab Quiz: 5%

One Midterm exams: 20%

Lab exam and tutorial: 10%

Final exam: 50%

Schedule

This schedule is subject to change depending on how the course progresses.

DateWEEKTopicSlidesAssignment/ HW DueNote
12-Sep3No class: registrations issues
13-Sep3Overview of Data Warehousing (Warm-up)
16-Sep3Overview of the course & Data Warehousing: basic concept
19-Sep4Data warehousing characteristics and OLAP
20-Sep4Data Cube and OLAP Quiz1
23-Sep4Holiday
21-Sep5OLAP vs. OLTP
22-Sep5Introducing Dimensional Modeling
30-Sep5Introduction to Lab Section Assignment 1Lab
23-Sep6Designing the Dimensional Modeling Quiz 2
24-Sep6Processing Advanced Kinds of Queries by Exploring Cube technology
7-Oct6Revision of SQL Assignment 2Lab
25-Sep7Dimensional Modeling Case Study
26-Sep7Midterm Midterm
14-Oct7Data Warehousing Physical design – Implementation in SQL with indexes Lab Assignment 1Lab
27-Sep8Long weekend
28-Sep8Long weekend
21-Oct8Dimensional Modeling tutorials – Case Studies Lab
29-Sep9Introduction to Data Mining and Knowledge Discovery
30-Sep9Pattern Evaluation Methods Quiz 3
28-Oct9Introduction, interfaces and use of Wake (We may change Wake to PyCharm and python) Assignment 3Lab
1-Oct10Web Mining (Web content)
2-Oct10Data mining with Weka- Data Preprocessing (We may change Wake to PyCharm and python)
4-Nov10Long weekend
3-Oct11Web Mining ( Structure, and Usage Mining)
4-Oct11Data Pre-Processing (Data preparation for knowledge discovery)
11-Nov11Data mining with Weka- Data Preprocessing (We may change Wake to PyCharm and python) lab Quiz1Lab
5-Oct12Data Pre-Processing (Data preparation for knowledge discovery)
6-Oct12Data Transformation and Data Discretization
18-Nov12Data mining with Weka- Data Preprocessing (We may change Wake to PyCharm and python) Assignment 4Lab
7-Oct13Mining Frequent Patterns
8-Oct13 Associates Rules Quiz 4
25-Nov13Association Rules mining with Weka Using Apriorism Algorithm (We may change Wake to PyCharm and python) Lab
9-Oct14Holiday
10-Oct14Holiday
2-Dec14Holiday
11-Oct15Classification and Prediction Clustering
12-Oct15Techniques to Improve Classification Accuracy
9-Dec15Classificational via Decision Tree in Weka (We may change Wake to PyCharm and python) Lab Assignment 2Lab
13-Oct16Clustering
14-Oct16Evaluation of Clustering Quiz 5
16-Dec16Clustering: K-means Clustering in Weka (We may change Wake to PyCharm and python) lab quiz2Lab
15-Oct17Long weekend
16-Oct17Review
23-Dec17Final Lab Assessment Final Lab Assessment
17-Oct18Final Exam Final Exam
18-Oct18Final Exam Final Exam
30-Dec18Final Exam Final Exam