Course Code: CSA16
Course Title: Data Warehousing and Data Mining
L T P C: 3 0 2 4
Prerequisite: Nil
Course Objectives
1. To identify the scope and essentiality of Data Warehousing.
2. To learn various types of Data Mining.
3. To analyze data, choose relevant models and algorithms for respective applications.
4. To study spatial and web data mining.
5. To understand the applications of Data Warehousing and Data Mining.
Course Outcomes
On successful completion of the course, the student will be able to:
1. Understand data warehouse fundamentals and data mining principles.
2. Design a data warehouse with dimensional modeling and apply OLAP operations.
3. Identify appropriate data mining algorithms to solve real-world problems.
4. Compare and evaluate different data mining techniques such as classification, prediction, clustering and association rule mining.
5. Describe complex data types with respect to spatial and web mining.
UNIT I INTRODUCTION TO DATA WAREHOUSING
An Overview - Data Warehouse Architecture - Data Warehouse Multidimensional Data Model -
Data Warehouse Implementation - Data Warehouse OLAP Technology. Data Analytics Tools -
Data Processing - Operational vs. Decision Support Systems - ETL Process - Data Cleaning -
Data Extraction - Data Integration and Transformation - Data Loading - Data Reduction.
UNIT II INTRODUCTION TO DATA MINING
Introduction - Importance of Data Mining - Various Kinds of Data - Data Mining Functionalities -
Various Kinds of Patterns - Interesting Patterns - Classification of Data Mining Systems - Data
Mining Task Primitives - Integration of a Data Mining System - Major Issues in Data Mining.
UNIT III ASSOCIATION RULE MINING
Mining Frequent Patterns, Associations and Correlations - Basic Concepts and a Road Map -
Efficient and Scalable Frequent Itemset Mining Methods - Mining Various Kinds of Association Rules.
UNIT IV CLASSIFICATION, PREDICTION AND CLUSTERING
Classification by Decision Tree Induction - Bayesian Classification - Rule-Based Classification -
Backpropagation - Support Vector Machines - Prediction Algorithms - Clustering: Hierarchical
Algorithms: distance-based agglomerative and divisive clustering - Partitional Algorithms: k-means.
UNIT V DATA MINING APPLICATIONS
Applications and Trends - Data Mining Applications - Data Mining System Products and Research
Prototypes - Additional Themes on Data Mining - Social Impact of Data Mining - Trends in Data
Mining - Spatial and Web Data Mining.
TEXT BOOKS
1. J. Han and M. Kamber, "Data Mining: Concepts and Techniques", Morgan Kaufmann, Third Edition, 2011.
2. Alex Berson and Stephen J. Smith, "Data Warehousing, Data Mining, and OLAP", McGraw-Hill, 1998.
REFERENCES
1. Kargupta, Joshi, Sivakumar and Yesha, "Data Mining", Prentice Hall of India, 2007.
2. Ian H. Witten and Eibe Frank, "Data Mining", Morgan Kaufmann Publishers, Second Edition.
WEB LINKS
1. https://nptel.ac.in/courses/106/105/106105174/
2. https://mitmecsept.files.wordpress.com/2017/04/data-mining-concepts-and-techniques-2nd-edition-impressao.pdf