0% found this document useful (0 votes)
147 views2 pages

Data Mining Syllabus Overview

This syllabus covers 5 units on data mining and preprocessing. Unit I introduces data mining, the KDD process, and data preprocessing techniques. Unit II covers frequent pattern mining, association rules, and correlation analysis. Unit III discusses classification, prediction, and classifier evaluation. Unit IV focuses on clustering, outlier detection methods. Unit V presents data warehousing, OLAP, and web mining techniques. Each unit includes self-study topics to supplement the core content.

Uploaded by

hellrider22
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
147 views2 pages

Data Mining Syllabus Overview

This syllabus covers 5 units on data mining and preprocessing. Unit I introduces data mining, the KDD process, and data preprocessing techniques. Unit II covers frequent pattern mining, association rules, and correlation analysis. Unit III discusses classification, prediction, and classifier evaluation. Unit IV focuses on clustering, outlier detection methods. Unit V presents data warehousing, OLAP, and web mining techniques. Each unit includes self-study topics to supplement the core content.

Uploaded by

hellrider22
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

SYLLABUS

Data Mining and Pre-processing


Introduction: Need of Data Mining, Knowledge Discovery in Database
(KDD), Architecture of Data Mining System; Data Objects and Attribute
Types, Statistical Description of Data, Data Visualization.
Data Preprocessing: Introduction to Data mining, Data mining
Unit – I
Functionalities, Data preprocessing (data summarization, data cleaning, data
integration and transformation, Feature selection and extraction, data
discretization)
Self-Study: Integration of Data Mining with a Database or Data Warehouse
System, Issues in Data Mining
Mining Frequent Patterns, Association and Correlations
Frequent Itemset Mining:
Interesting Item Set Mining, Market Basket Analysis, Generating Association
Rules, Apriori Algorithm, A pattern growth approach for mining frequent item

Unit – II set, Mining frequent item-sets using vertical data, Evaluation of Association
Patterns, From Association Analysis to Correlation Analysis
Self-Study: Sequential Pattern Mining Algorithms, Pattern mining in multi-
level, multi-dimensional space Data Integration: different types of digital data
and their sources, ETL (extract transform and load)Tools
Classification and Prediction
Classification: Decision Tree Classifier, Lazy Learner: KNN Classifier,
Unit –
Classifier Accuracy Measures, techniques for Evaluating Classifier Accuracy.
III
Prediction: Linear, Non-Linear Regression.
Self-Study: Case-Based Reasoning, Associative Classification
Clustering and Outlier Detection
Cluster Analysis:
Categories of Clustering methods, Different Types of Clusters, Partitioning
methods: k-Means, k-Medoids; Hierarchical Clustering Methods:
Unit – Agglomerations Grid Based Methods: STING, Cluster Evaluation
IV Outlier Analysis:
Types of outlier, Proximity based approach: distance based, Density based
approach
Self-Study: Grid Based Methods: CLIQUE, Density based Clustering:

OPTICS, Deviation based outlier detection approach: grid based


Data Warehouse & Web Mining
Introduction to Data Warehouse: OLTP Vs. OLAP, ETL process and Tools,
Data Warehouse design: Star schema, Snowflakes schema etc. OLAP query
and Reporting Tools: Micro strategy, Cognos etc.
Unit – V
Web Mining: Introduction, Web Mining, Web Content Mining, Web Structure
Mining, Web Usage Mining, Text Mining, Unstructured Text, Episode Rule
Discovery for Texts, Hierarchy of Categories, Text Clustering.
Self-Study: Time Series analysis, Graph Mining, Data Mining Applications

You might also like