Data Mining Syllabus and Question

Syllabus of Data Warehousing and Data Mining - 8th Semester of the B.Sc. CSIT program of Tribhuvan University, Nepal

Uploaded by

ComfortablyNumb


Course Title: Data Warehousing and Data Mining

Course no: CSC-451

Full Marks: 60+20+20

Credit hours: 3

Pass Marks: 24+8+8

Nature of course: Theory (3 Hrs.) + Lab (3 Hrs.)

Course Synopsis: Analysis of advanced aspects of data warehousing and data mining.
Goals: This course introduces advanced aspects of data warehousing and data mining,
encompassing the principles, research results and commercial applications of current
technologies.
Course content breakdown by unit:

Introduction (Lecture hours: 5)
  What motivated data mining? What is data mining?
  Types of databases (relational databases, data warehouses, transactional databases)
  Functionalities of data mining: what kinds of patterns can be mined?
  Association analysis, cluster analysis, outlier analysis, evolution analysis
  Stages of knowledge discovery in databases (KDD)
  Setting up a KDD environment
  Issues in data warehousing and data mining
  Applications of data warehousing and data mining

Data Warehouse for Data Mining
  Differences between operational database systems and data warehouses
  Data warehouse architecture
  Distributed and virtual data warehouses
  Data warehouse manager
  Data marts, metadata, multidimensional data model
  From tables and spreadsheets to data cubes
  Star schema, snowflake schema and fact constellation schema

OLAP Technology for Data Mining
  On-line analytical processing models and operations (drill down, drill up, slice, dice, pivot)
  Types of OLAP servers: ROLAP versus MOLAP versus HOLAP
  OLTP
  Tuning for data warehouse
  Computation of data cubes, modeling OLAP data, OLAP queries
  Data warehouse back-end tools
  Tuning and testing of data warehouse

Data Mining Techniques
  Data mining definition and tasks
  KDD versus data mining
  Data mining techniques, tools and applications

Data Mining Query Languages
  Data specification, specifying knowledge, hierarchy specification, pattern presentation and visualization specification
  Data mining languages and standardization of data mining

Association Analysis
  Association rule mining (market basket analysis)
  Why is association mining necessary?
  Pros and cons of association rules
  Apriori algorithm

Cluster Analysis, Classification and Prediction
  What is classification? What is prediction?
  Issues regarding classification and prediction (preparing the data for classification and prediction, comparing classification methods)
  Classification by decision tree induction (extracting classification rules from decision trees)
  Bayesian classification
  Classification by back propagation
  Introduction to regression (types of regression)
  Clustering algorithms (K-means and K-medoid algorithms)

Advanced Concepts in Data Mining
  Mining text databases
  Mining the World Wide Web
  Mining multimedia and spatial databases
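
The Apriori algorithm listed under association analysis can be illustrated in a few lines. What follows is a minimal Python sketch, not a reference implementation: the function name `apriori`, the fractional `min_support` parameter, and the grocery-style transactions are all assumptions made for illustration. It shows the level-wise idea: find frequent 1-itemsets, join them into larger candidates, prune any candidate that has an infrequent subset, then count support.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every frequent itemset.

    transactions: iterable of item collections (hypothetical input format).
    min_support: minimum support as a fraction between 0 and 1.
    """
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(1 for t in transactions if itemset <= t) / n

    # Level 1: frequent single items.
    items = sorted({item for t in transactions for item in t})
    current = {}
    for item in items:
        s = support(frozenset([item]))
        if s >= min_support:
            current[frozenset([item])] = s

    frequent = dict(current)
    k = 2
    while current:
        prev = set(current)
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in prev for sub in combinations(c, k - 1))
        }
        # Count step: keep the candidates that meet the support threshold.
        current = {}
        for c in candidates:
            s = support(c)
            if s >= min_support:
                current[c] = s
        frequent.update(current)
        k += 1
    return frequent


# Hypothetical market-basket data, used only to exercise the sketch.
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
frequent = apriori(baskets, min_support=0.5)
for itemset, s in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), s)
```

The sketch rescans all transactions for every candidate, which keeps it short; practical implementations usually count candidates in one pass per level.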

Laboratory:
1. Creating a simple data warehouse
2. Concepts of data cleaning and preparing for operation
3. Implementing classification and clustering algorithms in any programming language
4. Association rule mining through data mining tools
5. Data Classification through data mining tools
6. Clustering through data mining tools
7. Data visualization through data mining tools
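
As a possible starting point for laboratory item 3, here is a minimal K-means sketch in plain Python. It is an illustration under stated assumptions rather than a prescribed solution: the function name `kmeans`, the choice of the first `k` points as initial centroids, the Euclidean distance, and the sample points are all hypothetical.

```python
import math


def kmeans(points, k, max_iter=100):
    """Cluster points into k groups; return (centroids, assignments)."""
    # Assumption: use the first k points as initial centroids (naive but deterministic).
    centroids = [tuple(p) for p in points[:k]]
    assignments = None
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # No point changed cluster, so the algorithm has converged.
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, assignments


# Hypothetical 2-D sample with two visually separate groups.
points = [(1.0, 1.0), (8.0, 8.0), (1.5, 2.0), (9.0, 9.5), (1.0, 0.5), (8.5, 9.0)]
centroids, assignments = kmeans(points, k=2)
print(centroids)
print(assignments)
```

Results depend on the initial centroids, so a lab implementation may want random restarts or a smarter seeding strategy.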

Text Books:

1. Data Mining: Concepts and Techniques, J. Han and M. Kamber, Second Edition, Morgan Kaufmann.
ISBN: 978-1-55860-901-3
2. Data Warehousing in the Real World, Sam Anahory and Dennis Murray, Pearson Education
Asia.

References:
1. Data Mining Techniques, Arun K. Pujari, University Press.
2. Data Mining, Pieter Adriaans and Dolf Zantinge.
3. Data Mining, Alex Berson, Stephen Smith and Kurt Thearling, TMH.
4. Data Mining, Adriaans, Addison-Wesley Longman.

Model Question
Full marks: 60
Pass marks: 24
Time: 3 hours.
Bachelor Level/ Fourth Year/ Eighth Semester/ Science
Data Warehousing and Data Mining (CSC-451)
Candidates are required to give their answers in their own words as far as practicable. The
figures in the margin indicate full marks.
Group-A
Long Answer Questions (Attempt any Two questions) [2x10=20]

1. Suppose that a data warehouse for Big University consists of the following four
dimensions: student, course, semester, and instructor, and two measures: count and
avg-grade. When at the lowest conceptual level (e.g., for a given student, course, semester, and
instructor combination), the avg-grade measure stores the actual course grade of the
student. At higher conceptual levels, avg-grade stores the average grade for the given
combination.
a) Draw a snowflake schema diagram for the data warehouse.
b) Starting with the base cuboid [student, course, semester, instructor], what specific
OLAP operations (e.g., roll-up from semester to year) should one perform in order to
list the average grade of CS courses for each Big University student?
c) If each dimension has five levels (including all), such as student < major < status <
university < all, how many cuboids will this cube contain (including the base and apex
cuboids)?
2. A = {A1, A2, A3, A4, A5, A6}. Assume σ = 35% (minimum support). Use the Apriori algorithm to get the desired
solution.

A1  A2  A3  A4  A5  A6
0   0   0   1   1   1
0   1   1   1   0   0
1   0   0   1   1   1
1   1   0   1   0   0
1   0   1   0   1   1
0   1   1   1   0   1
0   0   0   1   1   0
0   1   0   1   0   1
1   0   0   1   0   0
1   1   1   1   1   1

3. What kinds of data preprocessing do we need before applying a data mining algorithm to a
data set? Explain the binning method for handling noisy data, with an example.
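
The binning method named in question 3 can be sketched briefly. The following Python example is a hedged illustration, not a model answer: the helper names (`equal_depth_bins`, `smooth_by_means`, `smooth_by_boundaries`), the bin depth of 3, and the price values are all assumptions chosen for the example. It sorts the data, splits it into equal-depth bins, and then smooths each bin either by its mean or by its nearest boundary value.

```python
def equal_depth_bins(values, depth):
    """Sort the values and split them into bins holding `depth` values each."""
    ordered = sorted(values)
    return [ordered[i:i + depth] for i in range(0, len(ordered), depth)]


def smooth_by_means(bins):
    """Replace every value in a bin with the mean of that bin."""
    return [[sum(b) / len(b)] * len(b) for b in bins]


def smooth_by_boundaries(bins):
    """Replace every value with the closer of the bin's minimum and maximum."""
    smoothed = []
    for b in bins:
        lo, hi = b[0], b[-1]
        # Ties go to the lower boundary (an arbitrary choice in this sketch).
        smoothed.append([lo if v - lo <= hi - v else hi for v in b])
    return smoothed


# Illustrative price data (hypothetical values).
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]
bins = equal_depth_bins(prices, depth=3)
by_means = smooth_by_means(bins)
by_boundaries = smooth_by_boundaries(bins)
print(bins)            # [[4, 8, 15], [21, 21, 24], [25, 28, 34]]
print(by_means)        # [[9.0, 9.0, 9.0], [22.0, 22.0, 22.0], [29.0, 29.0, 29.0]]
print(by_boundaries)   # [[4, 4, 15], [21, 21, 24], [25, 25, 34]]
```

Smoothing by means flattens each bin to one value, while smoothing by boundaries keeps the extremes of each bin, so the choice depends on how much local variation should survive.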

Group- B
Short Answer Questions (Attempt any Eight questions) [8x5=40]
Question number 13 is compulsory.

4. Explain the use of the frequent item set generation process. [5]
5. Differentiate between data marts and data cubes. [5]
6. Explain OLAP operations with examples. [5]
7. List the drawbacks of the ID3 algorithm, including over-fitting, and the techniques used to remedy it. [5]
8. Write the algorithm for K-means clustering. Compare it with the k-nearest neighbor
algorithm. [5]
9. What is text mining? Explain the text indexing techniques. [5]
10. Describe the genetic algorithm as a problem-solving technique in data mining. [5]
11. What do you mean by WWW mining? Explain WWW mining techniques. [5]
12. What is DMQL? How do you define a star schema using DMQL? [5]
13. Write short notes on any two: [2x2.5=5]
a) Text Database Mining
b) Back propagation Algorithm
c) Regression
d) HOLAP

*****
