0% found this document useful (0 votes)

37 views

KM Notes Unit-3

The document discusses multi-dimensional analysis and data mining. It defines data mining and describes the data mining architecture. It then discusses different types of databases and applications of data mining such as in healthcare, retail, education, manufacturing, customer relationship management, fraud detection, and banking.

Uploaded by

prince pal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views

KM Notes Unit-3

Uploaded by

prince pal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

MAHARANA PRATAP GROUP OF INSTITUTIONS

KOTHI MANDHANA, KANPUR

(Approved by AICTE, New Delhi and Affiliated to Dr. AKTU, Lucknow)

Digital Notes
[Department of Computer Applications]

Subject Name : Knowledge Management

Subject Code : BCA-5001
Course : BCA
Branch : BCA
Semester : Vth
Prepared by : Mr. Sandeep Tripathi
Unit - 3

Multi- Dimensional Analysis

1. Data Mining
Data Mining is the process of investigating hidden patterns of information to various
perspectives for categorization into useful data, which is collected and assembled in particular
areas such as data warehouses, efficient analysis, data mining algorithm, helping decision
making and other data requirement to eventually cost-cutting and generating revenue.
Data mining is the act of automatically searching for large stores of information to find trends
and patterns that go beyond simple analysis procedures. Data mining utilizes complex
mathematical algorithms for data segments and evaluates the probability of future events. Data
Mining is also called Knowledge Discovery of Data (KDD).
Data Mining is similar to Data Science carried out by a person, in a specific situation, on a
particular data set, with an objective. This process includes various types of services such as text
mining, web mining, audio and video mining, pictorial data mining, and social media mining. It
is done through software that is simple or highly specific. By outsourcing data mining, all the
work can be done faster with low operation costs.

1.1 Data Mining Architecture

2
Page
Data mining architecture has many elements like Data Warehouse, Data Mining Engine, Pattern
evaluation, User Interface and Knowledge Base.

Data Warehouse:
A data warehouse is a place which store information collected from multiple sources under
unified schema. Information stored in a data warehouse is critical to organizations for the process
of decision-making.

Data Mining Engine:

Data Mining Engine is the core component of data mining process which consists of various
modules that are used to perform various tasks like clustering, classification, prediction and
correlation analysis.

Pattern Evaluation:
Pattern Evaluation is responsible for finding various patterns with the help of Data Mining
Engine.

User Interface:
User Interface provides communication between user and data mining system. It allows user to
use the system easily even if user doesn't have proper knowledge of the system.

Knowledge Base:
Knowledge Base consists of data that is very important in the process of data mining.Knowledge
Base provides input to the data mining engine which guides data mining engine in the process of
pattern search.

1.2 Types of Database

Relational Database:
A relational database is a collection of multiple data sets formally organized by tables, records,
and columns from which data can be accessed in various ways without having to recognize the
database tables. Tables convey and share information, which facilitates data searchability,
3

reporting, and organization.

Page
Data warehouses:
A Data Warehouse is the technology that collects the data from various sources within the
organization to provide meaningful business insights. The huge amount of data comes from
multiple places such as Marketing and Finance. The extracted data is utilized for analytical
purposes and helps in decision- making for a business organization. The data warehouse is
designed for the analysis of data rather than transaction processing.

Data Repositories:
The Data Repository generally refers to a destination for data storage. However, many IT
professionals utilize the term more clearly to refer to a specific kind of setup within an IT
structure. For example, a group of databases, where an organization has kept various kinds of
information.

Object-Relational Database:
A combination of an object-oriented database model and relational database model is called an
object-relational model. It supports Classes, Objects, Inheritance, etc.
One of the primary objectives of the Object-relational data model is to close the gap between the
Relational database and the object-oriented model practices frequently utilized in many
programming languages, for example, C++, Java, C#, and so on.

Transactional Database:
A transactional database refers to a database management system (DBMS) that has the potential
to undo a database transaction if it is not performed appropriately. Even though this was a unique
capability a very long while back, today, most of the relational database systems support
transactional database activities.

1.3 Data Mining Applications

Data Mining is primarily used by organizations with intense consumer demands- Retail,
Communication, Financial, marketing company, determine price, consumer preferences, product
positioning, and impact on sales, customer satisfaction, and corporate profits. Data mining
enables a retailer to use point-of-sale records of customer purchases to develop products and
4

promotions that help the organization to attract the customer.

Page
These are the following areas where data mining is widely used:

Data Mining in Healthcare:

Data mining in healthcare has excellent potential to improve the health system. It uses data and
analytics for better insights and to identify best practices that will enhance health care services
and reduce costs. Analysts use data mining approaches such as Machine learning, Multi-
dimensional database, Data visualization, Soft computing, and statistics. Data Mining can be
used to forecast patients in each category. The procedures ensure that the patients get intensive
care at the right place and at the right time. Data mining also enables healthcare insurers to
recognize fraud and abuse.

Data Mining in Market Basket Analysis:

Market basket analysis is a modeling method based on a hypothesis. If you buy a specific group
of products, then you are more likely to buy another group of products. This technique may
enable the retailer to understand the purchase behavior of a buyer. This data may assist the
retailer in understanding the requirements of the buyer and altering the store's layout
accordingly. Using a different analytical comparison of results between various stores, between
customers in different demographic groups can be done.

Data mining in Education:

Education data mining is a newly emerging field, concerned with developing techniques that
explore knowledge from the data generated from educational Environments. EDM objectives are
recognized as affirming student's future learning behavior, studying the impact of educational
support, and promoting learning science. An organization can use data mining to make precise
decisions and also to predict the results of the student. With the results, the institution can
concentrate on what to teach and how to teach.

Data Mining in Manufacturing Engineering:

Knowledge is the best asset possessed by a manufacturing company. Data mining tools can be
beneficial to find patterns in a complex manufacturing process. Data mining can be used in
system-level designing to obtain the relationships between product architecture, product
5
Page
portfolio, and data needs of the customers. It can also be used to forecast the product
development period, cost, and expectations among the other tasks.

Data Mining in CRM (Customer Relationship Management):

Customer Relationship Management (CRM) is all about obtaining and holding Customers, also
enhancing customer loyalty and implementing customer-oriented strategies. To get a decent
relationship with the customer, a business organization needs to collect data and analyze the data.
With data mining technologies, the collected data can be used for analytics.

Data Mining in Fraud detection:

Billions of dollars are lost to the action of frauds. Traditional methods of fraud detection are a
little bit time consuming and sophisticated. Data mining provides meaningful patterns and
turning data into information. An ideal fraud detection system should protect the data of all the
users. Supervised methods consist of a collection of sample records, and these records are
classified as fraudulent or non-fraudulent. A model is constructed using this data, and the
technique is made to identify whether the document is fraudulent or not.

Data Mining in Lie Detection:

Apprehending a criminal is not a big deal, but bringing out the truth from him is a very
challenging task. Law enforcement may use data mining techniques to investigate offenses,
monitor suspected terrorist communications, etc. This technique includes text mining also, and it
seeks meaningful patterns in data, which is usually unstructured text. The information collected
from the previous investigations is compared, and a model for lie detection is constructed.

Data Mining Financial Banking:

The Digitalization of the banking system is supposed to generate an enormous amount of data
with every new transaction. The data mining technique can help bankers by solving business-
related problems in banking and finance by identifying trends, casualties, and correlations in
business information and market costs that are not instantly evident to managers or executives
because the data volume is too large or are produced too rapidly on the screen by experts. The
manager may find these data for better targeting, acquiring, retaining, segmenting, and maintain
a profitable customer.
6
Page
2. Knowledge Discovery Process (KDD Process)
Data mining is the core part of the knowledge discovery process.
KDD is a process of finding knowledge in data, it does this by using data mining methods
(algorithms) in order to extract demanding knowledge from large amount of data.

Knowledge Discovery Process (KDD)

Knowledge Discovery Process may consist of the following steps:

1 Data cleaning -
First step in the Knowledge Discovery Process is Data cleaning in which noise and inconsistent
data is removed.

2 Data Integration -
Second step is Data Integration in which multiple data sources are combined.

3 Data Selection -
7

Next step is Data Selection in which data relevant to the analysis task are retrieved from the
Page

database.
4 Data Transformation -
In Data Transformation, data are transformed into forms appropriate for mining by performing
summary or aggregation operations.

5 Data Mining -
In Data Mining, data mining methods (algorithms) are applied in order to extract data patterns.

6 Pattern Evaluation -
In Pattern Evaluation, data patterns are identified based on some interesting measures.

7 Knowledge Presentation -
In Knowledge Presentation, knowledge is represented to user using many knowledge
representation techniques.

3. Data Mining Techniques

Data Mining Techniques

1. Classification:
This analysis is used to retrieve important and relevant information about data, and metadata.
This data mining method helps to classify data in different classes.
8
Page
2. Clustering:
Clustering analysis is a data mining technique to identify data that are like each other. This
process helps to understand the differences and similarities between the data.

3. Regression:
Regression analysis is the data mining method of identifying and analyzing the relationship
between variables. It is used to identify the likelihood of a specific variable, given the presence
of other variables.

4. Association Rules:
This data mining technique helps to find the association between two or more Items. It discovers
a hidden pattern in the data set.

5. Outer detection:
This type of data mining technique refers to observation of data items in the dataset which do not
match an expected pattern or expected behavior. This technique can be used in a variety of
domains, such as intrusion, detection, fraud or fault detection, etc. Outer detection is also called
Outlier Analysis or Outlier mining.

6. Sequential Patterns:
This data mining technique helps to discover or identify similar patterns or trends in transaction
data for certain period.

7. Prediction:
Prediction has used a combination of the other techniques of data mining like trends, sequential
patterns, clustering, classification, etc. It analyzes past events or instances in a right sequence for
predicting a future event.

3.1 Benefits of Data Mining:

 Data mining technique helps companies to get knowledge-based information.
 Data mining helps organizations to make the profitable adjustments in operation and
production.

9

The data mining is a cost-effective and efficient solution compared to other statistical
Page

data applications.
 Data mining helps with the decision-making process.
 Facilitates automated prediction of trends and behaviors as well as automated discovery
of hidden patterns.
 It can be implemented in new systems as well as existing platforms
 It is the speedy process which makes it easy for the users to analyze huge amount of data
in less time.

4. Multidimensional Data Model

A multidimensional model views data in the form of a data-cube. A data cube enables data to be
modeled and viewed in multiple dimensions. It is defined by dimensions and facts.
The dimensions are the perspectives or entities concerning which an organization keeps records.
A multidimensional database allows to rapidly and reliably providing data-related responses to
complicated market questions. The Multidimensional Data Model can be defined as a way to
arrange the data in the database, to help structure and organize the contents of the database. The
Multidimensional Data Model can include two or three dimensions of objects from the database
structure, versus a system of one dimension, such as a list.
In organisations, it is usually used for objective findings and report production, which can be
used as the primary source for imperative decision-making processes. Usually, this model is
extended to applications working with OLAP techniques (Online Analytical Processing).

10
Page
4.1 How does the Multidimensional Data Model work?
The Multidimensional Data Model, like every other system, often operates based on preset steps
to preserve the same pattern in the industry and to allow the database structures already built or
developed to be reusable. Any project should go all the way through the steps below to construct
a multidimensional data model.
 Congregating the requirements from the client
 Categorizing the various modules of the system
 Spotting the various dimensions based on which the system needs to be designed
 Drafting the real-time dimensions and the corresponding properties
 Discovering the facts from the already listed dimensions and their properties
 Constructing the Schema to place the data, for the information gathered from the above
steps

For example, a shop may create a sales data warehouse to keep records of the store's sales for the
dimension time, item, and location. These dimensions allow the save to keep track of things, for
example, monthly sales of items and the locations at which the items were sold. Each dimension
has a table related to it, called a dimensional table, which describes the dimension further. For
example, a dimensional table for an item may contain the attributes item_name, brand, and type.
A multidimensional data model is organized around a central theme, for example, sales. This
theme is represented by a fact table. Facts are numerical measures. The fact table contains the
names of the facts or measures of the related dimensional tables.

11
Page
Consider the data of a shop for items sold per quarter in the city of Delhi. The data is shown in
the table. In this 2D representation, the sales for Delhi are shown for the time dimension
(organized in quarters) and the item dimension (classified according to the types of an item sold).
The fact or measure displayed in rupee_sold (in thousands).

Now, if we want to view the sales data with a third dimension, For example, suppose the data
according to time and item, as well as the location is considered for the cities Chennai, Kolkata,
Mumbai, and Delhi. These 3D data are shown in the table. The 3D data of the table are
represented as a series of 2D tables.

12
Page
Conceptually, it may also be represented by the same data in the form of a 3D data cube, as
shown in fig:

5. Data Warehousing - OLAP

Online Analytical Processing Server (OLAP) is based on the multidimensional data model. It
allows managers, and analysts to get an insight of the information through fast, consistent, and
interactive access to information. This chapter cover the types of OLAP, operations on OLAP,
difference between OLAP, and statistical databases and OLTP.

5.1 Types of OLAP Servers

We have four types of OLAP servers −
 Relational OLAP (ROLAP)

 Multidimensional OLAP (MOLAP)

 Hybrid OLAP (HOLAP)

 Specialized SQL Servers

13
Page
Relational OLAP
ROLAP servers are placed between relational back-end server and client front-end tools. To
store and manage warehouse data, ROLAP uses relational or extended-relational DBMS.

ROLAP includes the following −

 Implementation of aggregation navigation logic.

 Optimization for each DBMS back end.

 Additional tools and services.

Multidimensional OLAP
MOLAP uses array-based multidimensional storage engines for multidimensional views of data.
With multidimensional data stores, the storage utilization may be low if the data set is sparse.
Therefore, many MOLAP server use two levels of data storage representation to handle dense
and sparse data sets.

Hybrid OLAP
Hybrid OLAP is a combination of both ROLAP and MOLAP. It offers higher scalability of
ROLAP and faster computation of MOLAP. HOLAP servers allows to store the large data
volumes of detailed information. The aggregations are stored separately in MOLAP store.

Specialized SQL Servers

Specialized SQL servers provide advanced query language and query processing support for
SQL queries over star and snowflake schemas in a read-only environment.

OLAP Operations
Since OLAP servers are based on multidimensional view of data, we will discuss OLAP
operations in multidimensional data.

Here is the list of OLAP operations −

 Roll-up

 Drill-down

 Slice and dice

 Pivot (rotate)
Page
Roll-up
Roll-up performs aggregation on a data cube in any of the following ways −
 By climbing up a concept hierarchy for a dimension

 By dimension reduction

The following diagram illustrates how roll-up works.

 Roll-up is performed by climbing up a concept hierarchy for the dimension location.

 Initially the concept hierarchy was "street < city < province < country".
 On rolling up, the data is aggregated by ascending the location hierarchy from the level
of city to the level of country.
 The data is grouped into cities rather than countries.
15

 When roll-up is performed, one or more dimensions from the data cube are removed.
Page
Drill-down
Drill-down is the reverse operation of roll-up. It is performed by either of the following ways −
 By stepping down a concept hierarchy for a dimension
 By introducing a new dimension.
The following diagram illustrates how drill-down works −

 Drill-down is performed by stepping down a concept hierarchy for the dimension time.
 Initially the concept hierarchy was "day < month < quarter < year."
 On drilling down, the time dimension is descended from the level of quarter to the level
of month.
 When drill-down is performed, one or more dimensions from the data cube are added.
 It navigates the data from less detailed data to highly detailed data.

Slice
16

The slice operation selects one particular dimension from a given cube and provides a new sub-
Page

cube. Consider the following diagram that shows how slice works.
 Here Slice is performed for the dimension "time" using the criterion time = "Q1".
 It will form a new sub-cube by selecting one or more dimensions.
Dice
Dice selects two or more dimensions from a given cube and provides a new sub-cube. Consider
the following diagram that shows the dice operation.

17
Page
The dice operation on the cube based on the following selection criteria involves three
dimensions.
 (location = "Toronto" or "Vancouver")
 (time = "Q1" or "Q2")
 (item =" Mobile" or "Modem")

Pivot
The pivot operation is also known as rotation. It rotates the data axes in view in order to provide
an alternative presentation of data. Consider the following diagram that shows the pivot
operation.

18
Page
OLAP vs OLTP

Sr. No. Data Warehouse (OLAP) Operational Database (OLTP)

1 Involves historical processing of Involves day-to-day processing.

information.

2 OLAP systems are used by knowledge OLTP systems are used by clerks, DBAs,
workers such as executives, managers or database professionals.
and analysts.

3 Useful in analyzing the business. Useful in running the business.

4 It focuses on Information out. It focuses on Data in.

5 Based on Star Schema, Snowflake, Based on Entity Relationship Model.

Schema and Fact Constellation
Schema.

6 Contains historical data. Contains current data.

7 Provides summarized and consolidated Provides primitive and highly detailed

data. data.

8 Provides summarized and Provides detailed and flat relational view

multidimensional view of data. of data.

9 Number or users is in hundreds. Number of users is in thousands.

10 Number of records accessed is in Number of records accessed is in tens.

millions.
Page
11 Database size is from 100 GB to 1 TB Database size is from 100 MB to 1 GB.

12 Highly flexible. Provides high performance.

References:
1. Decision support system, EIS, 2000
2. W.H.Inmon, “Building Data Warehousing”, Willey, 1998.
3. Han, Jiawei, Kamber, Michelinal, “ Data Mining Concepts & Techniques”, Harcourt
India, 2001
4. https://www.javatpoint.com/data-mining
5. http://www.lastnightstudy.com/Show?id=30/What-is-Data-Mining?
6. https://www.includehelp.com/data-warehouse/multidimensional-data-model.aspx
7. https://www.tutorialspoint.com/dwh/dwh_olap.htm

20
Page

Sample Report For CFA in APA
100% (1)
Sample Report For CFA in APA
6 pages
Difference Between Quantative and Qualative Research
100% (1)
Difference Between Quantative and Qualative Research
13 pages
Unit 2 (DWDM)
No ratings yet
Unit 2 (DWDM)
40 pages
Data Mining Techniques Unit-1
No ratings yet
Data Mining Techniques Unit-1
122 pages
Notes DATA MINING MBA III
No ratings yet
Notes DATA MINING MBA III
8 pages
L_1 Data Mining
No ratings yet
L_1 Data Mining
17 pages
Data Mining1
No ratings yet
Data Mining1
37 pages
Data Mining Tutorial
No ratings yet
Data Mining Tutorial
30 pages
Unit 1
No ratings yet
Unit 1
27 pages
Data Mining-Introduction
No ratings yet
Data Mining-Introduction
8 pages
Data Mining Unit 1(Msc Ds 3 Sem)
No ratings yet
Data Mining Unit 1(Msc Ds 3 Sem)
119 pages
Data Mining
No ratings yet
Data Mining
89 pages
Lps Week 16 Iatb
No ratings yet
Lps Week 16 Iatb
5 pages
DM Material
No ratings yet
DM Material
98 pages
Data mining M1
No ratings yet
Data mining M1
64 pages
data_mining
No ratings yet
data_mining
22 pages
Motivation of Data Mining
No ratings yet
Motivation of Data Mining
4 pages
Unit II Data Mining
No ratings yet
Unit II Data Mining
8 pages
Data Mining and Data Warehousing Unit 3 Part 1
No ratings yet
Data Mining and Data Warehousing Unit 3 Part 1
13 pages
Data Mining and Data Warehousing
No ratings yet
Data Mining and Data Warehousing
12 pages
Data Mining
No ratings yet
Data Mining
14 pages
Final Document
No ratings yet
Final Document
25 pages
Data Mining Tutorial - Javatpoint
No ratings yet
Data Mining Tutorial - Javatpoint
12 pages
Data Mining AND Warehousing: Abstract
No ratings yet
Data Mining AND Warehousing: Abstract
12 pages
18mca52c U1
No ratings yet
18mca52c U1
17 pages
Data Mining Unit 1
No ratings yet
Data Mining Unit 1
24 pages
1 ST Review Document
No ratings yet
1 ST Review Document
37 pages
Data Mining: Concepts and Techniques
No ratings yet
Data Mining: Concepts and Techniques
46 pages
DM-Unit_1
No ratings yet
DM-Unit_1
13 pages
DM Mod 1
No ratings yet
DM Mod 1
17 pages
Data Mining Notes
No ratings yet
Data Mining Notes
46 pages
Data Mining Notes
No ratings yet
Data Mining Notes
21 pages
Data Warehousing&Dat Mining
No ratings yet
Data Warehousing&Dat Mining
12 pages
Data Mining
No ratings yet
Data Mining
8 pages
DMW Notes by Me
No ratings yet
DMW Notes by Me
45 pages
Why We Need Data Mining?
No ratings yet
Why We Need Data Mining?
39 pages
Introduction to Data Mining_125604
No ratings yet
Introduction to Data Mining_125604
7 pages
Adm Unit - 1
No ratings yet
Adm Unit - 1
62 pages
Data Mining: M.P.Geetha, Department of CSE, Sri Ramakrishna Institute of Technology, Coimbatore
No ratings yet
Data Mining: M.P.Geetha, Department of CSE, Sri Ramakrishna Institute of Technology, Coimbatore
115 pages
Module 3
No ratings yet
Module 3
187 pages
Data Mining L1,2
No ratings yet
Data Mining L1,2
26 pages
Data Mining and Its Applications
No ratings yet
Data Mining and Its Applications
60 pages
Data Mining Unit 1
No ratings yet
Data Mining Unit 1
71 pages
Presentation On Data Mining
100% (1)
Presentation On Data Mining
51 pages
Data Mining 445545
No ratings yet
Data Mining 445545
11 pages
A Techinical Paper: Tupimakadia1@yahoo - Co.in Yamu - 4u1985@yahoo - Co.in
No ratings yet
A Techinical Paper: Tupimakadia1@yahoo - Co.in Yamu - 4u1985@yahoo - Co.in
14 pages
Unit 1 DMDW
No ratings yet
Unit 1 DMDW
57 pages
Data Mining First draft
No ratings yet
Data Mining First draft
84 pages
Unit 4 New Database Applications and Environments: by Bhupendra Singh Saud
No ratings yet
Unit 4 New Database Applications and Environments: by Bhupendra Singh Saud
14 pages
Data Mining: The Basic Concept
No ratings yet
Data Mining: The Basic Concept
23 pages
Data Mining-CH5
No ratings yet
Data Mining-CH5
49 pages
Unit 1 Datamining For Business Intelligence
No ratings yet
Unit 1 Datamining For Business Intelligence
101 pages
DM Module1
No ratings yet
DM Module1
15 pages
UNIT 1 - Lecture 1 - Introduction To Data Mining
No ratings yet
UNIT 1 - Lecture 1 - Introduction To Data Mining
62 pages
Datamining With Big Data - Siva
No ratings yet
Datamining With Big Data - Siva
69 pages
DWM Unit II
No ratings yet
DWM Unit II
76 pages
Data Mining Practical 123
No ratings yet
Data Mining Practical 123
26 pages
Mehrdad Jalali: Jalali@mshdiau - Ac.ir Jalali - Mshdiau.ac - Ir
No ratings yet
Mehrdad Jalali: Jalali@mshdiau - Ac.ir Jalali - Mshdiau.ac - Ir
27 pages
CH 1
No ratings yet
CH 1
66 pages
UNIT-1 Introduction To Data Mining
No ratings yet
UNIT-1 Introduction To Data Mining
29 pages
Unit - 4 Introduction To Data Mining
No ratings yet
Unit - 4 Introduction To Data Mining
71 pages
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
Missing Data Techniques - UCLA
No ratings yet
Missing Data Techniques - UCLA
66 pages
Tugas Metode Kuantitatif - Dicky Wahyudi
No ratings yet
Tugas Metode Kuantitatif - Dicky Wahyudi
9 pages
Master Thesis Topics in Communication Engineering
100% (1)
Master Thesis Topics in Communication Engineering
6 pages
What Drives The Development of Life Insurance Sect
No ratings yet
What Drives The Development of Life Insurance Sect
15 pages
Econometrics: Chapter 6: Multiple Regression Model
No ratings yet
Econometrics: Chapter 6: Multiple Regression Model
23 pages
Bam 212 2023
No ratings yet
Bam 212 2023
116 pages
Data Science and Machine Learning
No ratings yet
Data Science and Machine Learning
13 pages
Valn 10 Ip
No ratings yet
Valn 10 Ip
6 pages
Grade 6 Report Card Example
No ratings yet
Grade 6 Report Card Example
3 pages
Unit 1 Business Intelligence Introduction
No ratings yet
Unit 1 Business Intelligence Introduction
8 pages
Regression Analysis: Source SS DF MS F P-Value
No ratings yet
Regression Analysis: Source SS DF MS F P-Value
4 pages
Analysing Bicycle Route Potential Towards Sustainable Transport in Ipoh City
No ratings yet
Analysing Bicycle Route Potential Towards Sustainable Transport in Ipoh City
10 pages
Journal of Periodontology - 2018 - Needleman - Mean Annual Attachment Bone Level and Tooth Loss A Systematic Review
No ratings yet
Journal of Periodontology - 2018 - Needleman - Mean Annual Attachment Bone Level and Tooth Loss A Systematic Review
20 pages
SHORT GRADE 11 FABM-2-Reviewer
0% (1)
SHORT GRADE 11 FABM-2-Reviewer
5 pages
Sr. Analyst JD - Gridlex
No ratings yet
Sr. Analyst JD - Gridlex
3 pages
Introduction To Case Study and Case Stud
No ratings yet
Introduction To Case Study and Case Stud
14 pages
Review of Related Literature RII
No ratings yet
Review of Related Literature RII
4 pages
Defining Accountability
No ratings yet
Defining Accountability
6 pages
Data Cube Technology
No ratings yet
Data Cube Technology
20 pages
Chambala Tea Factory
No ratings yet
Chambala Tea Factory
80 pages
Finding Answers Through Data Collection
No ratings yet
Finding Answers Through Data Collection
18 pages
MIA Competency Framework Exposure Draft
No ratings yet
MIA Competency Framework Exposure Draft
58 pages
Introduction To Clinical Research For Residents (16.9.14) Hani Tamim (FC1)
No ratings yet
Introduction To Clinical Research For Residents (16.9.14) Hani Tamim (FC1)
103 pages
Mid-Sem Model Answer 7
No ratings yet
Mid-Sem Model Answer 7
5 pages
Final Minor Project
No ratings yet
Final Minor Project
38 pages
ANOVAangel
No ratings yet
ANOVAangel
24 pages
Unit - Guide - ACCG3040 - 2022 - Session 1, in Person-Scheduled-Weekday, North Ryde
No ratings yet
Unit - Guide - ACCG3040 - 2022 - Session 1, in Person-Scheduled-Weekday, North Ryde
11 pages
Frontlearners Lesson Exemplar Math AdaptedReleasedItems
No ratings yet
Frontlearners Lesson Exemplar Math AdaptedReleasedItems
46 pages

KM Notes Unit-3

Uploaded by

KM Notes Unit-3

Uploaded by

MAHARANA PRATAP GROUP OF INSTITUTIONS

KOTHI MANDHANA, KANPUR

Subject Name : Knowledge Management

Multi- Dimensional Analysis

1.1 Data Mining Architecture

Data Mining Engine:

1.2 Types of Database

reporting, and organization.

1.3 Data Mining Applications

promotions that help the organization to attract the customer.

Data Mining in Healthcare:

Data Mining in Market Basket Analysis:

Data mining in Education:

Data Mining in Manufacturing Engineering:

Data Mining in CRM (Customer Relationship Management):

Data Mining in Fraud detection:

Data Mining in Lie Detection:

Data Mining Financial Banking:

Knowledge Discovery Process (KDD)

3. Data Mining Techniques

Data Mining Techniques

3.1 Benefits of Data Mining:

4. Multidimensional Data Model

5. Data Warehousing - OLAP

5.1 Types of OLAP Servers

 Multidimensional OLAP (MOLAP)

 Hybrid OLAP (HOLAP)

 Specialized SQL Servers

ROLAP includes the following −

 Optimization for each DBMS back end.

 Additional tools and services.

Specialized SQL Servers

Here is the list of OLAP operations −

 Slice and dice

The following diagram illustrates how roll-up works.

 Roll-up is performed by climbing up a concept hierarchy for the dimension location.

Sr. No. Data Warehouse (OLAP) Operational Database (OLTP)

1 Involves historical processing of Involves day-to-day processing.

3 Useful in analyzing the business. Useful in running the business.

4 It focuses on Information out. It focuses on Data in.

5 Based on Star Schema, Snowflake, Based on Entity Relationship Model.

6 Contains historical data. Contains current data.

7 Provides summarized and consolidated Provides primitive and highly detailed

8 Provides summarized and Provides detailed and flat relational view

9 Number or users is in hundreds. Number of users is in thousands.

10 Number of records accessed is in Number of records accessed is in tens.

12 Highly flexible. Provides high performance.

You might also like