Ccw331 Unit 2
Ccw331 Unit 2
UNIT II
BUSINESS INTELLIGENCE
You can create performance benchmarks, spot market trends, increase compliance,
and improve almost every aspect of your business with better businessintelligence
As part of the BI process, organizations collect data from internal IT systems and external
sources, prepare it for analysis, run queries against the data and create data
visualizations, BI dashboards and reports to make the analytics results available to
business users for operational decision-making and strategic planning
How BI works
Businesses and organizations have questions and goals. To answer these questions
and track performance against these goals, it is essential to gather the necessary data,
analyze it, and determine which actions to take to reach their goals.
On the technical side, raw data is collected from business systems. Data is processed
and then stored in data warehouses, the cloud, applications, and files.
Once it’s stored, users can access the data, starting the analysis process to answer
business questions.
BI platforms also offer data visualization tools, which convert data into charts or
graphs, as well as presenting to any key stakeholders or decision-makers.
UNIT-2- Page 1 of 23
CCW331
BI Processes
Business Intelligence has evolved to include more processes and activities to help
improve performance. These processes include:
Data mining: Using databases, statistics, and machine learning (ML) to uncover
trends in large datasets
Reporting: Sharing data analysis to stakeholders so they can draw conclusions and
make decisions
Performance metrics and benchmarking: Comparing current performance data to
historical data to track performance against goals, typically using customized
dashboards
Descriptive analytics: Using preliminary data analysis to find out what happened
Querying: Asking the data-specific questions, BI pulling the answers from the data
sets
Statistical analysis: Taking the results from descriptive analytics and further
exploring the data using statistics such as how this trend happened and why
Data visualization: Turning data analysis into visual representations such as charts,
graphs, and histograms to more easily consume data
Visual analysis: Exploring data through visual storytelling to communicate insights
on the fly and stay in the flow of analysis
Data preparation: Compiling multiple data sources, identifying the dimensions and
measurements, and preparing it for data analysis
UNIT-2- Page 2 of 23
CCW331
Benefits of BI
Data-driven business decisions: The ability to drive business decisions with data
is the central benefit of BI. A strong BI strategy can deliver accurate data and reporting
capabilities faster to business users to help them make better business decisions in a
more timely fashion.
Faster analysis and intuitive dashboards: BI improves reporting efficiency by
condensing reports into dashboards that are easy for non-technical users to analyze,
saving them time when seeking to glean insights from data.
Increased organizational efficiency: BI can help provide holistic views of business
operations, giving leaders the ability to benchmark results against larger
organizational goals and identify areas of opportunity.
Improved customer experience: Ready access to data can help employees charged
with customer satisfaction provide better experiences.
Improved employee satisfaction: Providing business users access to data without
having to contact analysts or IT can reduce friction, increase productivity, and
facilitate faster results.
Trusted and governed data: Modern BI platforms can combine internal databases
with external data sources into a single data warehouse, allowing departments across
an organization to access the same data at one time.
Increased competitive advantage: A sound BI strategy can help businesses monitor
their changing market and anticipate customer needs.
Disadvantages of BI include:
Complexity: The implementation and upkeep of business intelligence systems can be
extremely difficult and complicated. This may be a drawback for companies with
constrained IT resources.
High costs: Some businesses find it prohibitively expensive to implement and
purchase business intelligence technologies.
Business intelligence strongly depends on accurate and current data. The insights
produced by business intelligence technologies could not be accurate if the data is
inconsistent, erroneous, or incomplete.
Data Security: Business intelligence systems handle and store a lot of sensitive data,
which, if not adequately protected, is susceptible to security breaches.
Dependence on IT: Because business intelligence solutions frequently rely largely
on IT assistance, it may be challenging for enterprises to quickly get the data they
require.
Limited scalability: For firms with huge data volumes, business intelligence
solutions may not be able to handle enormous amounts of data.
UNIT-2- Page 3 of 23
CCW331
Operational Level: Operation decisions are made at this level. Operational decisions
help in operating the system.
A data warehouse is a type of data management system that is designed to enable and
support business intelligence (BI) activities, especially analytics.
Data warehouses are solely intended to perform queries and analysis and often contain
large amounts of historical data
UNIT-2- Page 4 of 23
CCW331
BI VS Data Warehouse
Data Marts are analytical record stores designed to focus on particular business
functions for a specific community within an organization. Data marts are derived from
subsets of data in a data warehouse, though in the bottom-up data warehouse design
UNIT-2- Page 5 of 23
CCW331
methodology, the data warehouse is created from the union of organizational data marts.
The fundamental use of a data mart is Business Intelligence (BI) applications. BI is used
to gather, store, access, and analyze record. It can be used by smaller businesses to utilize
the data they have accumulated since it is less expensive than implementing a data
warehouse.
A dependent data marts is a logical subset of a physical subset of a higher data warehouse.
According to this technique, the data marts are treated as the subsets of a data warehouse.
In this technique, firstly a data warehouse is created from which further various data
marts can be created. These data mart are dependent on the data warehouse and extract
the essential record from it.
UNIT-2- Page 6 of 23
CCW331
In this technique, as the data warehouse creates the data mart; therefore, there is no
need for data mart integration. It is also known as a top-down approach.
In this approach, as all the data marts are designed independently; therefore, the
integration of data marts is required. It is also termed as a bottom-up approach as the
data marts are integrated to develop a data warehouse.
Designing
The design step is the first in the data mart process. This phase covers all of the functions
from initiating the request for a data mart through gathering data about the requirements
and developing the logical and physical design of the data mart.
UNIT-2- Page 7 of 23
CCW331
Creating the physical database and logical structures such as tablespaces associated
with the data mart.
creating the schema objects such as tables and indexes describe in the design step.
Determining how best to set up the tables and access structures.
Populating
This step includes all of the tasks related to the getting data from the source, cleaning it
up, modifying it to the right format and level of detail, and moving it into the data mart.
Accessing
This step involves putting the data to use: querying the data, analyzing it, creating
reports, charts and graphs and publishing them.
Set up and intermediate layer (Meta Layer) for the front-end tool to use. This layer
translates database operations and objects names into business conditions so that the
end-clients can interact with the data mart using words which relates to the business
functions.
Set up and manage database architectures like summarized tables which help queries
agree through the front-end tools execute rapidly and efficiently.
Managing
This step contains managing the data mart over its lifetime. In this step,
management functions are performed as:
Providing secure access to the data.
Managing the growth of the data.
Optimizing the system for better performance.
Ensuring the availability of data event with system failures.
UNIT-2- Page 8 of 23
CCW331
It may hold multiple subject areas. It holds only one subject area. For example,
Finance or Sales.
In data warehousing, Fact constellation is In Data Mart, Star Schema and Snowflake
used. Schema are used.
UNIT-2- Page 9 of 23
CCW331
Effective knowledge management system typically goes through three main steps:
Knowledge Creation: During this step, organizations identify and document any
existing or new knowledge that they want to circulate across the company.
Tacit knowledge:
This type of knowledge is typically acquired through experience, and it is
intuitively understood.
As a result, it is challenging to articulate and codify, making it difficult to transfer
this information to other individuals.
Examples of tacit knowledge can include language, facial recognition, or leadership
skills.
UNIT-2- Page 10 of 23
CCW331
Implicit knowledge:
While some literature equivocates implicit knowledge to tacit knowledge, some
academics break out this type separately, expressing that the definition of tactic
knowledge is more nuanced.
While tacit knowledge is difficult to codify, implicit knowledge does not necessarily
have this problem.
Instead, implicit information has yet to be documented. It tends to exist within
processes, and it can be referred to as “know-how” knowledge.
Explicit knowledge:
Explicit knowledge is captured within various document types such as manuals,
reports, and guides, allowing organizations to easily share knowledge across
teams.
This type of knowledge is perhaps the most well-known and examples of it include
knowledge assets such as databases, white papers, and case studies.
This form of knowledge is important to retain intellectual capital within an
organization as well as facilitate successful knowledge transfer to new employees.
KM Tools
There are a number tools that organizations utilize to reap the benefits of knowledge
management. Examples of knowledge management systems can include:
UNIT-2- Page 11 of 23
CCW331
Wikis can be a popular knowledge management tool given its ease of use. They make
it easy to upload and edit information, but this ease can lead to concerns about
misinformation as workers may update them with incorrect or outdated information.
Data warehouses aggregate data from different sources into a single, central,
consistent data store to support data analysis, data mining, artificial intelligence (AI),
and machine learning. Data is extracted from these repositories so that companies can
derive insights, empowering employees to make data-driven decisions.
KM benefits
Identification of skill gaps: When teams create relevant documentation around
implicit or tacit knowledge or consolidate explicit knowledge, it can highlight gaps in
core competencies across teams. This provides valuable information to management
to form new organizational structures or hire additional resources.
Make better informed decisions: Knowledge management systems armindividuals
and departments with knowledge. By improving accessibility to current and historical
enterprise knowledge, your teams can upskill and make more information-driven
decisions that support business goals.
Maintains enterprise knowledge: If your most knowledgeable employees left
tomorrow, what would your business do? Practicing internal knowledge management
enables businesses to create an organizational memory. Knowledge held by your
long-term employees and other experts, then make it accessible to your wider team.
Operational efficiencies: Knowledge management systems create a go-to place that
enable knowledge workers to find relevant information more quickly. This, in turn,
reduces the amount of time on research, leading to faster decision-making and cost-
savings through operational efficiencies. Increase productivity not only saves time,
but also reduces costs.
Increased collaboration and communication: Knowledge management systems
and organizational cultures work together to build trust among team members.These
information systems provide more transparency among workers, creating more
understanding and alignment around common goals. Engaged leadership and open
communication create an environment for teams to embrace innovation and
feedback.
Data Security: Knowledge management systems enable organizations to customize
permission control, viewership control and the level of document-security to ensure
that information is shared only in the correct channels or with selected individuals.
Give your employees the autonomy access knowledge safely and with confidence.
UNIT-2- Page 12 of 23
CCW331
Decisions made at this level usually involve significant expenditure. However, they
are generally non-repetitive in nature and are taken only after careful analysis and
evaluation of many alternatives.
Tactical decisions
Tactical decisions (or semistructured decisions) occur with greater frequency (e.g.,
weekly or monthly) and fall into the mid-management level. Often, they relateto the
implementation of strategic decisions.
The impact of these types of decisions is medium regarding risk to the organization
and impact on profitability.
Operational decisions
Operational decisions (or structured decisions) usually happen frequently (e.g.,
daily or hourly), relate to day-to-day op•erations of the enterprise, and have a lesser
impact on the organization. Operational decisions determine the day-to-day
profitability of the business, how effectively it retains customers, or how well it
manages risk.
Summary
UNIT-2- Page 13 of 23
CCW331
Unstructured decisions are those in which the decision maker must providejudgement,
evaluation, and insight to solve the problem.
Structured decisions, by contrast, are repetitive and routine, and they involve a definite
procedure for handling them so that they do not have to be treated each time as if they
were new.
Many decisions have elements of both types of decisions and are semistructured, where
only part of the problem has a clear-cut answer provided by an accepted procedure. In
general, structured decisions are most prevalent at lower organizational levels, whereas
unstructured problems are most common at higher levels of the firms.
UNIT-2- Page 14 of 23
CCW331
There are many different decision-making methodologies, but most share at least
five steps in common:
UNIT-2- Page 15 of 23
CCW331
Characteristics of DSS
The flexibility of the DSS is extremely beneficial for users who travel frequently.
This gives them the opportunity to be well-informed at all times, providing them
the ability to make the best decisions for their company and customers on the go
or even on the spot.
DSS Uses
In organizations, a decision support system (DSS) analyzes and synthesizes vast
amounts of data to assist in decision-making.
With this information, it produces reports that may project revenue, sales, or manage
inventory.
Through the integration of multiple variables, a DSS can produce a number of
different outcomes based on the company’s previous data and current inputs.
Decision support systems help in making more informed decisions. Often used by
upper and mid-level management, decision support systems are used to make
actionable decisions, or produce multiple possible outcomes based on current and
historical company data.
UNIT-2- Page 16 of 23
CCW331
At the same time, decision support systems can be used to produce reports for
customers that are easily digestible and can be adjusted based on user specifications.
Decision support systems are used in a broad array of industries. Example uses
include:
GPS route planning. A DSS can be used to plan the fastest and best routes between
two points by analyzing the available options. These systems often include the
capability to monitor traffic in real-time to route around congestion.
Crop planning. Farmers use DSS to help them determine the best time to plant,
fertilize, and reap their crops. Bayer Crop Science has applied analytics and decision-
support to every element of its business, including the creation of “virtual factories”
to perform “what-if” analyses at its corn manufacturing sites.
Clinical DSS. These systems help clinicians diagnose their patients. Penn Medicine
has created a clinical DSS that helps it get ICU patients off ventilators faster.
UNIT-2- Page 17 of 23
CCW331
Topic 16 – OLAP
OLAP implement the multidimensional analysis of business information and support the
capability for complex estimations, trend analysis, and sophisticated data modeling. It is
rapidly enhancing the essential foundation for Intelligent Solutions containing Business
Performance Management, Planning, Budgeting, Forecasting, Financial Documenting,
Analysis, Simulation-Models, Knowledge Discovery, and Data Warehouses Reporting.
OLAP enables end-clients to perform ad hoc analysis of record in multiple dimensions,
providing the insight and understanding they require for better decision making.
Production planning
Defect analysis
OLAP cubes have two main purposes. The first is to provide business users with a data
model more intuitive to them than a tabular model. This model is called a Dimensional
Model.
The second purpose is to enable fast query response that is usually difficult to achieve
using tabular models.
OLAP working
It pre-calculates most of the queries that are typically very hard to execute over tabular
databases, namely aggregation, joining, and grouping. These queries are calculated
during a process that is usually called 'building' or 'processing' of the OLAP cube. This
process happens overnight, and by the time end users get to work - data will have been
updated.
UNIT-2- Page 18 of 23
CCW331
OLAP Guidelines
3) Accessibility: It provides access only to the data that is actually required to perform
the particular analysis, present a single, coherent, and consistent view to the clients. The
OLAP system must map its own logical schema to the heterogeneous physical data stores
and perform any necessary transformations. The OLAP operations should be sitting
between data sources (e.g., data warehouses) and an OLAP front-end.
4) Consistent Reporting Performance: To make sure that the users do not feel any
significant degradation in documenting performance as the number of dimensions or the
size of the database increases. That is, the performance of OLAP should not suffer as the
number of dimensions is increased. Users must observe consistent run time, response
time, or machine utilization every time a given query is run.
7) Dynamic Sparse Matrix Handling: To adapt the physical schema to the specific
analytical model being created and loaded that optimizes sparse matrix handling. When
encountering the sparse matrix, the system must be easy to dynamically assume the
distribution of the information and adjust the storage and access to obtain and maintain
a consistent level of performance.
8) Multiuser Support: OLAP tools must provide concurrent data access, data integrity,
and access security.
UNIT-2- Page 19 of 23
CCW331
and drop methods on the cells of the scientific model. It avoids the use of a menu or
multiple trips to a user interface.
12) Unlimited Dimensions and Aggregation Levels: The number of data dimensions
should be unlimited. Each of these common dimensions must allow a practically
unlimited number of customer-defined aggregation levels within any given consolidation
path.
Characteristics of OLAP
In the FASMI characteristics of OLAP methods, the term derived from the first letters of
the characteristics are:
Fast
It defines which the system targeted to deliver the most feedback to the client within
about five seconds, with the elementary analysis taking no more than one second and
very few taking more than 20 seconds.
Analysis
It defines which the method can cope with any business logic and statistical analysisthat
is relevant for the function and the user, keep it easy enough for the target client. Although
some preprogramming may be needed we do not think it acceptable if all application
definitions have to be allow the user to define new Adhoc calculations as part of the
analysis and to document on the data in any desired method, without having to program
so we excludes products (like Oracle Discoverer) that do not allow the user to define new
Adhoc calculation as part of the analysis and to document on the data in any desired
product that do not allow adequate end user-oriented calculation flexibility.
Share
It defines which the system tools all the security requirements for understanding and, if
multiple write connection is needed, concurrent update location at an appropriated level,
not all functions need customer to write data back, but for the increasing number which
does, the system should be able to manage multiple updates in a timely, secure manner.
Multidimensional
This is the basic requirement. OLAP system must provide a multidimensional conceptual
view of the data, including full support for hierarchies, as this is certainly the most logical
method to analyze business and organizations.
Information
The system should be able to hold all the data needed by the applications. Data sparsity
should be handled in an efficient manner.
UNIT-2- Page 20 of 23
CCW331
UNIT-2- Page 21 of 23
CCW331
related to sales. In some methods, it may also be feasible to identify expenditures which
produce a high return on investments (ROI). For example, recruiting a top salesperson
may contain high costs, but the revenue generated by the salesperson may justify the
investment
AVERAGE_RANK
AVERAGE_RANK orders the members of a dimension based on the values of an
expression. The function returns the sequence numbers of the dimension members.
AVERAGE_RANK assigns the same average rank to identical values. For example,
AVERAGE_RANK may return 1, 2, 3.5, 3.5, 5 for a series of five dimension members.
AVG
AVG returns the average of a selection of values calculated over a Time dimension. Use
this function to create cumulative averages and moving averages.
COUNT
COUNT tallies the number of data values identified by a selection of members in a Time
dimension.
DENSE_RANK
DENSE_RANK orders the members of a dimension based on the values of an expression.
The function returns the sequence numbers of the dimension members.
DENSE_RANK assigns the same minimum rank to identical values, and returns the results
in a sequential list. The result may be fewer ranks than values in the series. For example,
DENSE_RANK may return 1, 2, 3, 3, 4 for a series of five dimension members.
LAG
LAG returns the value from an earlier time period.
LAG_VARIANCE
LAG_VARIANCE returns the difference between values for the current time period and an
earlier period.
LEAD_VARIANCE_PERCENT
LEAD_VARIANCE_PERCENT returns the percent difference between values for the
current time period and the offset period.
MAX
MAX returns the largest of a selection of data values calculated over a Time dimension.
MIN
MIN returns the smallest of a selection of data values calculated over a Time dimension.
RANK
UNIT-2- Page 22 of 23
CCW331
RANK orders the members of a dimension based on the values of an expression. The
function returns the sequence numbers of the dimension members.
RANK assigns the same rank to identical values. For example, RANK may return 1, 2, 3,
3, 5 for a series of five dimension members.
ROW_NUMBER
ROW_NUMBER orders the members of a dimension based on the values of an expression.
The function returns the sequence numbers of the dimension members.
ROW_NUMBER assigns a unique rank to each dimension member; for identical values,
the rank is arbitrary. For example, ROW_NUMBER always returns 1, 2, 3, 4, 5 for a series
of five dimension members, even when they have the same value.
SUM
SUM returns the total of a selection of values calculated over a Time dimension. You can
use the SUM function to create period-to-date calculations
UNIT-2- Page 23 of 23