Data Visualization

Data visualization is essential in data science for understanding and presenting data effectively throughout its life cycle. It can be classified into various categories such as complexity, infographics versus data visualization, and exploratory versus explanatory visualizations, each serving different purposes and audiences. Proper classification of visualization techniques helps in selecting the most effective method based on data characteristics, relationships, and intended communication goals.

Uploaded by

namrata.valecha

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

Data Visualization

Uploaded by

namrata.valecha

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

Data Visualization

Introduction
• Data visualization has a crucial role in data science for
understanding the data.
• Data visualization can be used in all steps of the data
science life cycle to facilitate data exploration, identify
anomalies, understand relationships and trends, and
produce reports.
• The best data in the world won't be worth anything if no
one can understand it. There is not only need to collect
and analyze data, but also to present it to the end users
and other interested parties who will then act on that data.
Here’s where data visualization comes in.
Introduction
• Sometimes data does not make sense until you can look at it in a visual
form, such as with charts and plots.
• Being able to quickly visualize your data samples for yourself and others is
an important skill both in applied statistics and in applied machine
learning.
• Statistics does indeed focus on quantitative descriptions and estimations
of data. Data visualization provides an important suite of tools for gaining
a qualitative understanding.
• Data visualization can be helpful when exploring and getting to know a
dataset and can help with identifying patterns, corrupt data, outliers, and
much more.
Classifications of Visualizations
• There are several ways to categorize and think about
different kinds of visualizations. Here are four of the most
useful. The first two are unrelated to the others; the last
two are related to each other.
• Complexity
• Infographics versus Data Visualization
• Exploration versus Explanation
• Informative versus Persuasive versus Visual Art
Complexity
• One way to classify a data visualization is by counting
how many different data dimensions it represents.
• By this we mean the number of discrete types of
information that are visually encoded in a diagram.
• For example, a simple line graph may show the price of a
company’s stock on different days: that’s two data
dimensions.
• If multiple companies are shown (and therefore
compared), there are now three dimensions; if trading
volume per day is added to the graph, there are four
Infographics
• We suggest that the term infographics is useful for
referring to any visual representation of data that is:
• manually drawn (and therefore a custom treatment of the
information);
• specific to the data at hand (and therefore nontrivial to recreate
with different data);
• aesthetically rich (strong visual content meant to draw the eye
and hold interest); and
• relatively data-poor (because each piece of information must be
manually encoded).
Infographics
• Because of their manually-drawn process of creation,
infographics have the option of being aesthetically rich.
• Another consequence of their manual origins is they tend
to be limited in the amount of data they can convey,
simply due to the practical limitations of manipulating
many data points.
• Similarly, it is difficult to change or update the data in an
infographic, as any changes must be implemented
manually.
Data Visualization
• By contrast, it is suggested that the terms data
visualization and information visualization (casually, data
viz and info viz) are useful for referring to any visual representation
of data that is:
• algorithmically drawn (may have custom touches but is largely rendered
with the help of computerized methods);
• easy to regenerate with different data (the same form may be repurposed
to represent different datasets with similar dimensions or characteristics);
• often aesthetically barren (data is not decorated); and
• relatively data-rich (large volumes of data are welcome and viable, in
contrast to infographics).
• The advantage of this approach is that it is relatively simple to
update or regenerate the visualization with more or new data. While
they may show great volumes of data, information visualizations
are often less aesthetically rich than infographics.
Exploration
• Exploratory data visualizations are appropriate when you have a
whole bunch of data and you are not sure what is in it.
• When you need to get a sense of what is inside your data set,
translating it into a visual medium can help you quickly identify its
features, including interesting curves, lines, trends, or anomalous
outliers.
• Exploration is generally best done at a high level of granularity.
There may be a whole lot of noise in your data, but if you
oversimplify or strip out too much information, you could end up
missing something important.
• This type of visualization is typically part of the data
analysis phase, and is used to find the story the data has to tell
you.
Explanation
• By contrast, explanatory data visualization is appropriate when you
already know what the data has to say, and you are trying to tell that
story to somebody else.
• It could be the head of your department, a grant committee, or the
general public.
• Whoever your audience is, the story you are trying to tell (or the answer
you are trying to share) is known to you at the outset, and therefore you
can design to specifically accommodate and highlight that story.
• In other words, you will need to make certain editorial decisions about
which information stays in, and which is distracting or irrelevant and
should come out.
• This is a process of selecting focused data that will support the story you
are trying to tell.
Explanation
• If exploratory data visualization is part of the data
analysis phase, then explanatory data visualization is
part of the presentation phase.
• Such a visualization may stand on its own, or may be part
of a larger presentation, such as a speech, a newspaper
article, or a report.
• In these scenarios, there is some supporting narrative
written or verbal that further explains things.
Informative versus Persuasive versus
Visual Art
• There are three main categories of explanatory
visualizations based on the relationships between the
three necessary players: the designer, the reader, and
the data.
Informative
• An informative visualization primarily serves the
relationship between the reader and the data.
• It aims for a neutral presentation of the facts in such a
way that will educate the reader (though not necessarily
persuade him).
• Informative visualizations are often associated with broad
data sets, and seek to distill the content into a
manageably consumable form.
• Ideally, they form the bulk of visualizations that the
average person encounters on a day-to-day basis
whether that’s at work, in the newspaper, or on a service-
provider’s website.
Persuasive
• A persuasive visualization primarily serves the
relationship between the designer and the reader.
• It is useful when the designer wishes to change the
reader’s mind about something.
It represents a very specific point of view, and advocates
a change of opinion or action on the part of the reader.
• In this category of visualization, the data represented is
specifically chosen for the purpose of supporting the
designer’s point of view, and is presented carefully so as
to convince the reader of same.
Visual Art
• The third category, visual art, primarily serves the
relationship between the designer and the data.
• Visual art is unlike the previous two categories in that it often
entails unidirectional encoding of information, meaning that the
reader may not be able to decode the visual presentation to
understand the underlying information.
• Whereas both informative and persuasive visualizations are meant
to be easily decodable bidirectional in their encoding visual art
merely translates the data into a visual form.
• The designer may intend only to condense it, translate it into a new
medium, or make it beautiful; he/she may not intend for the reader
to be able to extract anything from it other than enjoyment.
choosing the appropriate
visualization technique
• Classification of visualization techniques helps in choosing the
appropriate visualization method for different datasets by providing a
structured framework to match the characteristics of the data with the
most effective visual representation.
1. Understanding the Data
• Data Type: Visualization techniques are classified based on the type of
data (e.g., categorical, numerical, time-series, geospatial). This helps in
determining whether to use bar charts, scatter plots, maps, or other
methods.
• Example: For categorical data, bar charts or pie charts are suitable, while scatter
plots work better for numerical relationships.
• Dimensionality: High-dimensional datasets require specific techniques
like heat maps, parallel coordinates, or dimensionality reduction
methods (e.g., t-SNE) for effective visualization.
• Example: Parallel coordinate plots are suitable for datasets with more than two
attributes.
2. Highlighting Relationships
• Classification helps identify whether the visualization needs to show
relationships (e.g., correlation, causation), distributions, comparisons,
or compositions.
• Example: Scatter plots are ideal for analyzing correlations, while stacked bar
charts can represent compositions.
3. Scalability
• For large datasets, the classification of techniques into methods that
handle large volumes of data (e.g., treemaps, heat maps) can guide the
selection process.
• Example: Heat maps are better suited for summarizing large datasets compared
to simple tables.
4. Purpose of Visualization
• Techniques are often classified by their purpose, such as exploratory (to
uncover patterns) or explanatory (to communicate insights).
• Example: Bullet graphs are ideal for explanatory purposes when tracking
progress toward a goal.
5. Audience and Context
• Classifications based on complexity (simple, intermediate, advanced)
help ensure that the visualization is suitable for the intended audience.
• Example: Donut charts may be more engaging for general audiences, whereas
technical teams might prefer scatter plots or box plots for detailed analysis
6. Multivariate and Linked Data
• For datasets with multiple variables or linked data points, the
classification provides techniques like scatter plot matrices or
interactive dashboards to display relationships effectively.
7. Geospatial Data
• Geographic data is often visualized using maps, and classification
identifies specific techniques (e.g., choropleth maps, bubble maps)
suited to spatial information.
Conclusion
• By systematically categorizing visualization techniques based on data
types, dimensionality, purpose, audience, and scalability, classification
ensures that the most appropriate and effective technique is chosen.
• This reduces misinterpretation, enhances communication, and aids in
uncovering meaningful insights from the dataset.

The Gamification of Learning and Instruction Kapp en 19782
100% (1)
The Gamification of Learning and Instruction Kapp en 19782
7 pages
MATH1005 Final Exam From 2021
No ratings yet
MATH1005 Final Exam From 2021
14 pages
DWDV UNIT-3 PPT
No ratings yet
DWDV UNIT-3 PPT
49 pages
DMV - UNIT 3 & 4 (1)
No ratings yet
DMV - UNIT 3 & 4 (1)
32 pages
Data Presentation
No ratings yet
Data Presentation
31 pages
Data Discovery & Visualization - New
100% (1)
Data Discovery & Visualization - New
40 pages
Da End Sem
No ratings yet
Da End Sem
5 pages
Subject Code:Mb20Ba01 Subject Name: Data Visulization For Managers Faculty Name: Dr.M.Karthikeyan
No ratings yet
Subject Code:Mb20Ba01 Subject Name: Data Visulization For Managers Faculty Name: Dr.M.Karthikeyan
34 pages
Unit-5 BDA - Data Visualization
No ratings yet
Unit-5 BDA - Data Visualization
19 pages
UNIT 5 BDT.pptx
No ratings yet
UNIT 5 BDT.pptx
132 pages
Unit III Business Analytics
No ratings yet
Unit III Business Analytics
8 pages
Lecture3434 - CAP792 - UNIT 5
No ratings yet
Lecture3434 - CAP792 - UNIT 5
25 pages
Unit II. Methods and Techniques For Data Analytics
No ratings yet
Unit II. Methods and Techniques For Data Analytics
91 pages
Data Visualization-1
No ratings yet
Data Visualization-1
29 pages
Group - 3
No ratings yet
Group - 3
24 pages
Unit 3 - LIVES Approach and Visualization - Live Session
No ratings yet
Unit 3 - LIVES Approach and Visualization - Live Session
16 pages
Data Discovery & Visualization - New
100% (1)
Data Discovery & Visualization - New
41 pages
Common Visualization Idioms
0% (1)
Common Visualization Idioms
95 pages
Visual AIDS in Technical Communication Unit 1
No ratings yet
Visual AIDS in Technical Communication Unit 1
18 pages
UNIT4
No ratings yet
UNIT4
8 pages
Unit 2 Foundations For Visualization
No ratings yet
Unit 2 Foundations For Visualization
25 pages
Information Visualization: Dr. Parvathi.R VIT University, Chennai
No ratings yet
Information Visualization: Dr. Parvathi.R VIT University, Chennai
73 pages
Dvi 1
No ratings yet
Dvi 1
41 pages
Unit 2 Data Analytics
No ratings yet
Unit 2 Data Analytics
16 pages
DV
No ratings yet
DV
30 pages
Group 5
No ratings yet
Group 5
48 pages
#CH-2.2.3
No ratings yet
#CH-2.2.3
21 pages
Data Science
No ratings yet
Data Science
59 pages
Module 4
No ratings yet
Module 4
40 pages
CSC 428_4
No ratings yet
CSC 428_4
12 pages
Reading and Writing Set 2 Assgn
No ratings yet
Reading and Writing Set 2 Assgn
16 pages
Module4 DSV
No ratings yet
Module4 DSV
89 pages
Bda - Rahul Parida
No ratings yet
Bda - Rahul Parida
15 pages
Unit 4
No ratings yet
Unit 4
21 pages
C3 Graphic Organizers
No ratings yet
C3 Graphic Organizers
25 pages
SE 7204 BIG Data Analysis Unit I Final
No ratings yet
SE 7204 BIG Data Analysis Unit I Final
66 pages
Tableau Self Notes PDF
No ratings yet
Tableau Self Notes PDF
8 pages
Intro To Data Viz 2016
No ratings yet
Intro To Data Viz 2016
25 pages
DVP Unit1
No ratings yet
DVP Unit1
44 pages
Analisis Dan Visualisasi Data - Chapter 7
No ratings yet
Analisis Dan Visualisasi Data - Chapter 7
53 pages
Storytelling With Data Reveiw
100% (1)
Storytelling With Data Reveiw
56 pages
Data Visualization
No ratings yet
Data Visualization
23 pages
Data Gathering
No ratings yet
Data Gathering
17 pages
Topic 5 - Fundamental of Data Visulization-Edit
No ratings yet
Topic 5 - Fundamental of Data Visulization-Edit
17 pages
data visual (1)
No ratings yet
data visual (1)
14 pages
Data Presentation, Analysis and Interpretation
No ratings yet
Data Presentation, Analysis and Interpretation
12 pages
Data Science Process
No ratings yet
Data Science Process
30 pages
Infographics Grade 10
100% (1)
Infographics Grade 10
28 pages
Information Visualization
No ratings yet
Information Visualization
23 pages
Data Mining & Visualization: Submitted By: Anubhooti Gupta 08PG0347
No ratings yet
Data Mining & Visualization: Submitted By: Anubhooti Gupta 08PG0347
15 pages
DS Lecture 15
No ratings yet
DS Lecture 15
44 pages
Unit 02
No ratings yet
Unit 02
112 pages
Unit V
No ratings yet
Unit V
13 pages
Unit V-Data Visualization
No ratings yet
Unit V-Data Visualization
5 pages
Edashsh
No ratings yet
Edashsh
7 pages
DVP 2
No ratings yet
DVP 2
26 pages
04 Exploring+Data+Visually Combined Lms
No ratings yet
04 Exploring+Data+Visually Combined Lms
98 pages
Data Storytelling - Course
No ratings yet
Data Storytelling - Course
29 pages
DATA VISUALIZATION SHORTS
No ratings yet
DATA VISUALIZATION SHORTS
68 pages
DV UNIT-1
No ratings yet
DV UNIT-1
8 pages
Illuminating Data: A hands on guide to data visualization in R
From Everand
Illuminating Data: A hands on guide to data visualization in R
Eman Ahmad
No ratings yet
Be Data Curious!: Be Data Curious!, #1
From Everand
Be Data Curious!: Be Data Curious!, #1
Nick Jewell
No ratings yet
Introduction to Python
No ratings yet
Introduction to Python
71 pages
Data Discretization
No ratings yet
Data Discretization
32 pages
Data Preprocessing
No ratings yet
Data Preprocessing
84 pages
BCA Lecture I
No ratings yet
BCA Lecture I
20 pages
Solutions To Quadratics Equations Worksheet
No ratings yet
Solutions To Quadratics Equations Worksheet
5 pages
PS Answers Fall2022 Merged
No ratings yet
PS Answers Fall2022 Merged
91 pages
3.6.mean and Variance of Sample Means
No ratings yet
3.6.mean and Variance of Sample Means
41 pages
Unit 2 Natural Resource' Management and Environment: Structure
No ratings yet
Unit 2 Natural Resource' Management and Environment: Structure
19 pages
Full download Financial Econometrics Methods and Models Routledge Advanced Texts in Economics and Finance 1st Edition Peijie Wang pdf docx
100% (1)
Full download Financial Econometrics Methods and Models Routledge Advanced Texts in Economics and Finance 1st Edition Peijie Wang pdf docx
55 pages
Measured-Removal-Rates PDF
No ratings yet
Measured-Removal-Rates PDF
16 pages
The Core Questions
No ratings yet
The Core Questions
4 pages
L3 - Marketing Management
No ratings yet
L3 - Marketing Management
49 pages
Project File of Web Application
No ratings yet
Project File of Web Application
16 pages
Expt_2_T24016
No ratings yet
Expt_2_T24016
13 pages
Group Project School Education Presentation
No ratings yet
Group Project School Education Presentation
11 pages
UG TRB PHYSICS SYLLABUS
No ratings yet
UG TRB PHYSICS SYLLABUS
3 pages
23-JI - Critical Thinking, Intelligence, & Unsubstantiated Beliefs - An Integrative Review
No ratings yet
23-JI - Critical Thinking, Intelligence, & Unsubstantiated Beliefs - An Integrative Review
15 pages
Earthquake Alarm With Emergency Indicator
No ratings yet
Earthquake Alarm With Emergency Indicator
11 pages
Geography Lecture Notes 14 (Chapters 1-3)
No ratings yet
Geography Lecture Notes 14 (Chapters 1-3)
20 pages
AOMSI Guidelines for Research Grant-Policy Document
No ratings yet
AOMSI Guidelines for Research Grant-Policy Document
10 pages
Alfa - POS - 57 - 95 - Customer - Display - LED8N Manual
No ratings yet
Alfa - POS - 57 - 95 - Customer - Display - LED8N Manual
4 pages
Gavish Fert Machines
No ratings yet
Gavish Fert Machines
25 pages
Hyperloop's Social Impacts
No ratings yet
Hyperloop's Social Impacts
3 pages
Module 6 - Engineering Drawings and Plans, Lab. Midterm
No ratings yet
Module 6 - Engineering Drawings and Plans, Lab. Midterm
13 pages
EAPP Booklet
No ratings yet
EAPP Booklet
11 pages
8 Pavlik Harness Article
No ratings yet
8 Pavlik Harness Article
5 pages
AA3nBlogWhoAmInMakingAChangenLifePlan 6963478a6de0768
No ratings yet
AA3nBlogWhoAmInMakingAChangenLifePlan 6963478a6de0768
5 pages
Recruitment and Selection With Answers
100% (2)
Recruitment and Selection With Answers
11 pages
Immediate download (Ebook) Theory and Practice in Microbial Enhanced Oil Recovery by Kun Sang Lee, Tae-Hyuk Kwon, Taehyung Park, Moon Sik Jeong ISBN 9780128199831, 0128199830 ebooks 2024
100% (8)
Immediate download (Ebook) Theory and Practice in Microbial Enhanced Oil Recovery by Kun Sang Lee, Tae-Hyuk Kwon, Taehyung Park, Moon Sik Jeong ISBN 9780128199831, 0128199830 ebooks 2024
40 pages
Tu Darmstadt Kumulative Dissertation
100% (2)
Tu Darmstadt Kumulative Dissertation
7 pages
4000 Brochure Web ENG 1
No ratings yet
4000 Brochure Web ENG 1
20 pages
How Different Types of Meditation Can Enhance Athl PDF
No ratings yet
How Different Types of Meditation Can Enhance Athl PDF
5 pages