UNIT 3 DV
UNIT 3 DV
Classification of Visualization Systems in Data Visualization refers to the categorization of tools and
techniques based on their functionality, data types, user interaction, and visual representation. This helps in
understanding what kind of visualization system is suitable for a specific data analysis or presentation task.
2. Hierarchical Data Visualization: For tree or nested data (e.g., tree maps).
3. Network/Graph Data Visualization: For relational data (e.g., social networks).
4. Scalar Data Visualization: For single-value data points (e.g., temperature).
5. Vector Data Visualization: For directional or multi-dimensional data (e.g., wind flow).
6. Multivariate Data Visualization: For datasets with multiple variables (e.g., scatterplot matrix).
7. Temporal Data Visualization: For time-series data (e.g., stock prices).
8. Geospatial Data Visualization: For location-based data (e.g., maps).
Static Visualization Systems: The visual is generated once and remains unchanged (e.g., an image
or PDF)..Users cannot click, zoom, filter, or modify the view..
Interactive Visualization Systems: Allow users to manipulate and explore the data dynamically.
Immersive Visualization Systems: High Interactivity [Link] can move, zoom, rotate, and
manipulate data objects. These are advanced systems that allow users to interact with data in a
highly engaging, 3D or virtual environment
o 2D Visualization Systems: Data is displayed using horizontal (X-axis) and vertical (Y-axis)
dimensions. (e.g., charts).
o 3D Visualization Systems: Data is displayed along X, Y, and Z axes. These systems allow
users to explore complex, multi-dimensional datasets
o Mixed Reality Systems: Combine real-world and computer-generated visualizations. Users
can interact with both physical and virtual objects simultaneously.
Interaction Misleading in data visualization refers to the misuse or poor design of interactive
features (like tooltips(Showing incomplete or out-of-context information in tooltips), filters, zooming,
or drill-downs), They can hide patterns that can confuse, mislead, or bias the user's understanding
of the data.
(Interaction misleading happens when interactive elements in a visualization falsify , hide, or
misrepresent data insights, causing the user to draw incorrect conclusions.)
1. Data integration. Collect raw data and turn it into clean, analytics-ready
information by performing (coping data from one location to another location) data
replication, ingestion (importing data from large different databases to single
3. Visualization design. Design begins with selecting KPIs and types of graphs,
charts, and maps that best tell your story. Keeping your visualizations clean
and simple will help users understand and work with the data.
4. Collaboration and sharing. Allow all approved users to sightsee the data
freely to uncover their own insights. Your software should allow users to embed
your visualizations in other applications and to engage with them on their mobile
devices.
Below are some of the most examples of misleading visualizations and how they can be avoided:
1. Truncated Y-Axis
A truncated Y-axis is a common mistake in data visualization where the scale of the Y-axis is
artificially shortened to make changes in the data appear more significant.
This can lead to misleading visualizations and incorrect conclusions.
Example:
To avoid this, it is important to use an appropriate scale for the Y-axis that
accurately reflects the data.
2. Cherry-Picking Data
Cherry-picking data is the act of selecting only the data that supports a desired conclusion while
ignoring data that contradicts it.
This is a common mistake in data visualization and can lead to misleading visualizations and
incorrect conclusions.
It is important to consider the context and limitations of the data when creating a visualization.
Example:
To avoid cherry-picking data, it is important to consider all relevant data when creating
a visualization.
3. Dualing Data
Dualing data refers to the practice of comparing two or more sets of data in a way that
Source: Politifact
Dualing data can occur when different sets of data are plotted on different scales or when one
set of data is highlighted while the other is not as in the example above.
This can give a one-sided picture of the relationship between the data sets and lead to
incorrect
conclusions.
Solution:
To avoid dualing data, it is important to present data in a fair and impartial way.
This can include using the same scales and axes for all sets of data and providing equal
highlighting and attention to all data sets.
Q). Explain about Visualization of one, two and multi-dimensional data, Text and Text Documents?
( External exam:2023)
I. Visualization of One-Dimensional Data:
Definition: Data with two variables, often used to explore relationships or comparisons.
2D visualization is about translating abstract (non concrete/immaterial) data into a visual format that
can be processed by the human eye with ease and speed.
Two-dimensional (2D) data visualization refers to Rows and Columns: If you're working with a dataset
in a tabular format.
Here are some common methods for visualizing two-dimensional data:
Bar Charts
Description: Bar charts represent categorical data with rectangular bars. The length of each bar corresponds
to the value it represents.
Usage: Good for comparing quantities across categories.
Example: Sales numbers for different products or departments.
Scatter Plot: For showing correlation or patterns between two continuous variables.
Line Charts:
Show trends over time by connecting data points with a continuous line. They're great for
illustrating changes and trends.
Pie Charts:
Represent data as slices of a circle, with each slice proportional to the category's contribution
to the whole. They're useful for showing percentage or proportional data.
Examples:
pie chart, bar chart, histogram, rank plot, tree map
The second examines the relationships between the variables
Examples:
scatter plot, line chart, step chart, area chart, heat map, matrices, parallel coordinates/sets,
waterfall chart
Google Charts
Display live data on your website. upload your own data and create your own visualizations.
Tableau Public:
Tableau Public is a free tool that "brings data to life" (according to their website). View others'
visualizations or create your own.
Common Scenarios
Creating Groups
A field’s members are combined using Tableau Groups. For instance, using Tableau Groups, you
can get aggregated values for Furniture and Office Supplies. Once Tableau has finished
creating Tableau Groups, the aggregated value of Furniture and Office Supplies can be
displayed in the visuals. The following is a procedure for creating Tableau Groups:
s
Step 1: Select Category from the right-clicking menu. Choose the option Create then
choose the option Group for creating Tableau Groups.
Step 2: The Create Group window appears. Enter the name of the Group data in Tableau. Choose the
members who will be grouped. Select the Group button.
Step 3: In the Edit Group Window Furniture and Office supplies are created as Tableau
Groups. To create Tableau groups, simply click Ok.
This creates the Tableau groups with the name of the group and added it to the dimension
list. This can be used to visualize a group of people using the Tableau method for members in a field. The
functionality of the Tableau create group is demonstrated in the image below. In Tableau, the total sales for
both furniture and office supplies are visualized for grouping.
Def:Tree visualization is a method used to represent hierarchical data structures, where data is organized in a
parent–child relationship. It allows users to explore nested, branching relationships in a clear and interpretable
format.
A tree is defined as a set of nodes and edges where there are no loops. Every edge
has a pair of nodes called the parent node and child node. A child node has only
one parent node.
The root node which is a single node has no parents and the leaf nodes have no
children. The depth of a tree is the number of nodes from the root to the leaf.
Indented Layout
This layout places all items along vertically spaced rows. Uses indentation to show parent
child relationships. Example: Windows explorer.
Figure :Indented Layout
Dendrogram
The main use of a dendrogram is to work out the best way to allocate objects to
clusters.
Balloon Tree :
balloon Tree places the focus node in the middle of the canvas (picture) and all its children in a
circle around it. Children of children are again places in a circle around their parents, etc
It is effective on showing the tree structure at the cost of node details.
Hyperbolic Tree
A basic hyperbolic tree. Nodes in focus are placed in the center and given more room,
while out-of-focus nodes are compressed near the boundaries. This example also
uses balloon tree drawing
3D Tree visualization – Cone Tree
Cone trees are a 3D extension of the 2D layered tree drawing method. Parent is at the tip of a
cone, and its children are spaced equally on the bottom circle of the cone. Can be either
horizontal or vertical.
Node-Link Trees
The below figure shows that the nodes are distributed in space, connected by straight
or curved lines. 2D space is a typical approach used to break apart breadth and depth.
-------------------------------------------------------------------------------------------------------------------
Def: Graph Visualization is the process of visually representing graph data structures, where:
Social network analysis (e.g., showing how people are connected on Facebook or LinkedIn)
Fraud detection in financial networks
Graphs consists of nodes and edges but is different from a tree by the concept that a child node may have
multiple parent nodes.
A graph contains loops, or multiple paths between two nodes in the graph.
Representation of size of graph is difficult (How about representing thousands
of nodes? in a small screen).
Navigation also problem in Graph drawing.
It has two representations namely the Node-link diagrams and Matrix display.
Key Concepts:
1. Nodes (Vertices): These represent the entities or data points in the graph. For example, in a social
network graph, nodes could represent individuals.
2. Edges (Links): These are the connections between the nodes. In the social network example, an
edge might represent a friendship or interaction between two individuals.
3. Directed vs. Undirected Graphs:
o Directed Graphs: The edges have a direction, indicating a one-way relationship (e.g., following
someone on social media).
o Undirected Graphs: The edges do not have a direction, indicating a mutual or bidirectional
relationship (e.g., a friendship).
4. Weighted Graphs: In some graphs, edges can have weights or values associated with them,
representing the strength or capacity of the connection (e.g., the number of messages exchanged
between two individuals).
The goal of graph visualization techniques is to increase the understanding level of data
by providing clearer, intelligible layouts as well as suitable interaction mechanisms.
Spring Layout
Is a force directed approach where nodes are modeled as physical bodies that are
connected through strings Nodes connected by edges will exert an attraction force.
Graph using a Force-Directed Layout
Spanning Trees
Definition:
A Spanning Tree of a graph is a subset of the graph that:
Includes all the vertices
Is connected
Has no cycles
Has exactly (n - 1) edges if there are n nodes
3D Layout
Def: A 3D layout is a graphical arrangement where data points, objects, or nodes are positioned within a
three-dimensional coordinate system, allowing users to explore data from multiple angles.
Applications:
or
Tools for Graph Visualization:
There are various tools and libraries available for creating graph visualizations, such as:
NetworkX: A Python library for the creation, manipulation, and study of complex networks.
Cytoscape: Often used for biological research to visualize molecular interaction networks.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Create clusters
To find clusters in a view in Tableau, follow these steps.
1. Create a view.
2. Drag Cluster from the Analytics pane into the view, and drop it on in the target
area in the view:
o Tableau creates a Clusters group on Color, and colors the marks in your
view by cluster. If there is already a field on Color, Tableau moves that field to
Detail and replaces it on Color with the clustering results.
o Tableau displays the Clusters dialog box, where you can customize(alter) the cluster.
3. Customize the cluster results by doing either of the following in the Clusters dialog box.
o Drag new fields from the Data pane into the Variables area of the Clusters
dialog box. You can also drag fields out of the Variables area to remove them.
When you add variables, measures are aggregated using the default aggregation
for the field; dimensions are aggregated using ATTR, which is the standard
way that Tableau aggregates dimensions.
o Specify the number of clusters (between 2 and 50). If you do not specify a
value, Tableau will automatically create up to 25 clusters.
4. When you finish customizing (alter) the cluster results, click the X in the upper-right
corner of the Clusters dialog box to close it:
To rename the resulting clusters, you must first save the cluster as a group.
and the
----------------------------------------------------------------------------------------------------------------------------------------------
Q). Explain about Visualization of Networks?
def:
Visualization of Networks in data visualization refers to the graphical representation of networks,
which are composed of nodes (or vertices) and edges (or links). This type of visualization is used to
display and analyze relationships, connections, or flows between entities.
(repeated) patterns, subgroups, and clusters within the network. These insights
are essential for understanding the behavior and dynamics of networks. Example, in a
transportation network, clusters of high traffic routes can be identified, helping in
route optimization.
3. Informed Decision-Making: By visualizing complex networks, decision-makers can
allocate resources effectively, detect bottlenecks (jams), and improve processes. This is
particularly valuable in supply chain management or organizational restructuring.
4. Enhanced Communication: To communicating complex information to a wider
audience. Visualization makes it easier for people with varying levels of expertise to
grasp the network's structure and dynamics. This helps in collaborative efforts and
facilitates better decision-making within multidisciplinary teams.
Types of Networks:
1. Social Networks: These networks involve individuals and their relationships.
Examples include social media connections and friendship networks. Example
Visualizing Facebook friends.
2. Transportation Networks: Transportation networks involve various modes of
transport, such as roads, railways, air routes, and shipping lanes. Visualizing
transportation networks is crucial for optimizing routes, reducing jamming, and
ensuring efficient logistics.
identify trends, concentration areas, or variations within the network, such as areas
of high or low activity.
5. Geo-Spatial Maps: Geo-spatial maps overlay (cover) network data on geographical
maps. This is valuable for visualizing transportation networks, showing routes, and
understanding geographic patterns.
Network visualization examples:
static maps can be created manually using tools like Visio or — as in the example below —
LucidChart.
Static network maps in data visualization are fixed, image-based representations of a network's topology
at a specific point in time Dynamic network maps
Dynamic network:
Def: Dynamic network maps in data visualization are interactive, real-time representations of network
topology that automatically update to reflect changes in the network
Dynamic network maps add real-time data on network changes to the mix. While a
static map is useful for understanding the intended network design, or even the state of
Generally, real-time dynamic network maps are interactive, allowing you to drill
down and view specific metrics on connections, devices, and traffic flows.
static maps, which show a fixed state, dynamic maps update constantly, providing
insights into network performance, security, and troubleshooting.
Network visibility solutions that provide network discovery and dynamic network mapping are
significantly better network troubleshooting, change management, and analysis resources than
static maps that must be manually updated.
Q). Discuss about Visualization of software?
Def: Visualization of Software in data visualization refers to the graphical representation of software
systems, structure, behavior, and evolution. It helps developers, testers, and project managers
understand complex software components, relationships.
This type focuses on the structure of the software at rest — how it's organized in terms of modules,
packages, classes, functions, and dependencies.
Example Techniques:
Module/Package Dependency Graphs
Fig: Safely restructure your codebase with Dependency Graphs | Understand Legacy Code
Use Case:
Helps understand code organization, modularity, and how different components relate.
This focuses on how the software behaves during execution, showing interactions between components,
method calls, or object lifecycles.
Example Techniques:
Sequence Diagrams
Call Graphs
Use Case:
Useful for debugging, understanding logic flow, and analyzing complex interactions.
3. Evolution Visualization:
This type tracks how software changes over time, helping identify trends, refactoring patterns,
or possible sources of bugs.
Example Techniques:
Commit timelines
Use Case:
Essential for code maintenance, project reviews, and impact analysis of changes.
4. Performance Visualization
Focuses on runtime performance metrics, enabling developers to spot bottlenecks, inefficient code.
Example Techniques:
Profiling Charts
Resource Usage Graphs
Use Case:
5. Testing Visualization:
Visualizes the testing process and results, helping evaluate test coverage, bug locations, and quality
metrics.
Example Techniques:
Test Execution Graphs
Use Case:
Ensures code quality, identifies untested areas, and improves test strategy.
Benefits:
Methods
1. Distance-Based Mapping
a. Authors to Words
b. Authors to Cats
c. Authors to Visual Styles
2. Attribute-Based Mapping
a. Attribute-Based Mapping
3. Topology-Based Mapping
Distance-Based Mapping
Authors from conferences like CHI, VIS, and SIGGRAPH are symbolically linked to English nouns. By
measuring similarity, authors are associated with nouns, enabling a novel means of visualizing relationships among
authors. This method adds a layer of metaphorical context to explore author connections through the lens of common
nouns.
b) Authors to Cats
For example, it can map books, movies, and games based on attributes like
The 500 most popular games are clustered, and mappings are established
between these clusters.
Topology-Based Mapping
It is exact distances are not essential or available .
This approach is particularly useful for network data or hierarchies
represented as graphs.
In such cases, the focus is on preserving (protective) the topology, which could mean
adjacency for generic graphs or descendancy for tree structures.