Tableau Interview2
1. What is TABLEAU?
Tableau is a powerful, fast data visualization tool used in the Business
Intelligence (BI) industry. It turns raw data into an easily understandable format,
which makes analysis faster. Visualizations can be created as worksheets and combined
into dashboards, so that employees at different levels of an organization can
understand them easily. Tableau does not impose a fixed limit on file size or on the
number of rows or columns that can be imported.
LOD Expression stands for Level of Detail Expression. It is used to run complex
queries involving many dimensions at the data source level.
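As a rough illustration of what a level-of-detail calculation does, the Python sketch
below computes a total at a coarser level (per customer) and attaches it to every order
row, whatever other fields the row carries. The field names and figures are invented for
the example, and this is plain Python, not Tableau syntax.

```python
from collections import defaultdict

# Hypothetical order rows: (customer, category, sales)
orders = [
    ("Asha", "Furniture", 120.0),
    ("Asha", "Technology", 80.0),
    ("Ravi", "Furniture", 50.0),
]

def total_per_customer(rows):
    """Sum sales at the customer level, ignoring category."""
    totals = defaultdict(float)
    for customer, _category, sales in rows:
        totals[customer] += sales
    return dict(totals)

def attach_customer_total(rows):
    """Return each row with the customer-level total appended."""
    totals = total_per_customer(rows)
    return [(c, cat, s, totals[c]) for c, cat, s in rows]

for row in attach_customer_total(orders):
    print(row)
```

Each output row keeps its own detail (category, sales) while also carrying a value
computed at a different level of detail.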
A heat map is a graphical representation of data that uses colour-coding to represent
different values. Marks with higher values are shown in darker colours on the map.
7. Define TreeMap?
A TreeMap is a visualization that organizes data hierarchically and shows it as a set of
nested rectangles. The size and colour of each rectangle correspond to the values of
the data points it represents. Parent rectangles are tiled with their child elements.
Parameters are dynamic values that can replace constant values in calculations.
A Tableau dashboard is a group of views that allows you to compare different types of
data simultaneously. Data sheets and dashboards are connected: any modification to the
data is reflected directly in the dashboards. It is an efficient way to visualize
and analyze data.
12. Define Page Shelf in Tableau?
The Pages shelf breaks a view into a series of pages and displays a different view on
each page. With this feature, you can analyze the effect of each field on the rest of
the data in the view.
A story is a sheet that contains a sequence of worksheets and dashboards used to convey
insights from data. A story can show the connection between facts and outcomes that
impact the decision-making process. It can be published on the web or presented to an
audience.
Facts are numeric measures of data. They are stored in fact tables, which hold the data
that is analyzed along the attributes of the dimension tables. Fact tables have foreign
keys that reference the dimension tables.
Dimensions are descriptive attributes of data. They are stored in dimension tables.
For example, a customer’s information such as name, number, and email is stored in a
dimension table.
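The relationship between a fact table and a dimension table can be sketched in a few
lines of Python. The tables, column names, and values below are hypothetical; the point
is only that the fact rows carry numeric measures plus a foreign key, and the dimension
supplies the descriptive attribute used for grouping.

```python
# Hypothetical dimension table: customer_id -> descriptive attributes
customers = {
    1: {"name": "Asha", "email": "asha@example.com"},
    2: {"name": "Ravi", "email": "ravi@example.com"},
}

# Hypothetical fact table: a numeric measure plus a foreign key
sales_facts = [
    {"customer_id": 1, "amount": 250.0},
    {"customer_id": 2, "amount": 100.0},
    {"customer_id": 1, "amount": 50.0},
]

def sales_by_customer_name(facts, dimension):
    """Aggregate the fact measure, grouped by a dimension attribute."""
    totals = {}
    for fact in facts:
        name = dimension[fact["customer_id"]]["name"]
        totals[name] = totals.get(name, 0.0) + fact["amount"]
    return totals

print(sales_by_customer_name(sales_facts, customers))
```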
Extract: An extract is a snapshot of data that is taken from the data source and put
into the Tableau repository. This snapshot can be refreshed periodically, either fully
or incrementally, and the refresh can be scheduled in Tableau Server.
Live: A live connection connects directly to the data source, and data is fetched
directly from the tables. The data is therefore up to date and consistent, but access
speed can suffer.
Joins in Tableau work much as they do in SQL. Tableau supports the standard join types:
inner, left, right, and full outer.
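To make the join behaviour concrete, here is a minimal Python sketch of an inner join
and a left join over two small tables. The table contents and the helper names
(`inner_join`, `left_join`) are illustrative assumptions and not part of Tableau.

```python
def _index_by_key(rows, key):
    """Group rows by the value of their join key."""
    index = {}
    for row in rows:
        index.setdefault(row[key], []).append(row)
    return index

def inner_join(left, right, key):
    """Keep only rows whose key appears in both tables."""
    index = _index_by_key(right, key)
    return [{**l_row, **r_row}
            for l_row in left
            for r_row in index.get(l_row[key], [])]

def left_join(left, right, key):
    """Keep every left row; unmatched rows get None for right-only columns."""
    index = _index_by_key(right, key)
    right_only = {col for row in right for col in row if col != key}
    joined = []
    for l_row in left:
        matches = index.get(l_row[key], [])
        if matches:
            joined.extend({**l_row, **r_row} for r_row in matches)
        else:
            joined.append({**l_row, **dict.fromkeys(right_only)})
    return joined

# Hypothetical tables sharing the key "id"
orders = [{"id": 1, "sales": 10}, {"id": 2, "sales": 20}]
returns = [{"id": 2, "returned": True}]

print(inner_join(orders, returns, "id"))
print(left_join(orders, returns, "id"))
```

The inner join drops order 1 because it has no matching return, while the left join
keeps it and fills the missing column with a null.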
Data modeling is the analysis of the data objects that are used in a business or other
context and the identification of the relationships among these data objects. It is a
first step in object-oriented programming.
Multiple measures can share a single axis so that all the marks are shown in a
single pane. We can blend measures by dragging the first measure onto one axis and the
second onto the existing axis.
Shelves: Every worksheet in Tableau has shelves such as Columns, Rows, Marks,
Filters, Pages, and more. By placing fields on shelves we can build our own
visualization structure. We can control the marks by including or excluding data.
Sets: Sets are used to compute a condition on which a subset of the data is prepared.
Data is grouped together based on that condition, and the fields responsible for the
grouping are known as sets. For example – students having grades of more than 70%.
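The student example can be sketched in Python as a condition that splits the data into
members and non-members, which is roughly how a set behaves when used in a view. The
names and grades are invented, and the "In"/"Out" labels simply mark membership.

```python
# Hypothetical student grades in percent
grades = {"Meera": 82, "John": 65, "Li": 91, "Omar": 70}

def build_set(scores, threshold):
    """Return the members whose score is strictly above the threshold."""
    return {name for name, score in scores.items() if score > threshold}

top_students = build_set(grades, 70)

# Label every student by membership in the set
in_out = {name: ("In" if name in top_students else "Out") for name in grades}
print(in_out)
```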
Whenever we set a context filter, Tableau generates a temp table that needs to be
refreshed every time the view is triggered. So, if the context filter is changed, the
database needs to recompute the temp table, and performance decreases.
There is a card to the left of the view where we can drag fields and control mark
properties like color, size, type, shape, detail, label, and tooltip.
A bullet graph is a variant of the bar graph. It compares the performance of one
measure against other measures.
A Gantt chart displays the progress of a value over a period of time. It consists of
bars laid out along a time axis. It is a project management tool in which each bar
measures a task in the project framework.
A histogram shows the distribution of continuous data across a set of intervals
(bins). This chart helps us to find extreme points, gaps, unusual values, and the ranges
where values are most concentrated.
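A histogram groups a continuous measure into bins and counts the values that fall into
each one. The sketch below shows that binning step in Python; the ages and the bin width
are made-up values chosen only to show a gap and an outlying value.

```python
def histogram(values, bin_size):
    """Count how many values fall into each bin of width bin_size."""
    counts = {}
    for value in values:
        bin_start = (value // bin_size) * bin_size
        counts[bin_start] = counts.get(bin_start, 0) + 1
    return dict(sorted(counts.items()))

ages = [21, 25, 34, 36, 38, 52, 79]  # hypothetical data
print(histogram(ages, 10))
```

Bins that never appear in the result are the gaps, and the densest bin shows where the
values concentrate.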
Tableau Reader is a free desktop application in which you can open and interact with
Tableau packaged workbooks. It can only open existing files; it cannot create new
connections for a workbook.
31. What is the difference between published data sources and embedded data
sources?
An embedded data source contains connection information and is tied to the workbook
in which it is saved.
When we work with large volumes of data, the data can easily become messy. With
Tableau, you can create hierarchies to keep your data neat. Even if you don’t need
them, some hierarchies are already built into your data, so you can easily organize,
manage, and track it.
33. What is a Column chart?
A column chart visualizes data as a set of rectangular columns whose lengths are
proportional to the values they represent. The horizontal axis shows the category to
which they belong, and the vertical axis shows the values.
The line chart is a popular way of visualizing data. It connects the individual data
points, so we can easily read a series of values, see trends over time, or project
future values. The horizontal axis holds the category to which the points belong and
the vertical axis holds the values.
A stacked bar chart is composed of multiple bars stacked horizontally, one below the
other. The length of each bar depends on the value of the data point. A stacked bar
chart makes it easier to follow the changes in all the variables presented, side by
side. We can also watch the changes in their total and forecast future values.
A stacked column chart is composed of multiple bars stacked vertically, one on top of
another. The length of each bar depends on the value of the data point. A stacked column
chart is well suited to showing the changes in all variables. This type of chart is
worth considering when the number of series is higher than two.
38. What is an Area Chart?
An area chart is a line chart in which the area between the x-axis and the lines is
filled with colour or a pattern. These charts are typically used to represent
accumulated totals over time and are the conventional way to display stacked lines.
Step 1: Go to [Link], enter your e-mail address, and click “Download the App”.
Step 2: After downloading the file, run it and follow the prompts to install
Tableau.
Step 3: You can now see the Tableau shortcut on your desktop.
Step 4: Double-click the icon to load Tableau Public.
40. What is a Context Filter? Show the steps to create a Context Filter in
Tableau.
Context filters are applied to the data rows before any other filters. They are limited
to views, but they can be applied on selected sheets. They define the aggregation and
disaggregation of data in Tableau.
Step 1: Drag the Sub-Category dimension to the Rows shelf and the Sales measure to the
Columns shelf. Choose the horizontal bar chart as the chart type, then drag the
Sub-Category dimension to the Filters shelf as well.
Step 2: Right-click the Sub-Category field on the Filters shelf and go to the fourth
tab, Top. Choose the By field option, and from the next drop-down choose Top 10 by
Sum of Sales.
Step 3: Drag the Category dimension to the Filters shelf. Right-click it to edit and,
under the General tab, choose Furniture from the list. The result shows three
subcategories of products: the Furniture subcategories that are among the top 10
subcategories across all products.
Step 4: Right-click the Category: Furniture filter and select Add to Context. Because
a context filter is applied before the Top 10 filter, the final result now ranks the
subcategories within the Furniture category only.
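The effect of the order in which the two filters run can be sketched in Python. The
rows, the sales figures, and the use of a Top 3 instead of a Top 10 are assumptions made
to keep the example small; the sketch only contrasts "Top N first, then Category" with
"Category first (as context), then Top N".

```python
# Hypothetical rows: (category, sub_category, sales)
rows = [
    ("Furniture", "Chairs", 300), ("Furniture", "Tables", 200),
    ("Furniture", "Bookcases", 90),
    ("Technology", "Phones", 330), ("Technology", "Machines", 180),
    ("Office Supplies", "Paper", 70),
]

def top_n_subcategories(data, n):
    """Return the n sub-categories with the highest total sales."""
    totals = {}
    for _category, sub, sales in data:
        totals[sub] = totals.get(sub, 0) + sales
    ranked = sorted(totals, key=totals.get, reverse=True)
    return set(ranked[:n])

def furniture_only(data):
    """Apply the Category = Furniture filter."""
    return [row for row in data if row[0] == "Furniture"]

# Without context: Top N is computed over all rows, then Category is applied.
top_overall = top_n_subcategories(rows, 3)
without_context = sorted(
    row[1] for row in furniture_only(rows) if row[1] in top_overall
)

# With Category as context: Category is applied first, then Top N.
with_context = sorted(top_n_subcategories(furniture_only(rows), 3))

print(without_context)
print(with_context)
```

In this toy data the first list contains fewer sub-categories than the second, because
some of the overall top slots are taken by non-Furniture products.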
Tableau Interview Questions and Answers: Experienced
41. What are the products offered by Tableau? Explain a few things
about them.
Tableau Prep: Tableau Prep is responsible for preparing data for analysis. This tool
provides three coordinated views that give us a complete picture of the data.
Tableau Desktop: Tableau Desktop is the place where the analysis happens. It offers
powerful drag-and-drop analytics that are very easy to use, so you get insights from
your data quickly.
Tableau Online: Tableau Online is a secure and scalable self-service analytics cloud tool.
You can use it anywhere, anytime, and it removes much of the IT complexity.
Tableau Server: From small to large enterprises, Tableau Server is used to fulfil BI
requirements. It is an on-premise solution that can take data from anywhere and share
it across the organization through desktop or mobile browsers.
Filters are the simpler and more straightforward feature in Tableau. They apply to
dimensions or measures directly. For example, to show only Gujarat or Karnataka in a
State dimension, we can apply a filter on it. Tableau offers multiple UI options for
filters, such as radio buttons, drop-down lists, checkboxes, sliders, and more. Filters
on sheets are also available in Tableau.
Parameters are like variables. They are more complex and more powerful. Like a
variable, a parameter can be used in calculations, which means it holds only a single
value. Parameters have the same UI options as filters except for checkboxes, because
checkboxes do not represent a single value. For example, we can create parameters for
interest rate and period, and then use them to calculate interest and principal
payments.
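The interest example can be sketched in Python, treating the interest rate and the
period as single-valued inputs in the way a parameter would be. The loan amount, the
rate, and the standard fixed-payment (annuity) formula used here are assumptions for
illustration; the source does not specify which calculation is meant.

```python
def monthly_payment(principal, annual_rate, months):
    """Fixed monthly payment for a fully amortizing loan."""
    rate = annual_rate / 12
    if rate == 0:
        return principal / months
    return principal * rate / (1 - (1 + rate) ** -months)

def first_month_split(principal, annual_rate, months):
    """Split the first payment into its interest and principal parts."""
    payment = monthly_payment(principal, annual_rate, months)
    interest = principal * annual_rate / 12
    return interest, payment - interest

# "Parameters": single values a viewer could change
interest_rate = 0.12   # assumed annual rate
period_months = 12     # assumed loan period

interest, principal_part = first_month_split(10_000, interest_rate, period_months)
print(round(interest, 2), round(principal_part, 2))
```

Changing either of the two inputs changes every dependent result, which is the role a
parameter plays in a calculated field.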
Filters are used to provide the correct information to viewers after removing unnecessary
data. There are various types of filters available in Tableau.
Extract Filters – Extract filters are applied to the data extracted from the data
source. With this filter, data is extracted from the data source and placed into the
Tableau data repository.
Datasource Filters – Datasource filters are similar to extract filters and also work on
the extracted dataset. The only difference is that they work with both live and extract
connections.
Context Filters – Context filters are applied to the data rows before any other filters.
They are limited to views, but they can be applied on selected sheets. They define the
aggregation and disaggregation of data in Tableau.
Measure Filters – Measure filters are applied to the values present in the measures.
In a tiled layout, items don’t overlap, and the layout adjusts to the dashboard
size. In a floating layout, items can be layered over other objects. Floating items can
have fixed positions and sizes.
Junk Dimension: Junk values or unrelated attributes grouped together are termed a Junk
Dimension.
Degenerate Dimension: Degenerate dimensions have primary keys only, without any other
descriptive information.
Role-playing Dimension: If one dimension is used in multiple roles, it is termed a
Role-playing Dimension.
Example – Date for an e-commerce site order (Date of Order, Date of Shipment, Date of
Delivery)
Inferred Dimension: Empty dimensions are called inferred dimensions. They are
usually used in ETL.
Example – A customer email that was not entered when submitting a form is filled in
as null.
Aggregation → The process of summarizing the data and viewing a single numeric value
is called aggregation. Example – sum/avg of salary for each employee.
Disaggregation → The process of viewing each transaction, so that all the measures can
be analyzed both dependently and independently. Example – individual salary
transactions for each employee.
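The salary example can be made concrete with a short Python sketch. The employee names
and amounts are hypothetical; `aggregate` returns one summarized value per employee,
while `disaggregate` keeps every transaction as its own row.

```python
# Hypothetical salary transactions: (employee, amount)
transactions = [("Anu", 3000), ("Anu", 3200), ("Ben", 2800)]

def aggregate(rows):
    """One summarized result per employee: (sum, average)."""
    grouped = {}
    for employee, amount in rows:
        grouped.setdefault(employee, []).append(amount)
    return {e: (sum(a), sum(a) / len(a)) for e, a in grouped.items()}

def disaggregate(rows):
    """Every transaction is kept as its own row."""
    return list(rows)

print(aggregate(transactions))
print(disaggregate(transactions))
```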
Discrete data roles consist of values that are separate and distinct. They take
individual values within a range. For example – number of cancer patients in a
hospital, number of threads in a sheet, state. Discrete values are displayed as blue
icons in the data window and blue pills on shelves. Discrete fields can be sorted.
Continuous data roles can take any value within a finite or infinite interval. For
example – age, unit price, order quantity. Continuous values are displayed as green
icons in the data window and green pills on shelves. Continuous fields cannot be sorted.
Traditional BI vs. Tableau:
● Architecture: Traditional BI architecture has hardware limitations. Tableau does
not have such dependencies.
● Computing: Traditional BI does not support in-memory, multi-thread, and multi-core
computing. Tableau supports in-memory computing when used with advanced technologies.
● Analysis: Traditional BI has a predefined view of data. Tableau uses predictive
analysis for various business operations.
● Skills: Traditional BI requires good knowledge of BI and related skills. Tableau is
easy to work with and understand without that knowledge.
● Security: Traditional BI provides data security. Tableau does not provide data
security in the public version, whereas the professional version does.
.twb: .twb means Tableau workbook. A .twb file is an XML document that stores
information about your sheets, stories, and dashboards. It holds a reference to the
source file, such as Excel or .tde, and is linked to that source file when you save it.
If you want to share your workbook, you need to send both the workbook and the data
source file.
.twbx: It is a compressed file that contains everything: the data source files, the
.twb, and any other files needed to produce the workbook. A .twbx is convenient for
sharing because it carries a copy of the data instead of a reference to the original
source file. .twbx files are used for reports and can be viewed with Tableau Reader.
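Because a packaged workbook is a zip archive, its contents can be inspected with
Python's standard `zipfile` module. The sketch below is a minimal illustration; the
helper names are my own, and any file name passed in (for example a hypothetical
`Sales.twbx`) is an assumption, not something taken from the text.

```python
import zipfile

def list_packaged_files(twbx_path):
    """Return the names of the files stored inside a .twbx archive."""
    with zipfile.ZipFile(twbx_path) as archive:
        return archive.namelist()

def find_workbooks(twbx_path):
    """Return the .twb entries found in the package."""
    return [name for name in list_packaged_files(twbx_path)
            if name.lower().endswith(".twb")]
```

Calling `find_workbooks` on a packaged workbook would be expected to return the XML
workbook it bundles, while `list_packaged_files` also shows the data files shipped
alongside it.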
Step 1: Select Analysis. In the drop-down that opens, select Create Calculated Field.
Step 2: The Calculation Editor opens. Give the calculated field a name.
Tableau dashboard allows the representation of parameters in four ways only. They don’t
allow any multiple values like a filter can do. They only allow a single value.
56. State some reasons for the low performance of Tableau. Explain in
detail.
● Filters - each filter creates an extra query, so filters used in large numbers or
inefficiently can reduce performance. It is advisable to use filters only when
they are necessary.
● Live connection - a Tableau extract usually performs much better than a live
connection.
● Data sources - a poorly written query against an unsuitable data source can reduce
performance. The data source’s own performance also affects Tableau’s performance.
● Filters
● Calculated fields
● Actions
● Measure-swaps
● Changing views
● Auto-updates
Tableau Desktop Log files are placed in C:UsersMy DocumentsMy Tableau Repository.
For a live connection to a data source, check the “[Link]” and “[Link]” log
files. For an extract connection to a data source, check the “[Link]” file, which
has detailed information about queries.
60. If the license of a published workbook’s owner expires, can other users
still see those workbooks?
If the license of the owner of published workbooks expires, his/her role changes to
“Unlicensed”. He/she can no longer access those workbooks, but others can. Only a site
admin can change the ownership of those workbooks.
1. Go to the Segment filter and click the small drop-down arrow on its right side.
2. Select Customize.
3. A small drop-down appears under Customize; uncheck the option for the “All”
value.
Extract connection is better than live connection because extract connection can be
used from anywhere, anytime without connecting to the database. We can construct our
own visualizations on it irrespective of the database connection.
63. Tell me something about workbook version control in Tableau.
● At the desktop level, you keep copies of your changes and files manually or use a
third-party system such as Microsoft TFS.
● At the server level, when you publish the data source, each version is saved
in the revision history on Tableau Server or Online. Reverting to a previous version
is also possible here.
Cascading filters means that the values of filter2 depend on the values of filter1. For
example, if filter1 is country and we select “India” for filter1, filter2 should show
only the Indian states. We can achieve this by using the option “Only Relevant Values”.
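The dependency between the two filters can be sketched in Python: the second filter
offers only the values that co-occur with the first filter's selection. The country and
state pairs below are hypothetical sample data.

```python
# Hypothetical (country, state) pairs present in the data
locations = [
    ("India", "Gujarat"), ("India", "Karnataka"),
    ("USA", "Texas"), ("USA", "Ohio"),
]

def relevant_states(rows, selected_country):
    """Values offered by filter2 once filter1 has been set."""
    return sorted({state for country, state in rows
                   if country == selected_country})

print(relevant_states(locations, "India"))
```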
67. How can we display the top and bottom 5 of records in a single
Tableau view?
68. State a few charts which we should not use with valid reasons
Tableau cannot plot null values on an axis, so it displays an indicator in the lower
right corner of the view. Once you click on that indicator, you have options to handle
null values. The available options are listed below.
● Filter Data – If you choose this option, null values are filtered out of the
view.
● Show Data at Default Position – It replaces the null value with a default value
and shows the data at the default position on the axis. The default value depends
on the data type of the field. The defaults for specific data types are:
● Numbers → 0
● Dates → 12-31-1899
● Geographic Location → (0,0)
● Negative Values on a Log Axis → 1
Yes, Tableau Public only allows users up to 10 million rows of data for visualization.
We can download views or workbooks from the server. But, data formats available to us
depend on the permissions granted by site administrators or content owners.
We can also download selected sheets into PDF format, but while generating PDF, web
page objects won’t be included.
72. Can we use unused columns (Columns that are not used in reports
but data source has columns) in Tableau Filters?
73. What is the benefit of the Tableau extract file over the live
connection?
Extract files can be used without any connections and you can build your own
visualization without connecting to the database.
75. Can we place an Excel file in a shared location and use it to develop a
report and refresh it at regular intervals?
Yes, we can place an Excel file in a shared location and use it to develop a
report, but for better performance, we need to extract the file.
Yes, we can do testing in Tableau by using tools and the easiest way is using the
desktop application. We need to check it before we publish it on the Tableau server.
77. Can you get values from two different sources as a single input into
a parameter?
78. Design a view in a map such that if a user selects any state the cities
under that state have to show profit and sales.
If you want to show the profit and sales of every city under the state in the
same worksheet, follow a few steps.
The data that you want to display under the state must be in the same worksheet.
In the view, Size indicates the sales and Color indicates the profit of the state
that you have selected.
79. How can you set permissions or protect the data in Tableau Public?
Tableau Public does not offer any security permissions or data protection. Any files
or workbooks shared on Tableau Public can be viewed by all users.
80. If I delete a workbook from Tableau Public and there are links to it from
blogs and other web locations, what happens to them?
Tableau Public is like a repository: if you delete anything from Tableau Public, the
data is lost, and links to it from blogs and other locations no longer work.
81. Mention whether you can create relational joins in Tableau without
creating a new table?
Yes, we can create relational joins in Tableau without creating a new table.
It happens when the user is trying to publish the workbook which is connected to the
internal or local server that means the company’s network.
Yes, Parameters have their own drop-down list, which enables the users to view the data
entries which are available in the parameter during the creation.
Option 1:
Example:
The above calculation creates the group with four values: Little Containers, Big
Containers, Small Containers, Midsize Containers.
Option 2:
Create a set from the group, and use the set in the calculated field.
Step 1: Right-click on the Data Pane and then select Create Set.
Yes, Tableau can be installed on both the Windows Operating system and macOS.
87. How to automate reports in Tableau?
When we are publishing the reports to Tableau Server, there we will find an option to
schedule the report, just select the time when you want to refresh the data.
88. Which databases give the best performance when connected live?
89. What are the challenges faced when working with huge volumes of
data?
Whenever we set a context filter, Tableau generates a temp table that needs to be
refreshed every time the view is triggered. So, if the context filter is changed, the
database needs to recompute and rewrite the temp table, which in turn slows down
performance.
The published data source has connection information in it. It is independent of any
workbook and can be accessed by multiple workbooks.
Multiple measures can share a single axis so that all the marks will be shown in a single
pane. We can blend measures by dragging the 1st measure on one axis and the 2nd on
the existing axis.
94. Can we remove the “All” option from the auto-filter in Tableau?
Yes, we can. Navigate to the filter → right-click on it → select Customize → uncheck
the option “Show All”.
95. Define Bullet graph
A bullet graph is a variant of the bar graph. It compares the performance of one
measure against other measures.
Tableau Public is a free service that allows anyone to publish data sources and
visualizations to the web. These visualizations can then be embedded into
blogs or web pages. They can also be distributed through email or social media.
Moreover, they can be made downloadable by other end users. No programming skills
are needed for Tableau Public, and anyone can access it free of cost.
99. What is the maximum no. of rows Tableau can utilize at one time?
Tableau is not restricted by the no. of rows in the table. Customers use Tableau to
access petabytes of data because it only retrieves the rows and columns needed to
answer your questions.