Minitab Basic Tutorial
Minitab Basic Tutorial
is a data set showing the show being viewed by 50 viewers in a Nielsen sample. Click on FILE > OPEN WORKSHEET and locate the file Nielsen in the chapter two folder. Click on STAT > TABLES > TALLY INDIVIDUAL VARIABLES I want to get a count of the number of people watching each of these television shows, so I enter TVShow in the data window and click the Counts box.
Starting with Charmed and ending with 24 cut and past these counts into your spreadsheet. 1
{hint: remove the space between Chicago and Hope to make it work. Then add it back.} Once you have done so you can add some variable names by simply putting the cursor at the head of the column and typing.
Now we can make some pictures. Click on GRAPH > PIE CHART We did not have to go to the trouble of using TALLY for this chart. We can proceed as follows, adding a title to the Pie Chart by clicking on the button Labels.
We can also ask Minitab to label the pie slices. In the Labels box, click on the tab Slice Labels.
Now click on OK > OK, and Minitab generates the following Graph.
Audience Share
C harmed 8.0% Chicago Hope 14.0% C ategory C harmed C hicago Hope Frasier Millionaire
Millionaire 48.0%
Frasier 30.0%
Had we originally been given the data as it appears in columns 2 and 3, already tallied, we could have generated the graph in the following way.
Another thing we can do with this data is make a bar chart. Click on GRAPH > BAR CHART. Select Counts of unique variable and Simple.
Click on OK. Select your variable and add a label by clicking on the box Labels.
You will get this picture when you click on OK > OK.
20
Count
15
10
Charmed
Frasier
Millionaire
If we 8
wanted change the Y axis to measure market share in percent, rather than as a count, Minitab will do this for you. Proceed as before, but click on the box Bar Chart Options.
40
Percent
30
20
10
Charmed
Frasier
Millionaire
If the data
had been given to you already tallied, as in columns 2 and 3, we could have made a chart by clicking on BAR CHARTS and then selecting not Counts of Unique Values, but Values from a table, along with Simple. Clicking on OK, we continue thusly:
10
20
Viewers
15
10
Charmed
Frasier
Millionaire
11
Examples using Quantitative Data In the Chapter 2 folder of your Anderson, Sweeny, and Williams data disk is a data set, Wageweb, that is a sample of annual salaries (in thousands of dollars) of marketing vice presidents. Open the data set by clicking on FILE > OPEN WORKSHEET and selecting Wageweb. We can present this data in many different ways. One way is a dotplot. Click on
GRAPH > DOTPLOT and select the options One Y and Simple. Then proceed as follows:
12
Clicking on OK > OK gives this graph. If you had data on a second variable, and wanted a simple visual comparison of the two, ou
Salaries of Marketing Vice Presidents
96
108
120
132 Salary
144
156
168
180
could overlay dotplots. For example, suppose you had salary data for Finance Vice Presidents and wanted to compare their pay scale to that of the Marketing Vice Presidents. (I created some fake data, called salary2, to use in this example.) Begin as before, but now select Multiple Ys along with Simple. Click OK, and continue in this way:
13
Salary
Salary2
98
112
126
140 Data
154
168
182
Another 14
nice way to look at data is by using a histogram. Click on GRAPH > HISTOGRAM and select Simple. To make a histogram of Marketing Vice President salaries, do the following:
15
There are occasions when it is helpful to display the count in each category. This can be done using one of the options available. Begin by clicking on GRAPH > HISTOGRAM and selecting Simple. Then click on the Labels button and select the tab Data Labels and select the radio button Use Y value labels.
16
100
120
140 Salary
160
180
17
Later in the course we will talk about the normal distribution, the famous bell curvepeople often mention. Sometimes it is useful to see how your histogram compares to the normal distribution, and one way to do so is to create a histogram with an approximating normal distribution superimposed. To do so, select GRAPH > HISTOGRAM, but instead of using Simple, use With Fit. Adding a title, as we did before, yields the following histogram.
In Anderson, Sweeny, and Williams, pp. 34-36, you will find a discussion of cumulative distributions. These are pictured with a different kind of histogram one that gives counts that represent the cumulative number up to a point. To produce such a histogram in Minitab, begin with GRAPH > HISTOGRAM and select Simple. Click on the box Scale, and then select the tab called Y-Scale type. Check the box Accumulate values across bins.
18
Cumulative Frequency
40
30
20
13
19
10
4 1
100
120
140 Salary
160
180
19
One of the lesser-used tools discussed in the book is the Stem-and-Leaf display see pp. 4043. To create a Stem-and-Leaf display in Minitab, select GRAPH > STEM-AND-LEAF.
When you click OK, you get the following graph, which uncharacteristically for Minitab appears in the Session window.
Stem-and-Leaf Display: Salary
Stem-and-leaf of Salary Leaf Unit = 1.0 2 4 9 15 (12) 23 12 7 3 9 10 11 12 13 14 15 16 17 35 24 23468 334477 124456788888 01122345588 14577 0255 038 N = 50
There is a final technique for displaying data which is discussed in Anderson, Sweeny, and Williams only in Chapter 3, pp. 1012, called the Box Plot. To make a box plot of the salary 20
data in Minitab, click on GRAPH > BOX PLOT, and select One Y and Simple. Click on Labels to add a title.
21
Box plots are especially useful for comparing two or more frequency distributions, such as the two salary variables. To display multiple box plots, begin with GRAPH > BOX PLOT, but then select Multiple Ys and Simple. For clarity, I am going to rename Salary as Marketing and Salary2" as Finance by relabelling the head of each column.
22
180
160 Data
140
120
Note the asterisk in the Finance box plot, which signifies an outlier.
23
Next, click on FILE > OPEN WORKSHEET and select the file named NFL, which gives information on 40 National football league draft prospects. We can use box plots to compare various attributes of these draft prospects. For example, click on GRAPHS > BOXPLOT and select One Y and With Groups. This time I propose to accept the default title, and compare prospects speed by position.
24
Since Speed is time in a 40 yard dash, a low time signifies a speedy individual. What is most apparent (and not at all surprising to any football fan) is that wide receivers are much faster than Guards and Offensive tackles. One less obvious observation is that while the average speed of offensive tackles and guards is almost the same, the speed of offensive tackles is considerably more variable.
25
Another less obvious result comes from considering the boxplots of rating versus position.
Rating
It appears that this particular draft had many blue-chip receiver prospects and few strong prospects at Guard. Comparing two Qualitative Variables We can also compare two variables, using a technique known as a Scatterplot. Here is simple example, comparing prospects speed and weight. Begin by clicking on GRAPH > SCATTERPLOT and selecting Simple.
26
27
The wide receivers, one might surmise, are the fast and light prospects in the lower left, and the guards and tackles the heavy slow ones in the upper right. We can make this more obvious by going back to GRAPH > SCATTERPLOT and instead of selecting Simple selecting With Groups.
28
We can also produce a fitted line through the scatter of points, if we wish. Begin with GRAPH > SCATTERPLOT and select With Regression. Then graph speed against weight.
29
Minitab can also do cross-tabulations, as described in the book, but only for qualitative variables. If we wanted to cross-tabulate weight against position, we would need to first convert the variable weight into a qualitative variable. Click on DATA > CODE > NUMERIC TO TEXT
30
This creates a new qualitative variable, which I have named Weight2. (Qualitative variables have a T in the column number. The T is for text.)
To Cross-tabulate, click on STAT > TABLES > CROSS TABULATION AND CHI SQUARE.
31
32