1
00:00:00,004 --> 00:00:02,008
- [Presenter] Whenever you start to examine a new data set
2
00:00:02,008 --> 00:00:04,001
the first thing you should do
3
00:00:04,001 --> 00:00:07,003
is discover some basic facts about that data.
4
00:00:07,003 --> 00:00:09,002
In this movie I will show you
5
00:00:09,002 --> 00:00:11,005
how to calculate several of those measures.
6
00:00:11,005 --> 00:00:14,001
My sample file is 01_01_Means,
7
00:00:14,001 --> 00:00:17,000
and you can find it in the Chapter01 folder
8
00:00:17,000 --> 00:00:19,007
of the Exercise Files collection.
9
00:00:19,007 --> 00:00:22,005
The data set that I have
10
00:00:22,005 --> 00:00:25,008
in this workbook shows the number of days in transit
11
00:00:25,008 --> 00:00:29,000
for a number of shipments coming from overseas.
12
00:00:29,000 --> 00:00:31,008
So you can see the first shipment arrived in 23 days,
13
00:00:31,008 --> 00:00:34,000
the next in 38,
14
00:00:34,000 --> 00:00:35,007
and the rest of the data shows
15
00:00:35,007 --> 00:00:39,000
the other days that were measured.
16
00:00:39,000 --> 00:00:42,004
If you want to find the average, or mean, of that data
17
00:00:42,004 --> 00:00:45,006
that would be the sum of all those values
18
00:00:45,006 --> 00:00:47,004
divided by the number of values.
19
00:00:47,004 --> 00:00:51,000
So if I go to cell D2,
20
00:00:51,000 --> 00:00:54,000
which is next to the Mean label,
21
00:00:54,000 --> 00:00:57,005
and remember mean is the same as average.
22
00:00:57,005 --> 00:00:59,006
In D2 I can type an equal sign.
23
00:00:59,006 --> 00:01:03,001
And then the function we'll use is AVERAGE.
24
00:01:03,001 --> 00:01:09,005
And I will select the data in the range A2 through A12.
25
00:01:09,005 --> 00:01:11,006
Right parenthesis and Enter.
26
00:01:11,006 --> 00:01:16,007
And we get a mean, or average, of 29.45 days.
27
00:01:16,007 --> 00:01:21,007
So just under 29 and 1/2 days on average.
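For anyone following along outside Excel, here is a minimal Python sketch of the same calculation: the sum of the values divided by the number of values. The numbers in `transit_days` are hypothetical, chosen only so that they agree with the figures quoted in the narration (11 values, a mean of about 29.45); they are not copied from the workbook.

```python
from statistics import mean

# Hypothetical transit times in days, chosen to match the narrated results.
# They are an assumption, not the actual contents of 01_01_Means.
transit_days = [23, 27, 27, 28, 28, 28, 29, 29, 31, 36, 38]

# The mean is the sum of all values divided by the number of values.
average = sum(transit_days) / len(transit_days)

# statistics.mean performs the same calculation.
library_average = mean(transit_days)

print(round(average, 2))
```

Under these assumed values the result rounds to 29.45, which lines up with what AVERAGE returned in the narration.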
28
00:01:21,007 --> 00:01:24,001
You can also calculate the median.
29
00:01:24,001 --> 00:01:28,006
The median is the middle value in a set of data
30
00:01:28,006 --> 00:01:31,001
that has been sorted into either ascending
31
00:01:31,001 --> 00:01:32,008
or descending order.
32
00:01:32,008 --> 00:01:37,007
For example, if I were to click cell A2
33
00:01:37,007 --> 00:01:42,006
and then on the Home tab of the ribbon go to Sort & Filter
34
00:01:42,006 --> 00:01:45,005
and then click, in this case, Sort Smallest to Largest,
35
00:01:45,005 --> 00:01:48,007
although I could do either largest to smallest
36
00:01:48,007 --> 00:01:50,005
or smallest to largest,
37
00:01:50,005 --> 00:01:56,006
then I get the values of 23, 27 all the way down to 38.
38
00:01:56,006 --> 00:02:01,004
I have, I'll just count here, 11 rows of data.
39
00:02:01,004 --> 00:02:03,000
So if I go down to the sixth,
40
00:02:03,000 --> 00:02:09,006
that's 1, 2, 3, 4, 5, 6, the median value is 28.
41
00:02:09,006 --> 00:02:16,007
So if I go to cell D4 and type =MEDIAN
42
00:02:16,007 --> 00:02:20,001
and select A2 through A12.
43
00:02:20,001 --> 00:02:22,003
Right parenthesis and Enter.
44
00:02:22,003 --> 00:02:25,001
And I do, in fact, get 28.
45
00:02:25,001 --> 00:02:26,009
One way to examine your data
46
00:02:26,009 --> 00:02:30,000
is to compare the mean to the median.
47
00:02:30,000 --> 00:02:32,001
And in this case, they're very close.
48
00:02:32,001 --> 00:02:33,003
So that means that your data
49
00:02:33,003 --> 00:02:35,007
tends to be fairly evenly distributed.
50
00:02:35,007 --> 00:02:37,006
To give you an example of how the mean
51
00:02:37,006 --> 00:02:39,003
and median could change,
52
00:02:39,003 --> 00:02:42,001
let's say that we had a shipment that was just horrible.
53
00:02:42,001 --> 00:02:44,006
It took 75 days to arrive.
54
00:02:44,006 --> 00:02:46,009
So I'll go to cell A12
55
00:02:46,009 --> 00:02:50,009
and change 38 to 75 and press Enter.
56
00:02:50,009 --> 00:02:53,004
And you can see that the mean, or average, increased
57
00:02:53,004 --> 00:02:55,000
but the median did not.
58
00:02:55,000 --> 00:02:57,003
And that's because the value of 28
59
00:02:57,003 --> 00:03:00,001
is still in the middle of the sorted list.
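The effect of a single extreme value can be sketched in Python as well. The list below is a hypothetical stand-in for the workbook data, built to agree with the narrated results; replacing the last value, 38, with 75 mirrors the edit made to cell A12.

```python
from statistics import mean, median

# Hypothetical transit times in days (an assumption, not the workbook data).
transit_days = [23, 27, 27, 28, 28, 28, 29, 29, 31, 36, 38]

# Replace the largest value, 38, with the 75-day shipment.
with_outlier = transit_days[:-1] + [75]

mean_before = mean(transit_days)
mean_after = mean(with_outlier)
median_before = median(transit_days)
median_after = median(with_outlier)

# The mean moves toward the outlier; the median stays where it was.
print(round(mean_before, 2), median_before)
print(round(mean_after, 2), median_after)
```

With these assumed values the mean rises from about 29.45 to about 32.82, while the median remains 28 because the middle position of the sorted list is unchanged.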
60
00:03:00,001 --> 00:03:04,002
So I will press Control + Z to undo my last change.
61
00:03:04,002 --> 00:03:06,009
And we're back to our original data set.
62
00:03:06,009 --> 00:03:08,005
One last thing about the median
63
00:03:08,005 --> 00:03:11,007
is that if you have an odd number of values
64
00:03:11,007 --> 00:03:13,003
then the value in the middle
65
00:03:13,003 --> 00:03:15,006
of the sorted list is the median.
66
00:03:15,006 --> 00:03:17,008
If you have an even number of values
67
00:03:17,008 --> 00:03:20,008
then Excel takes the average of the two
68
00:03:20,008 --> 00:03:21,009
that are in the middle.
69
00:03:21,009 --> 00:03:25,004
So in A2 through A12, we have 11 values.
70
00:03:25,004 --> 00:03:31,007
If I go to cell A13, then let's say I type in 39,
71
00:03:31,007 --> 00:03:35,008
then I can update the median formula
72
00:03:35,008 --> 00:03:40,004
so that it includes the new cell A13, Enter.
73
00:03:40,004 --> 00:03:43,007
And you see we get the median of 28.5,
74
00:03:43,007 --> 00:03:45,009
which is the average of the two values
75
00:03:45,009 --> 00:03:49,007
that are now in the middle of the list.
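The odd and even cases just described can be written out as a short Python sketch. The function name `median_of` and the data values are hypothetical, picked so the results agree with the narration; this illustrates the rule and makes no claim about how Excel implements MEDIAN internally.

```python
def median_of(values):
    """Return the middle value of the sorted data, or the average
    of the two middle values when the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: a single value sits in the middle.
        return ordered[mid]
    # Even count: average the two values around the middle.
    return (ordered[mid - 1] + ordered[mid]) / 2


# Hypothetical transit times in days (an assumption, not the workbook data).
transit_days = [23, 27, 27, 28, 28, 28, 29, 29, 31, 36, 38]

odd_median = median_of(transit_days)          # 11 values
even_median = median_of(transit_days + [39])  # 12 values after adding 39

print(odd_median, even_median)
```

Under these assumptions the 11-value list gives 28, and adding 39 gives 28.5, the average of 28 and 29.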
76
00:03:49,007 --> 00:03:51,001
I'll click to release the selection
77
00:03:51,001 --> 00:03:55,002
and press Control + Z to undo the data entry
78
00:03:55,002 --> 00:03:57,007
and also the formula change.
79
00:03:57,007 --> 00:04:00,000
The final type of measure that I'll describe here
80
00:04:00,000 --> 00:04:01,001
is the mode.
81
00:04:01,001 --> 00:04:06,005
And that is the most frequently occurring value or values.
82
00:04:06,005 --> 00:04:10,006
If I go to cell D6 which is next to Mode Single
83
00:04:10,006 --> 00:04:14,008
I'll type an equal sign, and then I'll type MODE
84
00:04:14,008 --> 00:04:16,007
and you see that underneath,
85
00:04:16,007 --> 00:04:20,001
I have MODE.MULT, which is for multiple values,
86
00:04:20,001 --> 00:04:23,001
or single, S-N-G-L.
87
00:04:23,001 --> 00:04:25,009
I'll click single for now and press Tab.
88
00:04:25,009 --> 00:04:28,008
And I will select the same data range as before.
89
00:04:28,008 --> 00:04:31,002
A2 through A12.
90
00:04:31,002 --> 00:04:35,001
Right parenthesis and Enter and we get 28.
91
00:04:35,001 --> 00:04:36,008
And if I go over to the data set
92
00:04:36,008 --> 00:04:43,001
I can see that 28 occurs three times, 27 twice and 29 twice.
93
00:04:43,001 --> 00:04:47,002
And every other value occurs a single time.
94
00:04:47,002 --> 00:04:50,002
So 28 is, in fact, the mode.
95
00:04:50,002 --> 00:04:53,002
If I were to change the value
96
00:04:53,002 --> 00:04:58,001
at the bottom from 38 to 27 and press Enter,
97
00:04:58,001 --> 00:05:02,002
then you can see that the mode has changed to 27.
98
00:05:02,002 --> 00:05:06,002
And even though 27 and 28 occur the same number of times,
99
00:05:06,002 --> 00:05:09,002
27 occurs first in the data set.
100
00:05:09,002 --> 00:05:12,004
So that means that Excel will select it
101
00:05:12,004 --> 00:05:17,001
instead of going with the larger value.
102
00:05:17,001 --> 00:05:20,002
That's the rule it uses, the first value.
103
00:05:20,002 --> 00:05:24,002
If you do want to find multiple values
104
00:05:24,002 --> 00:05:25,007
that occur most frequently,
105
00:05:25,007 --> 00:05:28,003
in other words, they're tied within a data set,
106
00:05:28,003 --> 00:05:30,007
you can use MODE.MULT.
107
00:05:30,007 --> 00:05:37,001
So I'll go to cell D8, type an equal sign, mode, M-U-L-T,
108
00:05:37,001 --> 00:05:39,004
and note that I still have my change from before.
109
00:05:39,004 --> 00:05:42,005
So I have 27 instead of 38.
110
00:05:42,005 --> 00:05:44,007
Select cells A2 through A12.
111
00:05:44,007 --> 00:05:46,006
Right parenthesis and Enter.
112
00:05:46,006 --> 00:05:50,004
And we get 27 and 28.
113
00:05:50,004 --> 00:05:52,007
In the past, you would've had to identify
114
00:05:52,007 --> 00:05:56,008
the number of modes that you wanted to display
115
00:05:56,008 --> 00:05:58,004
using MODE.MULT.
116
00:05:58,004 --> 00:06:01,002
However, in recent versions of Excel
117
00:06:01,002 --> 00:06:03,009
there is the spill behavior
118
00:06:03,009 --> 00:06:05,005
and that is what occurred here.
119
00:06:05,005 --> 00:06:10,002
Even though I typed the formula into cell D8
120
00:06:10,002 --> 00:06:11,001
when I pressed Enter
121
00:06:11,001 --> 00:06:15,001
because there were multiple values available
122
00:06:15,001 --> 00:06:17,004
then it applied the same formula
123
00:06:17,004 --> 00:06:22,007
and spilled the result from cell D8 to D9.
124
00:06:22,007 --> 00:06:28,002
If I were to change the value in A12, from 27 back to 38,
125
00:06:28,002 --> 00:06:32,009
then you can see that the mode is still 28,
126
00:06:32,009 --> 00:06:35,007
and you can see that there is only one value
127
00:06:35,007 --> 00:06:40,001
that occurs most frequently in the data list.
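As a rough analogue of the single-mode and multiple-mode functions, Python's `statistics` module offers `mode` and `multimode` (Python 3.8 or later). The data values are hypothetical and chosen to agree with the narration; the comparison to Excel's functions is an analogy rather than a statement about how Excel computes its result.

```python
from statistics import mode, multimode

# Hypothetical transit times in days (an assumption, not the workbook data).
original = [23, 27, 27, 28, 28, 28, 29, 29, 31, 36, 38]

# Mirror the edit from the narration: change the last value from 38 to 27,
# so that 27 and 28 each occur three times.
edited = original[:-1] + [27]

# mode returns one value; on a tie it returns the first one encountered.
single_original = mode(original)
single_edited = mode(edited)

# multimode returns every value tied for most frequent.
multi_original = multimode(original)
multi_edited = multimode(edited)

print(single_original, multi_original)
print(single_edited, multi_edited)
```

With the original list both calls report 28 alone. After the edit, `mode` returns 27 because it appears first in the data, and `multimode` returns both 27 and 28.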
128
00:06:40,001 --> 00:06:41,007
So as you can see there are a lot of ways
129
00:06:41,007 --> 00:06:43,008
to get information about your data.
130
00:06:43,008 --> 00:06:46,006
Whether it's the average, the middle value or median,
131
00:06:46,006 --> 00:06:49,007
or the most frequently occurring value, or values,
132
00:06:49,007 --> 00:06:52,000
you can learn a lot in just a few minutes.