Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,000 --> 00:00:03,270
Dot maps are a fun way of being able to show quantities
2
00:00:03,270 --> 00:00:07,245
over areas as a nice alternative to a choropleth map.
3
00:00:07,245 --> 00:00:10,320
Like the most common way that people tend to think of
4
00:00:10,320 --> 00:00:13,650
mapping values for areas is to use a choropleth map,
5
00:00:13,650 --> 00:00:18,385
when you do that, your standardizing the data in this case,
6
00:00:18,385 --> 00:00:20,854
population per square kilometer.
7
00:00:20,854 --> 00:00:23,420
So you take counts for an area,
8
00:00:23,420 --> 00:00:26,915
you divide them by the area itself to standardize or normalize it.
9
00:00:26,915 --> 00:00:30,855
That gives you a density, and that gives you a choropleth map like this where you're
10
00:00:30,855 --> 00:00:35,905
assigning the shades or colors to those classes in order to be able to show a pattern.
11
00:00:35,905 --> 00:00:38,690
Dot density maps don't work that way.
12
00:00:39,090 --> 00:00:41,485
With a dot density map,
13
00:00:41,485 --> 00:00:42,950
you're not doing any of that stuff.
14
00:00:42,950 --> 00:00:45,995
All you're doing is showing density visually.
15
00:00:45,995 --> 00:00:49,250
So, you have a dot that's worth a certain amount,
16
00:00:49,250 --> 00:00:51,230
in terms of the count of whatever it is.
17
00:00:51,230 --> 00:00:53,650
Here its population but it could be something else,
18
00:00:53,650 --> 00:00:55,630
and the more dots there are,
19
00:00:55,630 --> 00:00:57,090
the higher the density is.
20
00:00:57,090 --> 00:00:59,775
So, you're showing the density visually,
21
00:00:59,775 --> 00:01:00,930
you're not calculating it.
22
00:01:00,930 --> 00:01:02,215
This is something I find,
23
00:01:02,215 --> 00:01:03,620
especially once people have learned about
24
00:01:03,620 --> 00:01:06,420
choropleth maps is that they automatically want when they're creating
25
00:01:06,420 --> 00:01:10,720
a dot map to normalize it and they don't have to do that.
26
00:01:10,720 --> 00:01:13,310
The whole idea is that you're just saying, where there's more dots,
27
00:01:13,310 --> 00:01:15,860
there's more of that thing in this case people,
28
00:01:15,860 --> 00:01:17,010
where there's fewer dots,
29
00:01:17,010 --> 00:01:18,435
there's fewer of that thing.
30
00:01:18,435 --> 00:01:19,835
That's all there is to it.
31
00:01:19,835 --> 00:01:23,240
So, I think a fun or different way.
32
00:01:23,240 --> 00:01:28,100
It's eye-catching to show variations in values across an area.
33
00:01:28,100 --> 00:01:30,680
I think it's something that pretty much anybody could easily relate
34
00:01:30,680 --> 00:01:35,690
to without any instruction or experience with this is you can just look at it and say,
35
00:01:35,690 --> 00:01:38,795
"Where you see a cluster of dots, there's more of that thing."
36
00:01:38,795 --> 00:01:41,230
Dot maps are easy to create with a software,
37
00:01:41,230 --> 00:01:43,420
but it was not always so easy to make though.
38
00:01:43,420 --> 00:01:46,010
In the old days, pre-GIS, pre-software,
39
00:01:46,010 --> 00:01:49,405
pre-computer, dot maps were actually created manually.
40
00:01:49,405 --> 00:01:52,955
They were an incredibly time-intensive process.
41
00:01:52,955 --> 00:01:56,480
You had somebody who would literally place
42
00:01:56,480 --> 00:02:01,025
each dot on a map manually using a pen or whatever methods they were using,
43
00:02:01,025 --> 00:02:02,780
and there was a pro and con to that.
44
00:02:02,780 --> 00:02:06,590
So, the good thing about that was that those dots could be placed
45
00:02:06,590 --> 00:02:10,885
very intentionally or very intelligently.
46
00:02:10,885 --> 00:02:12,100
So, if you were say,
47
00:02:12,100 --> 00:02:14,835
making a dot map of the population in United States,
48
00:02:14,835 --> 00:02:17,310
you'd have more dots around the cities,
49
00:02:17,310 --> 00:02:19,130
because you would know where those cities were and
50
00:02:19,130 --> 00:02:21,185
you'd be able to place those dots accordingly.
51
00:02:21,185 --> 00:02:23,950
You'd have fewer dots say in the prairies,
52
00:02:23,950 --> 00:02:26,995
or desert areas, or whatever.
53
00:02:26,995 --> 00:02:30,350
So, you could use this auxiliary information as
54
00:02:30,350 --> 00:02:33,915
an intelligent human to decide where those dots should be placed.
55
00:02:33,915 --> 00:02:37,935
The downside of course, is that to create one dot map could take months.
56
00:02:37,935 --> 00:02:39,870
That's right, literally months and so,
57
00:02:39,870 --> 00:02:41,690
that's why you didn't see them that often.
58
00:02:41,690 --> 00:02:44,800
With the advent of computers, and software,
59
00:02:44,800 --> 00:02:48,315
and GIS, you can now make a dot map in a matter of seconds.
60
00:02:48,315 --> 00:02:50,630
The downside is, you don't necessarily have
61
00:02:50,630 --> 00:02:53,785
that same level of control about the dot placement.
62
00:02:53,785 --> 00:02:55,240
There is ways around that,
63
00:02:55,240 --> 00:02:56,975
I'll show you a little bit about how to do that,
64
00:02:56,975 --> 00:02:58,490
but it's not exactly the same thing.
65
00:02:58,490 --> 00:03:00,585
So, yes you can now make them quickly,
66
00:03:00,585 --> 00:03:03,845
but it's not quite to the same level of
67
00:03:03,845 --> 00:03:05,810
quality that you would get if someone was
68
00:03:05,810 --> 00:03:08,230
painstakingly putting those dots in there by hand.
69
00:03:08,230 --> 00:03:09,770
That's okay. I'm willing to live with that,
70
00:03:09,770 --> 00:03:12,730
I don't really want to be the person putting those dots on the map.
71
00:03:12,730 --> 00:03:15,035
So, here I have the City of Toronto.
72
00:03:15,035 --> 00:03:18,370
If I treat the City of Toronto as one polygon.
73
00:03:18,370 --> 00:03:22,325
So, there's one population value for the entire city, let's say.
74
00:03:22,325 --> 00:03:24,990
I then tell the software, "Okay,
75
00:03:24,990 --> 00:03:29,510
here's my population for this polygon.
76
00:03:29,510 --> 00:03:34,780
I'm going to assign a dot value of one dot equals 1,000 people."
77
00:03:34,780 --> 00:03:40,825
What the software will then do is randomly place those dots inside that polygon.
78
00:03:40,825 --> 00:03:43,370
So, it takes the population value that I give it,
79
00:03:43,370 --> 00:03:45,795
divides it by the value for each dot,
80
00:03:45,795 --> 00:03:47,100
which here is 1,000,
81
00:03:47,100 --> 00:03:49,880
comes up with a total number of dots that it needs for that area,
82
00:03:49,880 --> 00:03:53,675
and then just randomly places them on the map.
83
00:03:53,675 --> 00:03:56,080
So, if you look at this,
84
00:03:56,080 --> 00:03:57,725
even if you don't know Toronto,
85
00:03:57,725 --> 00:04:00,500
you probably could figure out fairly quickly that this isn't
86
00:04:00,500 --> 00:04:03,725
very representative of the population patterns of the city.
87
00:04:03,725 --> 00:04:06,290
Where's the downtown area? Where the suburbs?
88
00:04:06,290 --> 00:04:09,230
Why don't I see patterns and differences and so on
89
00:04:09,230 --> 00:04:12,630
about clustering of where people actually live?
90
00:04:12,630 --> 00:04:16,610
Well, if you just do it the way the software says, "Okay.
91
00:04:16,610 --> 00:04:18,175
Well, if this is what you want me to do."
92
00:04:18,175 --> 00:04:20,150
So, it's just do what you told it to do.
93
00:04:20,150 --> 00:04:23,025
You're not going to end up with a very useful or effective dot map.
94
00:04:23,025 --> 00:04:26,320
But there's a way around that. So, let me show you how that works.
95
00:04:26,320 --> 00:04:30,625
If you still have the dots randomly placed,
96
00:04:30,625 --> 00:04:34,310
but you tell it to randomly place them inside smaller areas,
97
00:04:34,310 --> 00:04:36,610
then you can start to get something that's a little more realistic.
98
00:04:36,610 --> 00:04:38,570
So, what I've done here is I have
99
00:04:38,570 --> 00:04:42,865
population values for each census tract in the city of Toronto.
100
00:04:42,865 --> 00:04:46,400
What I can do is, I'm still using one dot equals 1,000,
101
00:04:46,400 --> 00:04:48,185
but when I do that,
102
00:04:48,185 --> 00:04:50,730
now it's randomly placing them inside each census tracks.
103
00:04:50,730 --> 00:04:52,840
So, some census tracks are going to have a lot more people.
104
00:04:52,840 --> 00:04:54,515
Some are going to have a lot fewer people.
105
00:04:54,515 --> 00:04:56,185
So, they'll be more dots,
106
00:04:56,185 --> 00:04:58,780
where they're needed and fewer dots where they're not needed.
107
00:04:58,780 --> 00:05:00,350
So, now I've got a pattern that's
108
00:05:00,350 --> 00:05:03,865
a little more representative of the real population patterns for the city.
109
00:05:03,865 --> 00:05:08,150
One little tip by the way is when you're doing this method is that you don't actually
110
00:05:08,150 --> 00:05:12,430
have to show the boundaries of the areas that you're using to create that dot map.
111
00:05:12,430 --> 00:05:14,310
Like for here, it's the census tracks,
112
00:05:14,310 --> 00:05:17,560
but I could actually just make those invisible like that.
113
00:05:17,560 --> 00:05:18,950
So, that's actually much better.
114
00:05:18,950 --> 00:05:20,970
So, now I'm getting that effect of a dot map,
115
00:05:20,970 --> 00:05:24,540
I've told it to randomly place them inside each of those little areas.
116
00:05:24,540 --> 00:05:26,110
So now we can see that yes,
117
00:05:26,110 --> 00:05:29,330
there's definitely more people downtown where you would expect
118
00:05:29,330 --> 00:05:32,780
to have clusters of points on higher population density.
119
00:05:32,780 --> 00:05:34,290
As you move out to the suburbs,
120
00:05:34,290 --> 00:05:37,080
you're getting less population density so fewer points.
121
00:05:37,080 --> 00:05:40,840
So that works pretty well. We can actually take that a step further,
122
00:05:40,840 --> 00:05:42,560
though and use even smaller areas.
123
00:05:42,560 --> 00:05:45,225
So, in the Canadian census system,
124
00:05:45,225 --> 00:05:48,560
dissemination areas are smaller than census tracts,
125
00:05:48,560 --> 00:05:50,840
but we have population counts for those as well.
126
00:05:50,840 --> 00:05:52,400
So, I can use exactly the same method,
127
00:05:52,400 --> 00:05:54,170
I can tell it to randomly place the dots,
128
00:05:54,170 --> 00:05:57,395
they're still worth one dot equals 1,000,
129
00:05:57,395 --> 00:06:00,550
but now it's randomly placing them inside even smaller areas.
130
00:06:00,550 --> 00:06:05,400
So, if I do that and takeaway the boundaries again,
131
00:06:05,400 --> 00:06:07,940
I get I think an even better version of that dot map,
132
00:06:07,940 --> 00:06:11,910
where you really get this nice clustering taking place in different parts of the city.
133
00:06:11,910 --> 00:06:13,395
So, you can see there in there.
134
00:06:13,395 --> 00:06:20,165
But then there's areas that are much fewer dots so we have lower population densities.
135
00:06:20,165 --> 00:06:22,160
So, depending on the size of
136
00:06:22,160 --> 00:06:25,250
the units that are available to you and the data that's available,
137
00:06:25,250 --> 00:06:27,620
these are ways that you can try to simulate
138
00:06:27,620 --> 00:06:31,250
that manual dot placement method is by
139
00:06:31,250 --> 00:06:35,735
making those randomly placed dots constrained to much smaller areas.
140
00:06:35,735 --> 00:06:39,275
So here we have census tracks versus dissemination areas.
141
00:06:39,275 --> 00:06:40,880
Both of them are not bad,
142
00:06:40,880 --> 00:06:43,250
but I have to admit I think the dissemination area map
143
00:06:43,250 --> 00:06:45,650
looks a little bit better and does a better job.
144
00:06:45,650 --> 00:06:49,030
Here's a comparison of a choropleth map to a dot density map,
145
00:06:49,030 --> 00:06:51,120
and there's nothing really wrong with a choropleth map.
146
00:06:51,120 --> 00:06:52,730
There are very popular. They're used all the time.
147
00:06:52,730 --> 00:06:55,890
Most people I think are fairly comfortable looking at them now.
148
00:06:55,890 --> 00:07:01,220
Sometimes it's nice to mix things up to try something that's a little off the wall,
149
00:07:01,220 --> 00:07:02,750
not even off the wall really but just a little bit
150
00:07:02,750 --> 00:07:05,320
different and to use something like a dot density map.
151
00:07:05,320 --> 00:07:07,840
So don't always automatically go to choropleth,
152
00:07:07,840 --> 00:07:11,130
if a dot density might work for you.
153
00:07:11,450 --> 00:07:14,080
There's really not a lot to play with,
154
00:07:14,080 --> 00:07:16,515
with a dot density map in terms of the settings.
155
00:07:16,515 --> 00:07:21,365
Really, all you can do is set the size of the dot and the value of the dot.
156
00:07:21,365 --> 00:07:24,070
So, here the dot size is three points.
157
00:07:24,070 --> 00:07:27,570
So, that's what a three-point size dot would look like on the map,
158
00:07:27,570 --> 00:07:29,330
they're giving you an example of that.
159
00:07:29,330 --> 00:07:32,265
Then the dot value here is 1,000,
160
00:07:32,265 --> 00:07:34,090
you can set that to whatever you want.
161
00:07:34,090 --> 00:07:35,825
So, between those two things,
162
00:07:35,825 --> 00:07:39,760
that's really your main way of varying the way your map is going to look.
163
00:07:39,760 --> 00:07:45,030
So, let's have a look at the effects that can have on the interpretation of your dot map.
164
00:07:45,030 --> 00:07:48,330
If your dot value is too high.
165
00:07:48,330 --> 00:07:51,485
So, here we have one dot equals 10,000,
166
00:07:51,485 --> 00:07:54,935
you end up with dot map of the population of Toronto,
167
00:07:54,935 --> 00:07:57,085
that makes it look like there's nobody there.
168
00:07:57,085 --> 00:07:59,000
That the city's practically deserted.
169
00:07:59,000 --> 00:08:01,615
I don't know if there was some zombie apocalypse or something,
170
00:08:01,615 --> 00:08:04,550
but whatever happened, the city is practically deserted.
171
00:08:04,550 --> 00:08:07,390
We just have these stray dots wandering around the city,
172
00:08:07,390 --> 00:08:09,155
wondering where everybody went.
173
00:08:09,155 --> 00:08:13,240
So, that's not exactly the impression you want to give somebody is that,
174
00:08:13,240 --> 00:08:15,420
just by making the dot value too high,
175
00:08:15,420 --> 00:08:17,005
you end up with too few dots,
176
00:08:17,005 --> 00:08:19,640
then you end up with a map that doesn't really have any density at all,
177
00:08:19,640 --> 00:08:24,385
and doesn't really show people a useful pattern to the data.
178
00:08:24,385 --> 00:08:26,780
Here the dot value is pretty good.
179
00:08:26,780 --> 00:08:30,860
We have one dot equals 1,000 which is pretty reasonable for this data set.
180
00:08:30,860 --> 00:08:33,505
But the dot size is too small.
181
00:08:33,505 --> 00:08:35,165
So, even though we have,
182
00:08:35,165 --> 00:08:37,015
I think a good number of dots.
183
00:08:37,015 --> 00:08:39,680
If this dots are too small, they're hard to see.
184
00:08:39,680 --> 00:08:42,000
So, again, remember the whole idea of the dot map,
185
00:08:42,000 --> 00:08:44,390
is that you're trying to show density visually.
186
00:08:44,390 --> 00:08:48,230
If nobody can see anything that looks like a dense area,
187
00:08:48,230 --> 00:08:51,695
if you don't have clusters of dots that are easily visible,
188
00:08:51,695 --> 00:08:54,470
then again you're getting this idea that it's not very dense in
189
00:08:54,470 --> 00:08:57,735
the city and you're going to have people misinterpret that dataset.
190
00:08:57,735 --> 00:09:02,000
So, if you have too high of a dot value, it's not going to work well,
191
00:09:02,000 --> 00:09:06,109
and if you have too small of a dot size,
192
00:09:06,109 --> 00:09:07,900
it's not going to work very well either.
193
00:09:07,900 --> 00:09:11,335
Also, if the dot sizes are too big,
194
00:09:11,335 --> 00:09:12,655
you end up with something like this,
195
00:09:12,655 --> 00:09:15,070
in which obviously this is a little bit extreme.
196
00:09:15,070 --> 00:09:17,660
But the interesting thing is that when
197
00:09:17,660 --> 00:09:20,350
they've studied how people interpret maps like this,
198
00:09:20,350 --> 00:09:22,035
if these dots are too big,
199
00:09:22,035 --> 00:09:25,090
people actually tend to think that it's a crude map.
200
00:09:25,090 --> 00:09:28,265
In other words, they start to question the validity of the data.
201
00:09:28,265 --> 00:09:30,240
Even though the data is exactly the same,
202
00:09:30,240 --> 00:09:31,540
they look at that and say,
203
00:09:31,540 --> 00:09:36,425
"Yeah whoever made that probably didn't know what they are doing, it looks crude."
204
00:09:36,425 --> 00:09:38,090
So, I don't know if I would really trust
205
00:09:38,090 --> 00:09:40,190
the data with this map so you don't want to do that.
206
00:09:40,190 --> 00:09:43,345
Conversely by the way, if the dot size is too small,
207
00:09:43,345 --> 00:09:45,740
it looks like there's this pinpoint accuracy to
208
00:09:45,740 --> 00:09:48,450
the location of it and people will start to think that it's more accurate,
209
00:09:48,450 --> 00:09:50,570
when it's really just the same data and the same dots,
210
00:09:50,570 --> 00:09:52,370
it's just the way that's it's being shown is different.
211
00:09:52,370 --> 00:09:55,220
So, I'm showing you some extremes to give
212
00:09:55,220 --> 00:09:57,950
you a sense of what's possible or what's good or bad,
213
00:09:57,950 --> 00:10:00,385
but you want to avoid a situation like this.
214
00:10:00,385 --> 00:10:02,135
The problem of course with this,
215
00:10:02,135 --> 00:10:04,580
is that there's too much density going on.
216
00:10:04,580 --> 00:10:07,070
Because the dots are all starting to meet each other,
217
00:10:07,070 --> 00:10:11,230
you've got huge parts of the city that all look like they're really dense.
218
00:10:11,230 --> 00:10:15,550
So, again, you're losing some of that differentiation, or pattern,
219
00:10:15,550 --> 00:10:18,020
or clustering, so the people can look at that and say,
220
00:10:18,020 --> 00:10:20,085
"Oh, here's a higher value or lower value."
221
00:10:20,085 --> 00:10:22,190
It just all starts to look the same,
222
00:10:22,190 --> 00:10:23,490
whether it's high or low,
223
00:10:23,490 --> 00:10:28,985
and then it's losing its value as a good map. This map is pretty good.
224
00:10:28,985 --> 00:10:31,840
So, we have a dot value that seems to work well.
225
00:10:31,840 --> 00:10:34,155
We have a dot size that seems to work well.
226
00:10:34,155 --> 00:10:38,015
What you're striving for as much as possible is to have
227
00:10:38,015 --> 00:10:43,180
some areas where the dots cluster together and start to coalesce,
228
00:10:43,180 --> 00:10:45,590
and other areas where they're still separated out.
229
00:10:45,590 --> 00:10:50,045
But you can see these areas where there's definitely some clustering happening,
230
00:10:50,045 --> 00:10:51,765
but not too much.
231
00:10:51,765 --> 00:10:53,310
It's a bit of experimentation.
232
00:10:53,310 --> 00:10:55,220
It depends on the dataset and
233
00:10:55,220 --> 00:11:01,070
both the statistical and geographical spacing or distribution of those values,
234
00:11:01,070 --> 00:11:02,870
but you have to work with a little bit.
235
00:11:02,870 --> 00:11:06,530
So here, you can see that there's definitely some good clustering going on,
236
00:11:06,530 --> 00:11:08,030
some areas that are not as clustered.
237
00:11:08,030 --> 00:11:10,590
So overall, I think this works fairly well.
238
00:11:10,590 --> 00:11:13,160
What you're striving for as I said,
239
00:11:13,160 --> 00:11:16,360
is to have coalescence of dots.
240
00:11:16,360 --> 00:11:17,800
So, that's what we would call it.
241
00:11:17,800 --> 00:11:19,880
In other words, where the dots start to overlap with each
242
00:11:19,880 --> 00:11:21,980
other in the densest parts of your map.
243
00:11:21,980 --> 00:11:23,540
You don't want to have too much of this,
244
00:11:23,540 --> 00:11:25,130
and you don't want to have not enough of it.
245
00:11:25,130 --> 00:11:29,695
You're striving for this enough coalescence as I've shown in the example here,
246
00:11:29,695 --> 00:11:31,810
where you have some areas where again,
247
00:11:31,810 --> 00:11:33,600
remember, you're showing density visually.
248
00:11:33,600 --> 00:11:36,680
So you have to have that coalescence in order for people to get that idea,
249
00:11:36,680 --> 00:11:38,960
that there's a higher density taking place.
250
00:11:38,960 --> 00:11:41,365
So that's basically it for dot maps.
251
00:11:41,365 --> 00:11:45,370
There's lots of ways you can experiment with this, with different datasets.
252
00:11:45,370 --> 00:11:47,960
Essentially, one thing I didn't mention was colors.
253
00:11:47,960 --> 00:11:49,150
You have dot size and dot value,
254
00:11:49,150 --> 00:11:52,000
and of course the color in relation to the rest of your map.
255
00:11:52,000 --> 00:11:55,710
But I think if you play around with this you'll find that dot maps can
256
00:11:55,710 --> 00:12:00,160
be a fun and interesting alternative to just your typical choropleth map.21653
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.