Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,420 --> 00:00:04,050
In this video, I will explain about the architecture of your office seven.
2
00:00:04,960 --> 00:00:07,270
Here is the architecture of YOLO v seven.
3
00:00:08,010 --> 00:00:14,760
In the backbone computational blocks YOLO v seven using e lan and e lan, but e line is only used on
4
00:00:14,820 --> 00:00:20,730
all of 76 c in the net of e seven sensors p p to sp exp.
5
00:00:21,480 --> 00:00:26,520
YOLO v seven also employs an optimize path aggregation network by incorporating e lan.
6
00:00:27,470 --> 00:00:28,040
On the head.
7
00:00:28,040 --> 00:00:32,189
YOLO V seven integrates three scales based on the neck with additional rib conf.
8
00:00:33,170 --> 00:00:33,860
The big ones.
9
00:00:33,860 --> 00:00:40,400
Architectural details are as follows CB's is mainly composed of convolution bells, normalization and
10
00:00:40,400 --> 00:00:42,050
similar activation function.
11
00:00:42,860 --> 00:00:46,970
KBS connects better normalization layer directly to convolutional layer.
12
00:00:47,740 --> 00:00:52,510
The purpose of this is to integrate the mean and variance of bats normalization into the bias and weight
13
00:00:52,510 --> 00:00:54,940
of convolutional layer two Inference states.
14
00:00:55,630 --> 00:01:00,460
YOLO v seven As previously stated implies, you learn in its computational blocks.
15
00:01:01,120 --> 00:01:02,800
Here are the details from Ilan.
16
00:01:03,580 --> 00:01:05,470
Next there is empty conf.
17
00:01:06,210 --> 00:01:09,990
The NP conflict is mainly divided into Max Pro and CBSE.
18
00:01:10,980 --> 00:01:12,180
Next on the neck.
19
00:01:12,970 --> 00:01:15,580
First, I will explain what spe pxp.
20
00:01:16,570 --> 00:01:19,000
Here are the details from SP PXP.
21
00:01:19,780 --> 00:01:25,720
The SP SP module adds the correct operation at the end based on the SP module, which is fused with
22
00:01:25,720 --> 00:01:28,060
the FITS before the SP module.
23
00:01:28,570 --> 00:01:30,790
It aims to enrich the feature information.
24
00:01:31,550 --> 00:01:33,920
The following are the details of the SP.
25
00:01:35,000 --> 00:01:39,880
YOLO v seven also uses an optimised path aggregation network that incorporates inline.
26
00:01:40,870 --> 00:01:44,860
Path aggregation network and student sample operation after sample.
27
00:01:45,750 --> 00:01:50,730
Power Aggregation network was chosen because of its ability to accurately preserve spatial information
28
00:01:50,730 --> 00:01:53,460
which aids in the proper localization of pixels.
29
00:01:54,090 --> 00:01:56,790
Next, here are the details of the event on the NEC.
30
00:01:57,760 --> 00:02:03,100
On the head of seven integrates three scales based on the neck and allocates three anchor boxes under
31
00:02:03,100 --> 00:02:03,820
each scale.
32
00:02:04,610 --> 00:02:10,820
By using three scales, it increases accuracy when detecting three object sizes small, medium and large.
33
00:02:11,510 --> 00:02:13,610
In addition, we call this also added.
34
00:02:14,510 --> 00:02:16,760
Web.com refused to change the number of channels.
35
00:02:16,760 --> 00:02:21,500
Output from webcomic has a certain difference between training and inference.
36
00:02:22,190 --> 00:02:26,450
There is an additive output of the three branches during training and the parameters of the branches
37
00:02:26,450 --> 00:02:29,120
are parameterized to the main brands during deployment.
38
00:02:30,510 --> 00:02:33,390
That's the explanation of the YOLO v seven architecture.
39
00:02:33,810 --> 00:02:35,280
See you in the next video.
3778
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.