All language subtitles for 002 YOLOv7 architecture detailed explanation

af Afrikaans
sq Albanian
am Amharic
ar Arabic
hy Armenian
az Azerbaijani
eu Basque
be Belarusian
bn Bengali
bs Bosnian
bg Bulgarian
ca Catalan
ceb Cebuano
ny Chichewa
zh-CN Chinese (Simplified)
zh-TW Chinese (Traditional)
co Corsican
hr Croatian
cs Czech
da Danish
nl Dutch
en English
eo Esperanto
et Estonian
tl Filipino
fi Finnish
fr French
fy Frisian
gl Galician
ka Georgian
de German
el Greek
gu Gujarati
ht Haitian Creole
ha Hausa
haw Hawaiian
iw Hebrew
hi Hindi
hmn Hmong
hu Hungarian
is Icelandic
ig Igbo
id Indonesian
ga Irish
it Italian
ja Japanese
jw Javanese
kn Kannada
kk Kazakh
km Khmer
ko Korean
ku Kurdish (Kurmanji)
ky Kyrgyz
lo Lao
la Latin
lv Latvian
lt Lithuanian
lb Luxembourgish
mk Macedonian
mg Malagasy
ms Malay
ml Malayalam
mt Maltese
mi Maori
mr Marathi
mn Mongolian
my Myanmar (Burmese)
ne Nepali
no Norwegian
ps Pashto
fa Persian
pl Polish
pt Portuguese
pa Punjabi
ro Romanian
ru Russian
sm Samoan
gd Scots Gaelic
sr Serbian
st Sesotho
sn Shona
sd Sindhi
si Sinhala
sk Slovak
sl Slovenian
so Somali
es Spanish
su Sundanese
sw Swahili
sv Swedish
tg Tajik
ta Tamil
te Telugu
th Thai
tr Turkish
uk Ukrainian
ur Urdu
uz Uzbek
vi Vietnamese Download
cy Welsh
xh Xhosa
yi Yiddish
yo Yoruba
zu Zulu
or Odia (Oriya)
rw Kinyarwanda
tk Turkmen
tt Tatar
ug Uyghur
Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:00,420 --> 00:00:04,050 In this video, I will explain about the architecture of your office seven. 2 00:00:04,960 --> 00:00:07,270 Here is the architecture of YOLO v seven. 3 00:00:08,010 --> 00:00:14,760 In the backbone computational blocks YOLO v seven using e lan and e lan, but e line is only used on 4 00:00:14,820 --> 00:00:20,730 all of 76 c in the net of e seven sensors p p to sp exp. 5 00:00:21,480 --> 00:00:26,520 YOLO v seven also employs an optimize path aggregation network by incorporating e lan. 6 00:00:27,470 --> 00:00:28,040 On the head. 7 00:00:28,040 --> 00:00:32,189 YOLO V seven integrates three scales based on the neck with additional rib conf. 8 00:00:33,170 --> 00:00:33,860 The big ones. 9 00:00:33,860 --> 00:00:40,400 Architectural details are as follows CB's is mainly composed of convolution bells, normalization and 10 00:00:40,400 --> 00:00:42,050 similar activation function. 11 00:00:42,860 --> 00:00:46,970 KBS connects better normalization layer directly to convolutional layer. 12 00:00:47,740 --> 00:00:52,510 The purpose of this is to integrate the mean and variance of bats normalization into the bias and weight 13 00:00:52,510 --> 00:00:54,940 of convolutional layer two Inference states. 14 00:00:55,630 --> 00:01:00,460 YOLO v seven As previously stated implies, you learn in its computational blocks. 15 00:01:01,120 --> 00:01:02,800 Here are the details from Ilan. 16 00:01:03,580 --> 00:01:05,470 Next there is empty conf. 17 00:01:06,210 --> 00:01:09,990 The NP conflict is mainly divided into Max Pro and CBSE. 18 00:01:10,980 --> 00:01:12,180 Next on the neck. 19 00:01:12,970 --> 00:01:15,580 First, I will explain what spe pxp. 20 00:01:16,570 --> 00:01:19,000 Here are the details from SP PXP. 21 00:01:19,780 --> 00:01:25,720 The SP SP module adds the correct operation at the end based on the SP module, which is fused with 22 00:01:25,720 --> 00:01:28,060 the FITS before the SP module. 23 00:01:28,570 --> 00:01:30,790 It aims to enrich the feature information. 24 00:01:31,550 --> 00:01:33,920 The following are the details of the SP. 25 00:01:35,000 --> 00:01:39,880 YOLO v seven also uses an optimised path aggregation network that incorporates inline. 26 00:01:40,870 --> 00:01:44,860 Path aggregation network and student sample operation after sample. 27 00:01:45,750 --> 00:01:50,730 Power Aggregation network was chosen because of its ability to accurately preserve spatial information 28 00:01:50,730 --> 00:01:53,460 which aids in the proper localization of pixels. 29 00:01:54,090 --> 00:01:56,790 Next, here are the details of the event on the NEC. 30 00:01:57,760 --> 00:02:03,100 On the head of seven integrates three scales based on the neck and allocates three anchor boxes under 31 00:02:03,100 --> 00:02:03,820 each scale. 32 00:02:04,610 --> 00:02:10,820 By using three scales, it increases accuracy when detecting three object sizes small, medium and large. 33 00:02:11,510 --> 00:02:13,610 In addition, we call this also added. 34 00:02:14,510 --> 00:02:16,760 Web.com refused to change the number of channels. 35 00:02:16,760 --> 00:02:21,500 Output from webcomic has a certain difference between training and inference. 36 00:02:22,190 --> 00:02:26,450 There is an additive output of the three branches during training and the parameters of the branches 37 00:02:26,450 --> 00:02:29,120 are parameterized to the main brands during deployment. 38 00:02:30,510 --> 00:02:33,390 That's the explanation of the YOLO v seven architecture. 39 00:02:33,810 --> 00:02:35,280 See you in the next video. 3778

Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.