subtitlecat.com

All language subtitles for 002 YOLOv7 architecture detailed explanation

Afrikaans

Akan

Albanian

Amharic

Arabic

Armenian

Azerbaijani

Basque

Belarusian

Bemba

Bengali

Bihari

Bosnian

Breton

Bulgarian

Cambodian

Catalan

Cebuano

Cherokee

Chichewa

Chinese (Simplified)

Chinese (Traditional)

Corsican

Croatian

Czech

Danish

Dutch

English

Esperanto

Estonian

Ewe

Faroese

Filipino

Finnish

French

Frisian

Galician

Georgian

German

Greek

Guarani

Gujarati

Haitian Creole

Hausa

Hawaiian

Hebrew

Hindi

Hmong

Hungarian

Icelandic

Igbo

Indonesian

Interlingua

Irish

Italian

Japanese

Javanese

Kannada

Kazakh

Kinyarwanda

Kirundi

Kongo

Korean

Krio (Sierra Leone)

Kurdish

Kurdish (Soranî)

Kyrgyz

Laothian

Latin

Latvian

Lingala

Lithuanian

Lozi

Luganda

Luo

Luxembourgish

Macedonian

Malagasy

Malay

Malayalam

Maltese

Maori

Marathi

Mauritian Creole

Moldavian

Mongolian

Myanmar (Burmese)

Montenegrin

Nepali

Nigerian Pidgin

Northern Sotho

Norwegian

Norwegian (Nynorsk)

Occitan

Oriya

Oromo

Pashto

Persian

Polish

Portuguese (Brazil)

Portuguese (Portugal)

Punjabi

Quechua

Romanian

Romansh

Runyakitara

Russian

Samoan

Scots Gaelic

Serbian

Serbo-Croatian

Sesotho

Setswana

Seychellois Creole

Shona

Sindhi

Sinhalese

Slovak

Slovenian

Somali

Spanish

Spanish (Latin American)

Sundanese

Swahili

Swedish

Tajik

Tamil

Tatar

Telugu

Thai

Tigrinya

Tonga

Tshiluba

Tumbuka

Turkish

Turkmen

Twi

Uighur

Ukrainian

Urdu

Uzbek

Vietnamese Download

Welsh

Wolof

Xhosa

Yiddish

Yoruba

Zulu

Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:00,420 --> 00:00:04,050 In this video, I will explain about the architecture of your office seven. 2 00:00:04,960 --> 00:00:07,270 Here is the architecture of YOLO v seven. 3 00:00:08,010 --> 00:00:14,760 In the backbone computational blocks YOLO v seven using e lan and e lan, but e line is only used on 4 00:00:14,820 --> 00:00:20,730 all of 76 c in the net of e seven sensors p p to sp exp. 5 00:00:21,480 --> 00:00:26,520 YOLO v seven also employs an optimize path aggregation network by incorporating e lan. 6 00:00:27,470 --> 00:00:28,040 On the head. 7 00:00:28,040 --> 00:00:32,189 YOLO V seven integrates three scales based on the neck with additional rib conf. 8 00:00:33,170 --> 00:00:33,860 The big ones. 9 00:00:33,860 --> 00:00:40,400 Architectural details are as follows CB's is mainly composed of convolution bells, normalization and 10 00:00:40,400 --> 00:00:42,050 similar activation function. 11 00:00:42,860 --> 00:00:46,970 KBS connects better normalization layer directly to convolutional layer. 12 00:00:47,740 --> 00:00:52,510 The purpose of this is to integrate the mean and variance of bats normalization into the bias and weight 13 00:00:52,510 --> 00:00:54,940 of convolutional layer two Inference states. 14 00:00:55,630 --> 00:01:00,460 YOLO v seven As previously stated implies, you learn in its computational blocks. 15 00:01:01,120 --> 00:01:02,800 Here are the details from Ilan. 16 00:01:03,580 --> 00:01:05,470 Next there is empty conf. 17 00:01:06,210 --> 00:01:09,990 The NP conflict is mainly divided into Max Pro and CBSE. 18 00:01:10,980 --> 00:01:12,180 Next on the neck. 19 00:01:12,970 --> 00:01:15,580 First, I will explain what spe pxp. 20 00:01:16,570 --> 00:01:19,000 Here are the details from SP PXP. 21 00:01:19,780 --> 00:01:25,720 The SP SP module adds the correct operation at the end based on the SP module, which is fused with 22 00:01:25,720 --> 00:01:28,060 the FITS before the SP module. 23 00:01:28,570 --> 00:01:30,790 It aims to enrich the feature information. 24 00:01:31,550 --> 00:01:33,920 The following are the details of the SP. 25 00:01:35,000 --> 00:01:39,880 YOLO v seven also uses an optimised path aggregation network that incorporates inline. 26 00:01:40,870 --> 00:01:44,860 Path aggregation network and student sample operation after sample. 27 00:01:45,750 --> 00:01:50,730 Power Aggregation network was chosen because of its ability to accurately preserve spatial information 28 00:01:50,730 --> 00:01:53,460 which aids in the proper localization of pixels. 29 00:01:54,090 --> 00:01:56,790 Next, here are the details of the event on the NEC. 30 00:01:57,760 --> 00:02:03,100 On the head of seven integrates three scales based on the neck and allocates three anchor boxes under 31 00:02:03,100 --> 00:02:03,820 each scale. 32 00:02:04,610 --> 00:02:10,820 By using three scales, it increases accuracy when detecting three object sizes small, medium and large. 33 00:02:11,510 --> 00:02:13,610 In addition, we call this also added. 34 00:02:14,510 --> 00:02:16,760 Web.com refused to change the number of channels. 35 00:02:16,760 --> 00:02:21,500 Output from webcomic has a certain difference between training and inference. 36 00:02:22,190 --> 00:02:26,450 There is an additive output of the three branches during training and the parameters of the branches 37 00:02:26,450 --> 00:02:29,120 are parameterized to the main brands during deployment. 38 00:02:30,510 --> 00:02:33,390 That's the explanation of the YOLO v seven architecture. 39 00:02:33,810 --> 00:02:35,280 See you in the next video. 3778