subtitlecat.com

All language subtitles for 5. Split the dataset

Afrikaans

Akan

Albanian

Amharic

Arabic

Armenian

Azerbaijani

Basque

Belarusian

Bemba

Bengali

Bihari

Bosnian

Breton

Bulgarian

Cambodian

Catalan

Cebuano

Cherokee

Chichewa

Chinese (Simplified)

Chinese (Traditional)

Corsican

Croatian

Czech

Danish

Dutch

English

Esperanto

Estonian

Ewe

Faroese

Filipino

Finnish

French Download

Frisian

Galician

Georgian

German

Greek

Guarani

Gujarati

Haitian Creole

Hausa

Hawaiian

Hebrew

Hindi

Hmong

Hungarian

Icelandic

Igbo

Indonesian

Interlingua

Irish

Italian

Japanese

Javanese

Kannada

Kazakh

Kinyarwanda

Kirundi

Kongo

Korean

Krio (Sierra Leone)

Kurdish

Kurdish (Soranî)

Kyrgyz

Laothian

Latin

Latvian

Lingala

Lithuanian

Lozi

Luganda

Luo

Luxembourgish

Macedonian

Malagasy

Malay

Malayalam

Maltese

Maori

Marathi

Mauritian Creole

Moldavian

Mongolian

Myanmar (Burmese)

Montenegrin

Nepali

Nigerian Pidgin

Northern Sotho

Norwegian

Norwegian (Nynorsk)

Occitan

Oriya

Oromo

Pashto

Persian

Polish

Portuguese (Brazil)

Portuguese (Portugal)

Punjabi

Quechua

Romanian

Romansh

Runyakitara

Russian

Samoan

Scots Gaelic

Serbian

Serbo-Croatian

Sesotho

Setswana

Seychellois Creole

Shona

Sindhi

Sinhalese

Slovak

Slovenian

Somali

Spanish

Spanish (Latin American)

Sundanese

Swahili

Swedish

Tajik

Tamil

Tatar

Telugu

Thai

Tigrinya

Tonga

Tshiluba

Tumbuka

Turkish

Turkmen

Twi

Uighur

Ukrainian

Urdu

Uzbek

Vietnamese

Welsh

Wolof

Xhosa

Yiddish

Yoruba

Zulu

Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:11,290 --> 00:00:13,300 Hi, everyone, and welcome in this new video. 2 00:00:13,900 --> 00:00:19,150 In this video, we're going to create our different sent to do our analysis. 3 00:00:20,830 --> 00:00:29,530 First, we need to define how much data we want in the sense and amateur data we want in the test because 4 00:00:29,890 --> 00:00:32,140 our algorithm needs it. 5 00:00:32,770 --> 00:00:42,700 The train sets to calibrate each coefficient and a test set to test the performance of this algorithm 6 00:00:43,330 --> 00:00:45,370 on UNO data. 7 00:00:46,270 --> 00:01:00,700 So two of the variable splits we need to define the threshold to separate the train test and the test. 8 00:01:01,330 --> 00:01:05,230 Usually, we take that officials at 80 percent. 9 00:01:06,040 --> 00:01:16,330 It means that 80 percent of the data all used to train the algorithm and 20 percent of the data are 10 00:01:16,330 --> 00:01:19,610 used to test the performance of this argument. 11 00:01:20,590 --> 00:01:25,210 So to do it, we take the integral parts of. 12 00:01:32,040 --> 00:01:37,410 The length of or that a friend multiplied by 90 percent. 13 00:01:40,960 --> 00:01:46,300 Then we need to define that which are used as features. 14 00:01:46,420 --> 00:01:53,110 So the variable that we are going to use to predict all targets. 15 00:01:53,920 --> 00:02:05,950 So this dataframe is called X and we need to define X for the train and X for the test. 16 00:02:06,640 --> 00:02:07,000 So. 17 00:02:12,680 --> 00:02:23,290 For the train and the test for the X, we want to take the semi, the moving relativity and the RSI. 18 00:02:29,200 --> 00:02:33,100 And then we have the property you look, we split. 19 00:02:36,390 --> 00:02:37,050 The data. 20 00:02:41,310 --> 00:02:51,860 And we do exactly the same thing for the white train, which contain the targets for the train sets. 21 00:02:52,900 --> 00:02:53,470 Which is. 22 00:02:58,090 --> 00:03:09,100 The returns of the asset and also we need to split taking only the train data. 23 00:03:12,180 --> 00:03:20,290 Then we need to do exactly the same thing for the extremists and the widest. 24 00:03:23,400 --> 00:03:33,060 The only difference is that now we have taken the data from the beginning to the split. 25 00:03:35,220 --> 00:03:46,320 And for the tests, it's we need to take the data from the street to the end of the dataset. 26 00:03:47,870 --> 00:03:52,400 And let me just show you some of our. 27 00:03:52,850 --> 00:03:57,200 To be sure that you are really understand what we have done here. 28 00:03:59,330 --> 00:04:05,750 So, for example, extreme contain all the necessary data to predict 29 00:04:09,410 --> 00:04:18,860 wine train, which is the returns of the assets, and we have to do exactly the same thing for the tests. 30 00:04:26,300 --> 00:04:35,440 And as we can see, we have exactly the same data, but not in the same time because we have this database, 31 00:04:35,450 --> 00:04:37,700 we are going to train our algorithm. 32 00:04:38,060 --> 00:04:45,590 And with this database, we're going to test or trading strategy following our linear regression prediction. 3312