All language subtitles for 5. Split the dataset

af Afrikaans
sq Albanian
am Amharic
ar Arabic
hy Armenian
az Azerbaijani
eu Basque
be Belarusian
bn Bengali
bs Bosnian
bg Bulgarian
ca Catalan
ceb Cebuano
ny Chichewa
zh-CN Chinese (Simplified)
zh-TW Chinese (Traditional)
co Corsican
hr Croatian
cs Czech
da Danish
nl Dutch
en English
eo Esperanto
et Estonian
tl Filipino
fi Finnish
fr French Download
fy Frisian
gl Galician
ka Georgian
de German
el Greek
gu Gujarati
ht Haitian Creole
ha Hausa
haw Hawaiian
iw Hebrew
hi Hindi
hmn Hmong
hu Hungarian
is Icelandic
ig Igbo
id Indonesian
ga Irish
it Italian
ja Japanese
jw Javanese
kn Kannada
kk Kazakh
km Khmer
ko Korean
ku Kurdish (Kurmanji)
ky Kyrgyz
lo Lao
la Latin
lv Latvian
lt Lithuanian
lb Luxembourgish
mk Macedonian
mg Malagasy
ms Malay
ml Malayalam
mt Maltese
mi Maori
mr Marathi
mn Mongolian
my Myanmar (Burmese)
ne Nepali
no Norwegian
ps Pashto
fa Persian
pl Polish
pt Portuguese
pa Punjabi
ro Romanian
ru Russian
sm Samoan
gd Scots Gaelic
sr Serbian
st Sesotho
sn Shona
sd Sindhi
si Sinhala
sk Slovak
sl Slovenian
so Somali
es Spanish
su Sundanese
sw Swahili
sv Swedish
tg Tajik
ta Tamil
te Telugu
th Thai
tr Turkish
uk Ukrainian
ur Urdu
uz Uzbek
vi Vietnamese
cy Welsh
xh Xhosa
yi Yiddish
yo Yoruba
zu Zulu
or Odia (Oriya)
rw Kinyarwanda
tk Turkmen
tt Tatar
ug Uyghur
Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated: 1 00:00:11,290 --> 00:00:13,300 Hi, everyone, and welcome in this new video. 2 00:00:13,900 --> 00:00:19,150 In this video, we're going to create our different sent to do our analysis. 3 00:00:20,830 --> 00:00:29,530 First, we need to define how much data we want in the sense and amateur data we want in the test because 4 00:00:29,890 --> 00:00:32,140 our algorithm needs it. 5 00:00:32,770 --> 00:00:42,700 The train sets to calibrate each coefficient and a test set to test the performance of this algorithm 6 00:00:43,330 --> 00:00:45,370 on UNO data. 7 00:00:46,270 --> 00:01:00,700 So two of the variable splits we need to define the threshold to separate the train test and the test. 8 00:01:01,330 --> 00:01:05,230 Usually, we take that officials at 80 percent. 9 00:01:06,040 --> 00:01:16,330 It means that 80 percent of the data all used to train the algorithm and 20 percent of the data are 10 00:01:16,330 --> 00:01:19,610 used to test the performance of this argument. 11 00:01:20,590 --> 00:01:25,210 So to do it, we take the integral parts of. 12 00:01:32,040 --> 00:01:37,410 The length of or that a friend multiplied by 90 percent. 13 00:01:40,960 --> 00:01:46,300 Then we need to define that which are used as features. 14 00:01:46,420 --> 00:01:53,110 So the variable that we are going to use to predict all targets. 15 00:01:53,920 --> 00:02:05,950 So this dataframe is called X and we need to define X for the train and X for the test. 16 00:02:06,640 --> 00:02:07,000 So. 17 00:02:12,680 --> 00:02:23,290 For the train and the test for the X, we want to take the semi, the moving relativity and the RSI. 18 00:02:29,200 --> 00:02:33,100 And then we have the property you look, we split. 19 00:02:36,390 --> 00:02:37,050 The data. 20 00:02:41,310 --> 00:02:51,860 And we do exactly the same thing for the white train, which contain the targets for the train sets. 21 00:02:52,900 --> 00:02:53,470 Which is. 22 00:02:58,090 --> 00:03:09,100 The returns of the asset and also we need to split taking only the train data. 23 00:03:12,180 --> 00:03:20,290 Then we need to do exactly the same thing for the extremists and the widest. 24 00:03:23,400 --> 00:03:33,060 The only difference is that now we have taken the data from the beginning to the split. 25 00:03:35,220 --> 00:03:46,320 And for the tests, it's we need to take the data from the street to the end of the dataset. 26 00:03:47,870 --> 00:03:52,400 And let me just show you some of our. 27 00:03:52,850 --> 00:03:57,200 To be sure that you are really understand what we have done here. 28 00:03:59,330 --> 00:04:05,750 So, for example, extreme contain all the necessary data to predict 29 00:04:09,410 --> 00:04:18,860 wine train, which is the returns of the assets, and we have to do exactly the same thing for the tests. 30 00:04:26,300 --> 00:04:35,440 And as we can see, we have exactly the same data, but not in the same time because we have this database, 31 00:04:35,450 --> 00:04:37,700 we are going to train our algorithm. 32 00:04:38,060 --> 00:04:45,590 And with this database, we're going to test or trading strategy following our linear regression prediction. 3312

Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.