Would you like to inspect the original subtitles? These are the user uploaded subtitles that are being translated:
1
00:00:00,000 --> 00:00:01,125
GWENDOLYN STRIPLING: Hello.
2
00:00:01,125 --> 00:00:04,019
And welcome to Introduction
to Generative AI.
3
00:00:04,019 --> 00:00:06,690
My name is Dr.
Gwendolyn Stripling.
4
00:00:06,690 --> 00:00:09,540
And I am the
artificial intelligence
5
00:00:09,540 --> 00:00:14,620
technical curriculum developer
here at Google Cloud.
6
00:00:14,620 --> 00:00:18,490
In this course, you learn
to define generative AI,
7
00:00:18,490 --> 00:00:23,080
explain how generative AI works,
describe generative AI model
8
00:00:23,080 --> 00:00:28,390
types, and describe
generative AI applications.
9
00:00:28,390 --> 00:00:31,960
Generative AI is a type
of artificial intelligence
10
00:00:31,960 --> 00:00:36,170
technology that can produce
various types of content,
11
00:00:36,170 --> 00:00:41,050
including text, imagery,
audio, and synthetic data.
12
00:00:41,050 --> 00:00:44,090
But what is artificial
intelligence?
13
00:00:44,090 --> 00:00:46,030
Well, since we are
going to explore
14
00:00:46,030 --> 00:00:48,560
generative artificial
intelligence,
15
00:00:48,560 --> 00:00:51,130
let's provide a bit of context.
16
00:00:51,130 --> 00:00:53,140
So two very common
questions asked
17
00:00:53,140 --> 00:00:55,810
are what is artificial
intelligence
18
00:00:55,810 --> 00:01:00,220
and what is the difference
between AI and machine
19
00:01:00,220 --> 00:01:01,480
learning.
20
00:01:01,480 --> 00:01:05,860
One way to think about it
is that AI is a discipline,
21
00:01:05,860 --> 00:01:08,290
like physics for example.
22
00:01:08,290 --> 00:01:11,230
AI is a branch of
computer science
23
00:01:11,230 --> 00:01:15,010
that deals with the creation
of intelligence agents, which
24
00:01:15,010 --> 00:01:20,550
are systems that can reason,
and learn, and act autonomously.
25
00:01:20,550 --> 00:01:24,690
Essentially, AI has to do
with the theory and methods
26
00:01:24,690 --> 00:01:30,190
to build machines that
think and act like humans.
27
00:01:30,190 --> 00:01:33,020
In this discipline, we
have machine learning,
28
00:01:33,020 --> 00:01:35,590
which is a subfield of AI.
29
00:01:35,590 --> 00:01:40,210
It is a program or system that
trains a model from input data.
30
00:01:40,210 --> 00:01:42,820
That trained model can
make useful predictions
31
00:01:42,820 --> 00:01:45,880
from new or never
before seen data
32
00:01:45,880 --> 00:01:49,960
drawn from the same one
used to train the model.
33
00:01:49,960 --> 00:01:51,940
Machine learning
gives the computer
34
00:01:51,940 --> 00:01:56,470
the ability to learn without
explicit programming.
35
00:01:56,470 --> 00:01:59,440
Two of the most common classes
of machine learning models
36
00:01:59,440 --> 00:02:03,790
are unsupervised and
supervised ML models.
37
00:02:03,790 --> 00:02:05,740
The key difference
between the two
38
00:02:05,740 --> 00:02:09,669
is that, with supervised
models, we have labels.
39
00:02:09,669 --> 00:02:14,680
Labeled data is data that comes
with a tag like a name, a type,
40
00:02:14,680 --> 00:02:16,240
or a number.
41
00:02:16,240 --> 00:02:20,630
Unlabeled data is data
that comes with no tag.
42
00:02:20,630 --> 00:02:23,390
This graph is an
example of the problem
43
00:02:23,390 --> 00:02:26,210
that a supervised model
might try to solve.
44
00:02:26,210 --> 00:02:29,780
For example, let's say you
are the owner of a restaurant.
45
00:02:29,780 --> 00:02:32,370
You have historical
data of the bill amount
46
00:02:32,370 --> 00:02:36,140
and how much different people
tipped based on order type
47
00:02:36,140 --> 00:02:39,500
and whether it was
picked up or delivered.
48
00:02:39,500 --> 00:02:42,980
In supervised learning, the
model learns from past examples
49
00:02:42,980 --> 00:02:47,240
to predict future values,
in this case tips.
50
00:02:47,240 --> 00:02:49,970
So here the model uses
the total bill amount
51
00:02:49,970 --> 00:02:54,230
to predict the future tip amount
based on whether an order was
52
00:02:54,230 --> 00:02:57,020
picked up or delivered.
53
00:02:57,020 --> 00:02:58,840
This is an example
of the problem
54
00:02:58,840 --> 00:03:02,330
that an unsupervised
model might try to solve.
55
00:03:02,330 --> 00:03:05,080
So here you want to look
at tenure and income
56
00:03:05,080 --> 00:03:08,050
and then group or
cluster employees
57
00:03:08,050 --> 00:03:11,170
to see whether someone
is on the fast track.
58
00:03:11,170 --> 00:03:14,530
Unsupervised problems
are all about discovery,
59
00:03:14,530 --> 00:03:18,940
about looking at the raw data
and seeing if it naturally
60
00:03:18,940 --> 00:03:21,750
falls into groups.
61
00:03:21,750 --> 00:03:24,650
Let's get a little deeper
and show this graphically
62
00:03:24,650 --> 00:03:27,530
as understanding
these concepts are
63
00:03:27,530 --> 00:03:31,790
the foundation for your
understanding of generative AI.
64
00:03:31,790 --> 00:03:35,870
In supervised learning,
testing data values or x
65
00:03:35,870 --> 00:03:37,880
are input into the model.
66
00:03:37,880 --> 00:03:42,500
The model outputs a prediction
and compares that prediction
67
00:03:42,500 --> 00:03:45,860
to the training data
used to train the model.
68
00:03:45,860 --> 00:03:50,750
If the predicted test data
values and actual training data
69
00:03:50,750 --> 00:03:54,380
values are far apart,
that's called error.
70
00:03:54,380 --> 00:03:56,930
And the model tries
to reduce this error
71
00:03:56,930 --> 00:04:01,580
until the predicted and actual
values are closer together.
72
00:04:01,580 --> 00:04:05,710
This is a classic
optimization problem.
73
00:04:05,710 --> 00:04:07,410
Now that we've
explored the difference
74
00:04:07,410 --> 00:04:10,470
between artificial intelligence
and machine learning,
75
00:04:10,470 --> 00:04:13,860
and supervised and
unsupervised learning,
76
00:04:13,860 --> 00:04:16,529
let's briefly explore
where deep learning
77
00:04:16,529 --> 00:04:20,640
fits as a subset of
machine learning methods.
78
00:04:20,640 --> 00:04:22,590
While machine learning
is a broad field that
79
00:04:22,590 --> 00:04:25,080
encompasses many
different techniques,
80
00:04:25,080 --> 00:04:27,390
deep learning is a type
of machine learning
81
00:04:27,390 --> 00:04:29,790
that uses artificial
neural networks,
82
00:04:29,790 --> 00:04:32,970
allowing them to process more
complex patterns than machine
83
00:04:32,970 --> 00:04:34,470
learning.
84
00:04:34,470 --> 00:04:37,880
Artificial neural networks are
inspired by the human brain.
85
00:04:37,880 --> 00:04:41,690
They are made up of many
interconnected nodes or neurons
86
00:04:41,690 --> 00:04:46,130
that can learn to perform tasks
by processing data and making
87
00:04:46,130 --> 00:04:47,270
predictions.
88
00:04:47,270 --> 00:04:49,760
Deep learning models
typically have many layers
89
00:04:49,760 --> 00:04:52,430
of neurons, which
allows them to learn
90
00:04:52,430 --> 00:04:55,880
more complex patterns than
traditional machine learning
91
00:04:55,880 --> 00:04:56,720
models.
92
00:04:56,720 --> 00:05:00,840
And neural networks can use
both labeled and unlabeled data.
93
00:05:00,840 --> 00:05:03,530
This is called
semi-supervised learning.
94
00:05:03,530 --> 00:05:06,260
In semi-supervised
learning, a neural network
95
00:05:06,260 --> 00:05:09,140
is trained on a small
amount of labeled data
96
00:05:09,140 --> 00:05:12,590
and a large amount
of unlabeled data.
97
00:05:12,590 --> 00:05:15,200
The labeled data helps
the neural network
98
00:05:15,200 --> 00:05:17,750
to learn the basic
concepts of the task
99
00:05:17,750 --> 00:05:20,960
while the unlabeled data
helps the neural network
100
00:05:20,960 --> 00:05:24,540
to generalize to new examples.
101
00:05:24,540 --> 00:05:27,720
Now we finally get to
where generative AI
102
00:05:27,720 --> 00:05:30,390
fits into this AI discipline.
103
00:05:30,390 --> 00:05:33,570
Gen AI is a subset of
deep learning, which
104
00:05:33,570 --> 00:05:36,540
means it uses artificial
neural networks,
105
00:05:36,540 --> 00:05:40,410
can process both labeled
and unlabeled data using
106
00:05:40,410 --> 00:05:45,900
supervised, unsupervised,
and semi-supervised methods.
107
00:05:45,900 --> 00:05:51,320
Large language models are also
a subset of deep learning.
108
00:05:51,320 --> 00:05:54,310
Deep learning models, or machine
learning models in general,
109
00:05:54,310 --> 00:05:59,635
can be divided into two types,
generative and discriminative.
110
00:05:59,635 --> 00:06:02,290
A discriminative model
is a type of model
111
00:06:02,290 --> 00:06:06,520
that is used to classify or
predict labels for data points.
112
00:06:06,520 --> 00:06:08,080
Discriminative
models are typically
113
00:06:08,080 --> 00:06:10,960
trained on a data set
of labeled data points.
114
00:06:10,960 --> 00:06:14,530
And they learn the relationship
between the features
115
00:06:14,530 --> 00:06:17,350
of the data points
and the labels.
116
00:06:17,350 --> 00:06:20,480
Once a discriminative
model is trained,
117
00:06:20,480 --> 00:06:25,180
it can be used to predict the
label for new data points.
118
00:06:25,180 --> 00:06:28,390
A generative model
generates new data instances
119
00:06:28,390 --> 00:06:33,610
based on a learned probability
distribution of existing data.
120
00:06:33,610 --> 00:06:38,070
Thus generative models
generate new content.
121
00:06:38,070 --> 00:06:40,260
Take this example here.
122
00:06:40,260 --> 00:06:42,990
The discriminative model learns
the conditional probability
123
00:06:42,990 --> 00:06:45,780
distribution or the
probability of y,
124
00:06:45,780 --> 00:06:50,070
our output, given x, our
input, that this is a dog
125
00:06:50,070 --> 00:06:54,480
and classifies it as
a dog and not a cat.
126
00:06:54,480 --> 00:06:58,260
The generative model learns the
joint probability distribution
127
00:06:58,260 --> 00:07:02,100
or the probability of
x and y and predicts
128
00:07:02,100 --> 00:07:05,250
the conditional probability
that this is a dog
129
00:07:05,250 --> 00:07:09,430
and can then generate
a picture of a dog.
130
00:07:09,430 --> 00:07:11,970
So to summarize,
generative models
131
00:07:11,970 --> 00:07:16,530
can generate new data instances
while discriminative models
132
00:07:16,530 --> 00:07:21,470
discriminate between different
kinds of data instances.
133
00:07:21,470 --> 00:07:23,600
The top image shows
a traditional machine
134
00:07:23,600 --> 00:07:25,580
learning model which
attempts to learn
135
00:07:25,580 --> 00:07:28,880
the relationship between
the data and the label,
136
00:07:28,880 --> 00:07:30,590
or what you want to predict.
137
00:07:30,590 --> 00:07:33,470
The bottom image shows
a generative AI model
138
00:07:33,470 --> 00:07:36,980
which attempts to learn
patterns on content so that it
139
00:07:36,980 --> 00:07:40,100
can generate new content.
140
00:07:40,100 --> 00:07:43,840
A good way to distinguish
what is gen AI and what is not
141
00:07:43,840 --> 00:07:46,370
is shown in this illustration.
142
00:07:46,370 --> 00:07:51,380
It is not gen AI when the
output, or y, or label is
143
00:07:51,380 --> 00:07:55,760
a number or a class, for
example spam or not spam,
144
00:07:55,760 --> 00:07:57,620
or a probability.
145
00:07:57,620 --> 00:08:03,170
It is gen AI when the output is
natural language, like speech
146
00:08:03,170 --> 00:08:08,610
or text, an image or
audio, for example.
147
00:08:08,610 --> 00:08:12,030
Visualizing this mathematically
would look like this.
148
00:08:12,030 --> 00:08:14,340
If you haven't seen
this for a while,
149
00:08:14,340 --> 00:08:18,900
the y is equal to f of
x equation calculates
150
00:08:18,900 --> 00:08:23,280
the dependent output of a
process given different inputs.
151
00:08:23,280 --> 00:08:25,380
The y stands for
the model output.
152
00:08:25,380 --> 00:08:29,070
The f embodies the function
used in the calculation.
153
00:08:29,070 --> 00:08:33,280
And the x represents the input
or inputs used for the formula.
154
00:08:33,280 --> 00:08:36,659
So the model output is a
function of all the inputs.
155
00:08:36,659 --> 00:08:41,400
If the y is the number,
like predicted sales,
156
00:08:41,400 --> 00:08:43,440
it is not gen AI.
157
00:08:43,440 --> 00:08:46,740
If y is a sentence,
like define sales,
158
00:08:46,740 --> 00:08:51,480
it is generative as the question
would elicit a text response.
159
00:08:51,480 --> 00:08:55,710
The response would be based
on all the massive large data
160
00:08:55,710 --> 00:08:59,250
the model was
already trained on.
161
00:08:59,250 --> 00:09:03,450
To summarize at a high level,
the traditional, classical
162
00:09:03,450 --> 00:09:06,000
supervised and unsupervised
learning process
163
00:09:06,000 --> 00:09:09,930
takes training code and
label data to build a model.
164
00:09:09,930 --> 00:09:12,420
Depending on the
use case or problem,
165
00:09:12,420 --> 00:09:15,300
the model can give
you a prediction.
166
00:09:15,300 --> 00:09:18,630
It can classify something
or cluster something.
167
00:09:18,630 --> 00:09:22,440
We use this example to show
you how much more robust
168
00:09:22,440 --> 00:09:25,290
the gen AI process is.
169
00:09:25,290 --> 00:09:29,070
The gen AI process can take
training code, label data,
170
00:09:29,070 --> 00:09:31,530
and unlabeled data
of all data types
171
00:09:31,530 --> 00:09:33,300
and build a foundation model.
172
00:09:33,300 --> 00:09:36,420
The foundation model can
then generate new content.
173
00:09:36,420 --> 00:09:42,600
For example, text, code,
images, audio, video, et cetera.
174
00:09:42,600 --> 00:09:45,180
We've come a long away from
traditional programming
175
00:09:45,180 --> 00:09:48,300
to neural networks
to generative models.
176
00:09:48,300 --> 00:09:50,280
In traditional
programming, we used
177
00:09:50,280 --> 00:09:53,340
to have to hard code the rules
for distinguishing a cat--
178
00:09:53,340 --> 00:10:00,390
the type, animal; legs,
four; ears, two; fur, yes;
179
00:10:00,390 --> 00:10:03,060
likes yarn and catnip.
180
00:10:03,060 --> 00:10:05,310
In the wave of
neural networks, we
181
00:10:05,310 --> 00:10:07,920
could give the network
pictures of cats and dogs
182
00:10:07,920 --> 00:10:12,770
and ask is this a cat and
it would predict a cat.
183
00:10:12,770 --> 00:10:15,740
In the generative
wave, we as users
184
00:10:15,740 --> 00:10:18,770
can generate our own
content, whether it
185
00:10:18,770 --> 00:10:23,090
be text, images, audio,
video, et cetera, for example
186
00:10:23,090 --> 00:10:26,360
models like PaLM or
Pathways Language Model,
187
00:10:26,360 --> 00:10:30,140
or LAMBDA, Language Model
for Dialogue Applications,
188
00:10:30,140 --> 00:10:33,440
ingest very, very large data
from the multiple sources
189
00:10:33,440 --> 00:10:36,290
across the internet and
build foundation language
190
00:10:36,290 --> 00:10:40,790
models we can use simply
by asking a question,
191
00:10:40,790 --> 00:10:43,730
whether typing it into
a prompt or verbally
192
00:10:43,730 --> 00:10:45,900
talking into the prompt itself.
193
00:10:45,900 --> 00:10:48,800
So when you ask it
what's a cat, it
194
00:10:48,800 --> 00:10:52,610
can give you everything it
has learned about a cat.
195
00:10:52,610 --> 00:10:55,010
Now we come to our
formal definition.
196
00:10:55,010 --> 00:10:57,530
What is generative AI?
197
00:10:57,530 --> 00:11:00,020
Gen AI is a type of
artificial intelligence
198
00:11:00,020 --> 00:11:02,900
that creates new content
based on what it has
199
00:11:02,900 --> 00:11:05,150
learned from existing content.
200
00:11:05,150 --> 00:11:07,520
The process of learning
from existing content
201
00:11:07,520 --> 00:11:10,580
is called training and
results in the creation
202
00:11:10,580 --> 00:11:13,880
of a statistical model
when given a prompt.
203
00:11:13,880 --> 00:11:18,260
AI uses the model to predict
what an expected response might
204
00:11:18,260 --> 00:11:21,980
be and this generates
new content.
205
00:11:21,980 --> 00:11:24,350
Essentially, it learns
the underlying structure
206
00:11:24,350 --> 00:11:26,930
of the data and
can then generate
207
00:11:26,930 --> 00:11:31,430
new samples that are similar
to the data it was trained on.
208
00:11:31,430 --> 00:11:35,870
As previously mentioned, a
generative language model
209
00:11:35,870 --> 00:11:38,720
can take what it has learned
from the examples it's
210
00:11:38,720 --> 00:11:41,630
been shown and create
something entirely new
211
00:11:41,630 --> 00:11:43,400
based on that information.
212
00:11:43,400 --> 00:11:47,810
Large language models are
one type of generative AI
213
00:11:47,810 --> 00:11:52,310
since they generate novel
combinations of text
214
00:11:52,310 --> 00:11:56,180
in the form of natural
sounding language.
215
00:11:56,180 --> 00:11:59,150
A generative image
model takes an image
216
00:11:59,150 --> 00:12:04,500
as input and can output text,
another image, or video.
217
00:12:04,500 --> 00:12:07,340
For example, under
the output text,
218
00:12:07,340 --> 00:12:09,680
you can get visual
question answering
219
00:12:09,680 --> 00:12:14,780
while under output image, an
image completion is generated.
220
00:12:14,780 --> 00:12:19,310
And under output video,
animation is generated.
221
00:12:19,310 --> 00:12:22,760
A generative language
model takes text as input
222
00:12:22,760 --> 00:12:27,590
and can output more text, an
image, audio, or decisions.
223
00:12:27,590 --> 00:12:29,690
For example, under
the output text,
224
00:12:29,690 --> 00:12:31,400
question answering is generated.
225
00:12:31,400 --> 00:12:35,740
And under output image,
a video is generated.
226
00:12:35,740 --> 00:12:38,770
We've stated that generative
language models learn
227
00:12:38,770 --> 00:12:41,740
about patterns and language
through training data,
228
00:12:41,740 --> 00:12:46,510
then, given some text, they
predict what comes next.
229
00:12:46,510 --> 00:12:50,740
Thus generative language models
are pattern matching systems.
230
00:12:50,740 --> 00:12:54,890
They learn about patterns
based on the data you provide.
231
00:12:54,890 --> 00:12:57,170
Here is an example.
232
00:12:57,170 --> 00:12:59,930
Based on things it's learned
from its training data,
233
00:12:59,930 --> 00:13:03,690
it offers predictions of how
to complete this sentence,
234
00:13:03,690 --> 00:13:09,430
I'm making a sandwich with
peanut butter and jelly.
235
00:13:09,430 --> 00:13:12,100
Here is the same
example using Bard,
236
00:13:12,100 --> 00:13:15,400
which is trained on a
massive amount of text data
237
00:13:15,400 --> 00:13:17,710
and is able to
communicate and generate
238
00:13:17,710 --> 00:13:21,730
humanlike text in response
to a wide range of prompts
239
00:13:21,730 --> 00:13:23,530
and questions.
240
00:13:23,530 --> 00:13:25,850
Here is another example.
241
00:13:25,850 --> 00:13:29,050
The meaning of life is--
242
00:13:29,050 --> 00:13:32,020
and Bart gives you
a contextual answer
243
00:13:32,020 --> 00:13:35,680
and then shows the highest
probability response.
244
00:13:35,680 --> 00:13:40,330
The power of generative AI comes
from the use of transformers.
245
00:13:40,330 --> 00:13:43,180
Transformers produced
a 2018 revolution
246
00:13:43,180 --> 00:13:45,400
in natural language processing.
247
00:13:45,400 --> 00:13:47,950
At a high level, a
transformer model
248
00:13:47,950 --> 00:13:50,630
consists of an
encoder and decoder.
249
00:13:50,630 --> 00:13:53,020
The encoder encodes
the input sequence
250
00:13:53,020 --> 00:13:55,600
and passes it to
the decoder, which
251
00:13:55,600 --> 00:13:58,720
learns how to decode
the representation
252
00:13:58,720 --> 00:14:01,150
for a relevant task.
253
00:14:01,150 --> 00:14:06,340
In transformers, hallucinations
are words or phrases
254
00:14:06,340 --> 00:14:09,190
that are generated
by the model that
255
00:14:09,190 --> 00:14:13,660
are often nonsensical or
grammatically incorrect.
256
00:14:13,660 --> 00:14:17,180
Hallucinations can be caused
by a number of factors,
257
00:14:17,180 --> 00:14:21,430
including the model is not
trained on enough data,
258
00:14:21,430 --> 00:14:25,630
or the model is trained
on noisy or dirty data,
259
00:14:25,630 --> 00:14:29,440
or the model is not
given enough context,
260
00:14:29,440 --> 00:14:33,010
or the model is not
given enough constraints.
261
00:14:33,010 --> 00:14:35,530
Hallucinations can be a
problem for transformers
262
00:14:35,530 --> 00:14:40,030
because they can make the output
text difficult to understand.
263
00:14:40,030 --> 00:14:41,860
They can also make
the model more
264
00:14:41,860 --> 00:14:46,690
likely to generate incorrect
or misleading information.
265
00:14:46,690 --> 00:14:49,810
A prompt is a
short piece of text
266
00:14:49,810 --> 00:14:53,410
that is given to the large
language model as input.
267
00:14:53,410 --> 00:14:57,580
And it can be used to control
the output of the model
268
00:14:57,580 --> 00:14:59,320
in a variety of ways.
269
00:14:59,320 --> 00:15:01,900
Prompt design is the
process of creating
270
00:15:01,900 --> 00:15:04,990
a prompt that will
generate the desired output
271
00:15:04,990 --> 00:15:07,650
from a large language model.
272
00:15:07,650 --> 00:15:11,730
As previously mentioned,
gen AI depends a lot
273
00:15:11,730 --> 00:15:14,760
on the training data that
you have fed into it.
274
00:15:14,760 --> 00:15:18,270
And it analyzes the patterns
and structures of the input data
275
00:15:18,270 --> 00:15:20,220
and thus learns.
276
00:15:20,220 --> 00:15:23,880
But with access to a browser
based prompt, you, the user,
277
00:15:23,880 --> 00:15:27,420
can generate your own content.
278
00:15:27,420 --> 00:15:31,570
We've shown illustrations of the
types of input based upon data.
279
00:15:31,570 --> 00:15:33,960
Here are the
associated model types.
280
00:15:33,960 --> 00:15:35,640
Text-to-text.
281
00:15:35,640 --> 00:15:38,700
Text-to-text models take
a natural language input
282
00:15:38,700 --> 00:15:40,710
and produces a text output.
283
00:15:40,710 --> 00:15:43,230
These models are trained
to learn the mapping
284
00:15:43,230 --> 00:15:45,420
between a pair of text, e.g.
285
00:15:45,420 --> 00:15:49,740
for example, translation
from one language to another.
286
00:15:49,740 --> 00:15:50,940
Text-to-image.
287
00:15:50,940 --> 00:15:54,930
Text-to-image models are trained
on a large set of images,
288
00:15:54,930 --> 00:15:58,290
each captioned with a
short text description.
289
00:15:58,290 --> 00:16:01,440
Diffusion is one method
used to achieve this.
290
00:16:01,440 --> 00:16:04,470
Text-to-video and text-to-3D.
291
00:16:04,470 --> 00:16:08,250
Text-to-video models aim to
generate a video representation
292
00:16:08,250 --> 00:16:09,900
from text input.
293
00:16:09,900 --> 00:16:13,920
The input text can be anything
from a single sentence
294
00:16:13,920 --> 00:16:15,420
to a full script.
295
00:16:15,420 --> 00:16:20,220
And the output is a video that
corresponds to the input text.
296
00:16:20,220 --> 00:16:23,790
Similarly, text-to-3D
models generate
297
00:16:23,790 --> 00:16:28,200
three dimensional objects that
correspond to a user's text
298
00:16:28,200 --> 00:16:29,610
description.
299
00:16:29,610 --> 00:16:34,910
For example, this can be used
in games or other 3D worlds.
300
00:16:34,910 --> 00:16:36,500
Text-to-task.
301
00:16:36,500 --> 00:16:41,120
Text-to-task models are trained
to perform a defined task
302
00:16:41,120 --> 00:16:44,120
or action based on text input.
303
00:16:44,120 --> 00:16:46,640
This task can be a
wide range of actions
304
00:16:46,640 --> 00:16:50,900
such as answering a question,
performing a search,
305
00:16:50,900 --> 00:16:55,230
making a prediction, or
taking some sort of action.
306
00:16:55,230 --> 00:16:58,220
For example, a
text-to-task model
307
00:16:58,220 --> 00:17:03,560
could be trained to navigate a
web UI or make changes to a doc
308
00:17:03,560 --> 00:17:05,000
through the GUI.
309
00:17:05,000 --> 00:17:08,690
A foundation model is a
large AI model pre-trained
310
00:17:08,690 --> 00:17:13,339
on a vast quantity of data
designed to be adapted or fine
311
00:17:13,339 --> 00:17:17,270
tuned to a wide range
of downstream tasks,
312
00:17:17,270 --> 00:17:22,310
such as sentiment analysis,
image captioning, and object
313
00:17:22,310 --> 00:17:23,660
recognition.
314
00:17:23,660 --> 00:17:26,240
Foundation models
have the potential
315
00:17:26,240 --> 00:17:29,330
to revolutionize many
industries, including
316
00:17:29,330 --> 00:17:32,660
health care, finance,
and customer service.
317
00:17:32,660 --> 00:17:36,410
They can be used to
detect fraud and provide
318
00:17:36,410 --> 00:17:38,980
personalized customer support.
319
00:17:38,980 --> 00:17:41,890
Vertex AI offers a
model garden that
320
00:17:41,890 --> 00:17:43,930
includes foundation models.
321
00:17:43,930 --> 00:17:45,880
The language foundation
models include
322
00:17:45,880 --> 00:17:48,730
PaLM API for chat and text.
323
00:17:48,730 --> 00:17:52,790
The vision foundation models
includes stable diffusion,
324
00:17:52,790 --> 00:17:55,900
which has been shown to
be effective at generating
325
00:17:55,900 --> 00:18:00,330
high quality images
from text descriptions.
326
00:18:00,330 --> 00:18:01,830
Let's say you have
a use case where
327
00:18:01,830 --> 00:18:05,250
you need to gather sentiments
about how your customers are
328
00:18:05,250 --> 00:18:07,890
feeling about your
product or service.
329
00:18:07,890 --> 00:18:12,720
You can use the classification
task sentiment analysis task
330
00:18:12,720 --> 00:18:15,300
model for just that purpose.
331
00:18:15,300 --> 00:18:19,710
And what if you needed to
perform occupancy analytics?
332
00:18:19,710 --> 00:18:23,370
There is a task model
for your use case.
333
00:18:23,370 --> 00:18:27,220
Shown here are gen
AI applications.
334
00:18:27,220 --> 00:18:30,660
Let's look at an example
of code generation
335
00:18:30,660 --> 00:18:34,410
shown in the second block
under code at the top.
336
00:18:34,410 --> 00:18:39,360
In this example, I've input a
code file conversion problem,
337
00:18:39,360 --> 00:18:41,850
converting from Python to JSON.
338
00:18:41,850 --> 00:18:42,720
I use Bard.
339
00:18:42,720 --> 00:18:46,480
And I insert into the
prompt box the following.
340
00:18:46,480 --> 00:18:50,520
I have a Pandas DataFrame with
two columns, one with the file
341
00:18:50,520 --> 00:18:54,300
name and one with the hour
in which it is generated.
342
00:18:54,300 --> 00:18:57,420
I'm trying to convert
this into a JSON file
343
00:18:57,420 --> 00:19:00,570
in the format shown onscreen.
344
00:19:00,570 --> 00:19:06,720
Bard returns the steps I need
to do this and the code snippet.
345
00:19:06,720 --> 00:19:10,110
And here my output
is in a JSON format.
346
00:19:10,110 --> 00:19:11,220
It gets better.
347
00:19:11,220 --> 00:19:16,380
I happen to be using Google's
free, browser-based Jupyter
348
00:19:16,380 --> 00:19:18,540
Notebook, known as Colab.
349
00:19:18,540 --> 00:19:23,350
And I simply export the
Python code to Google's Colab.
350
00:19:23,350 --> 00:19:26,230
To summarize, Bart
code generation
351
00:19:26,230 --> 00:19:29,470
can help you debug your
lines of source code,
352
00:19:29,470 --> 00:19:31,870
explain your code
to you line by line,
353
00:19:31,870 --> 00:19:34,960
craft SQL queries
for your database,
354
00:19:34,960 --> 00:19:37,940
translate code from one
language to another,
355
00:19:37,940 --> 00:19:42,970
and generate documentation
and tutorials for source code.
356
00:19:42,970 --> 00:19:47,930
Generative AI Studio lets you
quickly explore and customize
357
00:19:47,930 --> 00:19:51,760
gen AI models that you can
leverage in your applications
358
00:19:51,760 --> 00:19:53,080
on Google Cloud.
359
00:19:53,080 --> 00:19:57,520
Generative AI Studio helps
developers create and deploy
360
00:19:57,520 --> 00:20:02,680
Gen AI models by providing a
variety of tools and resources
361
00:20:02,680 --> 00:20:05,740
that make it easy
to get started.
362
00:20:05,740 --> 00:20:09,410
For example, there's a
library of pre-trained models.
363
00:20:09,410 --> 00:20:12,230
There is a tool for
fine tuning models.
364
00:20:12,230 --> 00:20:15,490
There is a tool for deploying
models to production.
365
00:20:15,490 --> 00:20:18,340
And there is a community
forum for developers
366
00:20:18,340 --> 00:20:21,700
to share ideas and collaborate.
367
00:20:21,700 --> 00:20:24,460
Generative AI App
Builder lets you
368
00:20:24,460 --> 00:20:28,450
create gen AI apps without
having to write any code.
369
00:20:28,450 --> 00:20:31,990
Gen AI App Builder has a
drag and drop interface
370
00:20:31,990 --> 00:20:35,120
that makes it easy to
design and build apps.
371
00:20:35,120 --> 00:20:36,580
It has a visual
editor that makes
372
00:20:36,580 --> 00:20:39,310
it easy to create
and edit app content.
373
00:20:39,310 --> 00:20:40,840
It has a built-in
search engine that
374
00:20:40,840 --> 00:20:43,930
allows users to search for
information within the app.
375
00:20:43,930 --> 00:20:46,360
And it has a
conversational AI Engine
376
00:20:46,360 --> 00:20:49,600
that helps users to
interact with the app using
377
00:20:49,600 --> 00:20:51,400
natural language.
378
00:20:51,400 --> 00:20:55,450
You can create your own digital
assistants, custom search
379
00:20:55,450 --> 00:20:59,770
engines, knowledge bases,
training applications,
380
00:20:59,770 --> 00:21:01,950
and much more.
381
00:21:01,950 --> 00:21:05,400
PaLM API lets you
test and experiment
382
00:21:05,400 --> 00:21:09,180
with Google's large language
models and gen AI tools.
383
00:21:09,180 --> 00:21:11,700
To make prototyping quick
and more accessible,
384
00:21:11,700 --> 00:21:15,330
developers can integrate
PaLM API with Maker suite
385
00:21:15,330 --> 00:21:20,520
and use it to access the
API using a graphical user
386
00:21:20,520 --> 00:21:21,450
interface.
387
00:21:21,450 --> 00:21:25,590
The suite includes a number of
different tools such as a model
388
00:21:25,590 --> 00:21:29,730
training tool, a model
deployment tool, and a model
389
00:21:29,730 --> 00:21:31,650
monitoring tool.
390
00:21:31,650 --> 00:21:35,270
The model training tool helps
developers train ML models
391
00:21:35,270 --> 00:21:38,700
on their data using
different algorithms.
392
00:21:38,700 --> 00:21:42,650
The model deployment tool helps
developers deploy ML models
393
00:21:42,650 --> 00:21:48,390
to production with a number of
different deployment options.
394
00:21:48,390 --> 00:21:51,750
The model monitoring
tool helps developers
395
00:21:51,750 --> 00:21:54,510
monitor the performance
of their ML models
396
00:21:54,510 --> 00:21:58,680
in production using a
dashboard and a number
397
00:21:58,680 --> 00:22:01,810
of different metrics.
398
00:22:01,810 --> 00:22:04,500
Thank you for watching
our course, Introduction
399
00:22:04,500 --> 00:22:07,280
to Generative AI.
31383
Can't find what you're looking for?
Get subtitles in any language from opensubtitles.com, and translate them here.