๐ข Understanding AI Minds
There are a few simple things you should know about different AIs and how they work before you start reading the rest of the course.
Different AIsโ
Thousands, if not millions of AIs exist. Some are better than others. Different AIs can produce images, music, text, and even videos. Note that these are all generative AIs, basically AIs that make things. There are also discriminative AIs, which are AIs that classify things. For example, you can use an image classifier to tell if an image is a cat or a dog. We will not be using any discriminative AIs in this course.
Only a few generative AIs are currently advanced enough to be especially useful for prompt engineering. We mainly use GPT-3 and ChatGPT in this course. As we mentioned in the last page, ChatGPT is a chat bot, while GPT-3 is not. They will usually produce different responses when asked the same question. If you are a developer, I recommend using GPT-3, since it is more reproducible. If you are a non-developer, I recommend using ChatGPT, since it is easier to use. Most techniques in this course can be applied to both AIs. However, some of them will be GPT-3 only, so we encourage you to use GPT-3 if you want to use all of the techniques in this course.
We will also use Stable Diffusion and DALLE in the image generation section. See more relevant AIs here.
How these AIs workโ
This section describes aspects of popular generative text AIs. These AIs have brains that are made up of billions of artificial neurons. The way these neurons are structured is called a transformer architecture. It is a fairly complex type of neural network. What you should understand is:
- These AIs are just math functions. Instead of , they are more like f(thousands of variables) = thousands of possible outputs.
- These AIs understand sentences by breaking them into words/subwords called tokens (e.g. the AI might read
I don't like
as"I", "don", "'t" "like"
). Each token is then converted into a list of numbers, so the AI can process it. - These AIs predict the next word/token in the sentence based on the previous words/tokens (e.g. the AI might predict
apples
afterI don't like
). Each token they write is based on the previous tokens they have seen and written; every time they write a new token, they pause to think about what the next token should be. - These AIs look at every token at the same time. They don't read left to right, or right to left like humans do.
Please understand that the words "think", "brain", and "neuron" are zoomorphisms, which are essentially metaphors for what the model is actually doing. These models are not really thinking, they are just math functions. They are not actually brains, they are just artificial neural networks. They are not actually biological neurons, they are just numbers.
This is an area of active research and philosophizing. This description is rather cynical about their nature and is meant to temper popular media depiction of AIs as beings that think/act like humans. This being said, if you want to anthropomorphize the AI, go ahead! It seems that most people do this and it may even be helpful for learning.
Notesโ
d2l.ai is a good resource for learning about how AI works
Please note that the authors do, in fact, enjoy apples. They are delicious.