7 Drill Questions: Uncertain Quantities

Remember to hand in your work …

At any point, you can submit your answers by collecting them and uploading them to the class site.

A. Relative probability

Drill 7.2 Each of the following graphics, except one, can be interpreted as a relative probability function. Which is the exception?

Drill 7.3

An (absolute) probability is also a relative probability.

Drill 7.4 Large Language Models generate responses in steps. In each step, one new token is added to the part of the response that has already been formed. To illustrate, here are the tokens that one AI considers likely to follow “When I was walking …”.

token	weight	lyrics
in	2.8	2.1
around	1.4	-0.1
to	3.1	0.6
through	0.7	1.0
down	-1.2	1.8
home	1.9	2.0
back	-0.3	-1.1
I	1.7	0.8
by	-1.1	1.6
along	-2.6	1.4

Each word is assigned a weight. There are typically around 50,000 words to choose from. The table above shows just a handful of the more likely ones.

The prompt “What are the most likely AI tokens that might follow ‘When I was walking’” generated the weights in the middle column. But for the “lyrics” column, the prompt was preceded by “Taking into account the lyrics of popular music, ….”

The numbers in the “weight” and “lyrics” columns are not relative probabilities. Which of the following explanations is most salient for this conclusion?

True or False

Relative probabilities do not need to add up to 1.

True or False

Relative probabilities do not need to be non-negative.

True or False

A relative probability can never be zero.

In fact, the weights generated by the AI are not relative probabilities. They are ordinary numbers which might be large or small, positive or negative. In the context of indicating a relative probability, the AI weights are called “logits.” Converting a logit to a relative probability is simple: use the storybook function double(). The following chunk reads the weights and words into an R data frame, then does the conversion to relative and absolute probabilities.
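
The conversion described above can be sketched in base R as follows. This is a minimal illustration, not necessarily the chunk used in the course materials. The logits are copied from the table above. The use of `exp()` as the conversion function is an assumption here (it is the conventional choice for logits), and the column names `rel_general`, `rel_lyrics`, `p_general`, `p_lyrics`, and `ratio` are made up for this example. Only the ten tokens in the table are included, so the absolute probabilities are relative to this short list rather than to the full vocabulary.

```r
# Logits copied from the table above.
tokens <- data.frame(
  token  = c("in", "around", "to", "through", "down",
             "home", "back", "I", "by", "along"),
  weight = c(2.8, 1.4, 3.1, 0.7, -1.2, 1.9, -0.3, 1.7, -1.1, -2.6),
  lyrics = c(2.1, -0.1, 0.6, 1.0, 1.8, 2.0, -1.1, 0.8, 1.6, 1.4),
  stringsAsFactors = FALSE
)

# Assumption: exp() serves as the logit-to-relative-probability conversion.
# The result is never negative, whatever the sign of the logit.
tokens$rel_general <- exp(tokens$weight)
tokens$rel_lyrics  <- exp(tokens$lyrics)

# Absolute probabilities: divide each relative probability by the column
# total, so that each column sums to 1 over these ten tokens.
tokens$p_general <- tokens$rel_general / sum(tokens$rel_general)
tokens$p_lyrics  <- tokens$rel_lyrics  / sum(tokens$rel_lyrics)

# Ratio of lyrics probability to general-text probability.
# Values well above 1 mark tokens that are favored in the lyrics context.
tokens$ratio <- tokens$p_lyrics / tokens$p_general

# Show the tokens sorted from most to least favored in lyrics.
print(
  tokens[order(-tokens$ratio), c("token", "p_general", "p_lyrics", "ratio")],
  digits = 3
)
```

Sorting by `ratio` puts the tokens most favored in the lyrics context at the top of the printed table, which is one way to approach the question that follows.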

Which three words are much more likely to occur following “When I was walking” in the context of lyrics compared to general text?