this post was submitted on 11 Nov 2023

14 points (81.8% liked)

Ask Science

9230 readers

75 users here now

Ask a science question, get a science answer.

Community Rules

Rule 1: Be respectful and inclusive.

Treat others with respect, and maintain a positive atmosphere.

Rule 2: No harassment, hate speech, bigotry, or trolling.

Avoid any form of harassment, hate speech, bigotry, or offensive behavior.

Rule 3: Engage in constructive discussions.

Contribute to meaningful and constructive discussions that enhance scientific understanding.

Rule 4: No AI-generated answers.

Strictly prohibit the use of AI-generated answers. Providing answers generated by AI systems is not allowed and may result in a ban.

Rule 5: Follow guidelines and moderators' instructions.

Adhere to community guidelines and comply with instructions given by moderators.

Rule 6: Use appropriate language and tone.

Communicate using suitable language and maintain a professional and respectful tone.

Rule 7: Report violations.

Report any violations of the community rules to the moderators for appropriate action.

Rule 8: Foster a continuous learning environment.

Encourage a continuous learning environment where members can share knowledge and engage in scientific discussions.

Rule 9: Source required for answers.

Provide credible sources for answers. Failure to include a source may result in the removal of the answer to ensure information reliability.

By adhering to these rules, we create a welcoming and informative environment where science-related questions receive accurate and credible answers. Thank you for your cooperation in making the Ask Science community a valuable resource for scientific knowledge.

We retain the discretion to modify the rules as we deem necessary.

founded 2 years ago

MODERATORS

If GPTs only predict the next word how do they decide between "a" and "an". Wouldn't this have massive effect on their abilities? (programming.dev)

submitted 1 year ago by [email protected] to c/askscience

8 comments fedilink hide all child comments

top 8 comments

sorted by: hot top controversial new old

[–] [email protected] 34 points 1 year ago

I think when people say it's only predicting the next word, it's a bit of an oversimplification to explain that the AI is not actually intelligent. It's more or less stringing words together in a way that seems plausible.

[–] [email protected] 16 points 1 year ago

They're very good at predicting the next word, so their choice of "a" or "an" is likely to make sense in context. But you can absolutely ask a GPT to continue a sentence that appears to use the wrong word.

For instance, I just tried giving a GPT this to start with:

My favorite fruit grows on trees, is red, and can be made into pies. It is a

And the GPT finished it with:

delicious and versatile fruit called apples!

So as you can see, language is malleable enough to make sense of most inputs. Though occasionally, a GPT will get caught up in a nonsensical phrase due to this behavior.

[–] GarrettBird 11 points 1 year ago* (last edited 1 year ago) (1 children)

To be overly simple about it, the LLM uses statistics and a bit of controlled RNG to pick its words. Words in the LLM have links to each other with statistical probabilities attached. If you take the sentence "I fed a peanut to an elephant" and "I fed a peanut to a elephant" and then asked 100 people which is more correct, there will be a percentage which favors one over the other. Now with LLMs its not choosing using weighted coin flips, but rather picking the most likely next word (most of the time). So if the 100 people choose "an elephant" over "a elephant" 65% of the time in its training data, then the LLM will be inclined to use "an elephant." However, Its important to know that the words around "an elephant" will also bias its choice to use the word 'an' for the word 'elephant'.

Really, its largely based on the training data and the contexts to which 'a' and 'an' are used. Or in other words, the LLM knows because people figured it out for the LLM. People did all the thinking, LLM's just use statistics on our bottled phrases to know when to use which. Of course, because it got its data from people - it will sometimes get it wrong which is based on how often people got it wrong.

[–] [email protected] 1 points 1 year ago (1 children)

I want to know what it does for words that can be either a or an like herb.

[–] GarrettBird 2 points 1 year ago

Well, my example of the word 'elephant' has the same property as 'herb' where the use of 'a' or 'an' can depend on who you ask. I chose my example trying to anticipate this exact question, and I believe I gave you an answer.

Let me put it this way: it depends... It depends on the data the LLM (Chat GPT for example) has been given to train its output. If we have an LLM dataset which uses only text by people in the United Kingdom, then the data will favor "a herb" as the 'h' is pronounced, where data from the United States will favor the other way as the 'h' is usually silent when spoken out loud.

As a fairly general rule, people use the article “an” before a vowel sound (like a silent “h”) and “a” before a consonant sound (like a pronounced, or aspirated, “h”). Usually the data gathered is from multiple English speaking countries, so both "an herb" and "a herb" will exist in the training data, and from there the LLM will favor picking the one that is shown more often (as the data will biased.)

Just for fun, I asked the LLM running on my local machine. Prompt: "Fill in the blank: "It is _ herb" Response: "It is an herb."

[–] [email protected] 6 points 1 year ago

If it generates "I ate" and the next word can be "a" or "an", then it will just generate one or the other based on how often they appear after "I ate". It hasn't decided by this point what it has eaten. After it has generated the next token, for example "I ate an", then its next token is now limited to food items that fit the grammatical structure of this sentence so far. Now it can decide: did I eat an apple? An orange? An awesome steak? etc

[–] felixwhynot 2 points 1 year ago

I thought they’re not so much choosing words as chunks of words. So it would be “a” but then “-n other” or whatever. Maybe?

[–] [email protected] 1 points 1 year ago

GPT creates plausible looking sentences, it has no concept of truth or anything like that. Since if you have an "an" it's overwhelmingly likely that the next word will begin with a vowel it will choose one which plausibly fits with the corpus of text that came before. Likewise for an "a".

There is no compromise in ability. It doesn't have anything to "say" or whatever. What it produces is more like nonsense poetry than speech.