this post was submitted on 28 May 2024
442 points (94.6% liked)

Technology

[–] BradleyUffner 10 points 4 months ago (2 children)

Not exactly. Given the exact same inputs, the answers would be identical if they didn't intentionally inject some random jitter into the algorithm, specifically to avoid returning the same answer every time.

[–] [email protected] 10 points 4 months ago (1 children)

It’s not just random jitter, it also likely adds context, including the device you’re using, other recent queries, and your relative location (like what state you’re in).

I don’t work for Google, but I am somewhat close to a major AI product, and it’s pretty much the industry standard to give the model some contextual info in addition to your query. It’s also generally not “one model”, but a set of models run in sequence, with the LLM (think ChatGPT) only employed at the end to generate a paragraph from a conclusion and evidence found by a previous model.

[–] BradleyUffner 4 points 4 months ago

I consider "context", even if not added explicitly by the user, to be part of the input.

[–] [email protected] 1 points 4 months ago (1 children)

That jitter is automatically present because different people will get different search results, so it's not really intentional or purposeful.

[–] [email protected] 4 points 4 months ago (1 children)

Yes, it is intentional.

Some interfaces even expose a way to set the "temperature": higher values mean more randomized output (which feels creative), lower values mean less randomness. A temperature of 0 will make the model deterministic.
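
To make the temperature idea concrete, here is a minimal sketch of how temperature is commonly applied to a model's raw token scores before sampling. The scores are made up, the function name is hypothetical, and treating temperature 0 as "always pick the top token" is an assumed convention, not a statement about any particular product.

```python
import math

def token_probabilities(logits, temperature):
    """Turn raw scores into sampling probabilities at a given temperature."""
    if temperature == 0:
        # Assumed convention: temperature 0 means greedy decoding,
        # i.e. all probability mass goes to the highest-scoring token.
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
for t in (2.0, 1.0, 0.2, 0):
    print(t, [round(p, 3) for p in token_probabilities(logits, t)])
```

Lower temperatures concentrate probability on the top-scoring token, and higher ones flatten the distribution, which is where the "more creative" feel comes from.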

[–] [email protected] 1 points 4 months ago (1 children)

Even at 0 temperature the model will not be deterministic, because the output depends on the seed used as well as on things like numerical noise.
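
One way "numerical noise" can show up, sketched under assumptions: floating-point addition is not associative, so summing the same terms in a different order can give a slightly different result. If two candidate tokens are nearly tied, that difference can flip a greedy pick. The scores and the `greedy_pick` helper below are hypothetical. Whether a given deployment actually sums in varying order (for example in parallel reductions) is an assumption here.

```python
def greedy_pick(scores):
    """Return the token with the highest score (temperature-0 style)."""
    return max(scores, key=scores.get)

terms = [0.1, 0.2, 0.3]
forward = (terms[0] + terms[1]) + terms[2]   # 0.6000000000000001
backward = terms[0] + (terms[1] + terms[2])  # 0.6

print(forward == backward)  # False: same terms, different summation order

# Two "runs" where the tied scores happened to be summed in opposite orders.
run_1 = {"yes": forward, "no": backward}
run_2 = {"yes": backward, "no": forward}
print(greedy_pick(run_1), greedy_pick(run_2))  # yes no
```

This only matters when candidates are within rounding error of each other, so it would explain occasional divergence, not routinely different answers.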

[–] [email protected] 1 points 4 months ago* (last edited 4 months ago) (1 children)

Yeah no, that's not how this works.

Where in the process does that seed play a role, and what do you even mean by numerical noise?

Edit: I feel like I should add that I am very interested in learning more. If you can provide me with any sources to show that GPTs are inherently random, I am happy to eat my own hat.

[–] [email protected] 1 points 4 months ago (2 children)
[–] [email protected] 1 points 4 months ago

I appreciate the constructive comment.

Unfortunately the API docs are incomplete (insert Obi-Wan meme here). The seed value is both optional and irrelevant when setting the temperature to 0. I just tested it.

[–] [email protected] 1 points 4 months ago

Addendum:

The docs say

For reproducible outputs, set temperature to 0 and seed to a number:

But what they should say is

For reproducible outputs, set temperature to 0 or seed to a number:

Easy mistake to make
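
A toy sampler illustrates the "or" reading: in this sketch, greedy decoding at temperature 0 never touches the random number generator, so the seed cannot matter, while at a nonzero temperature a fixed seed alone makes the draws repeatable. The `sample_sequence` function and the scores are hypothetical, and this says nothing about how a hosted API behaves internally.

```python
import math
import random

def sample_sequence(logits, temperature, seed=None, length=8):
    """Draw `length` token indices from fixed scores (toy model, no context)."""
    rng = random.Random(seed)  # seed=None falls back to system entropy
    picks = []
    for _ in range(length):
        if temperature == 0:
            # Greedy pick: the RNG is never consulted on this path.
            picks.append(max(range(len(logits)), key=lambda i: logits[i]))
        else:
            peak = max(logits)
            weights = [math.exp((x - peak) / temperature) for x in logits]
            picks.append(rng.choices(range(len(logits)), weights=weights, k=1)[0])
    return picks

logits = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens

# Temperature 0: different seeds, same output.
print(sample_sequence(logits, 0, seed=1) == sample_sequence(logits, 0, seed=2))

# Temperature 1 with a fixed seed: repeatable across calls.
print(sample_sequence(logits, 1.0, seed=42) == sample_sequence(logits, 1.0, seed=42))
```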