Technology

63928 readers

5787 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

347

WormGPT Is a ChatGPT Alternative With 'No Ethical Boundaries or Limitations' (www.pcmag.com)

submitted 2 years ago by L4s to c/technology

88 comments fedilink hide all child comments

WormGPT Is a ChatGPT Alternative With 'No Ethical Boundaries or Limitations'::undefined

you are viewing a single comment's thread
view the rest of the comments

[+] dorkian_gray 9 points 2 years ago* (last edited 2 years ago) (1 children)

[deleted]

[–] [email protected] 2 points 2 years ago (2 children)

the thing is, each ai is usually trained from scratch. There isn't any easy way to reuse the old weights. So the primary training has been done.... for the existing models. Future models are not affected by how current ones were trained. They will either have to figure out how to keep ai content out of their datasets, or they would have to stick to current "untainted" datasets.

[–] [email protected] 8 points 2 years ago

there isn't any easy way to reuse old weights

There is! As long as the model structure doesn't change, you can reuse the old weights and finetune the model for your desired task. You can also train smaller models based on larger models in a process called "knowledge distillation". But you're right: Newer, larger models need to be trained from scratch (as of right now)

But even then it's not really a problem to keep ai data out of a dataset. As you said: You can just take an earlier version of the data. As someone else suggested you can also add new data that is being curated by humans. If inbreeding actually ever happens remains to be seen ofc. There will be a point in time where we won't train machines to be like humans anymore, but rather to be whatever is most helpful to a human. And if that incorporates training on other AI data, well then that's that. Stanford's Alpaca already showed how ressource effective it can be to fine-tune on other AI data.

The future is uncertain but I don't think that AI models will just collapse like that

tl;dr beep boop