this post was submitted on 21 Nov 2023
995 points (97.9% liked)

Technology

Over half of all tech industry workers view AI as overrated

[–] stingpie 2 points 1 year ago* (last edited 1 year ago) (1 children)

Let's play a little game, then. We each give the other a description of a project we made, and we each try to build the other's project using only what we can get out of ChatGPT. We send each other the chat logs after a week or so. I'll start: the hierarchical multiscale LSTM is a stacked LSTM in which each layer returns a boundary state; when that boundary state is true, the layer above it updates. The final layer is another LSTM that takes the hidden state from every layer and returns a final hidden state as an embedding of the whole input sequence.

I can't do this myself, because that would break OpenAI's terms of service, but if you make a model that won't develop into anything, that's fine. Now, what does your framework do?

Here's the paper I referenced while implementing it: https://arxiv.org/abs/1807.03595
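
For anyone following along, the description above can be sketched roughly as below. This is a minimal, untrained NumPy sketch of only what the comment describes, not the paper's implementation. Several things are assumptions and are not in the original: the names `LSTMCell` and `HMLSTMSketch`, the boundary detector (a single linear unit on the hidden state, thresholded at 0.5), the rule that a layer which does not update simply keeps its previous state, and feeding the final LSTM the concatenated hidden states at every time step.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class LSTMCell:
    """Plain LSTM cell with random weights (this sketch is never trained)."""

    def __init__(self, input_size, hidden_size, rng):
        # One weight matrix covering the input, forget, output and candidate gates.
        self.W = rng.normal(0.0, 0.1, (4 * hidden_size, input_size + hidden_size))
        self.b = np.zeros(4 * hidden_size)

    def step(self, x, h, c):
        gates = self.W @ np.concatenate([x, h]) + self.b
        i, f, o, g = np.split(gates, 4)
        c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h_new = sigmoid(o) * np.tanh(c_new)
        return h_new, c_new


class HMLSTMSketch:
    """Stacked LSTM where a layer updates only if the layer below signals a boundary."""

    def __init__(self, input_size, hidden_size, num_layers, seed=0):
        rng = np.random.default_rng(seed)
        self.hidden_size = hidden_size
        input_sizes = [input_size] + [hidden_size] * (num_layers - 1)
        self.cells = [LSTMCell(s, hidden_size, rng) for s in input_sizes]
        # Assumed boundary detector: one linear unit per layer.
        self.boundary_w = [rng.normal(0.0, 1.0, hidden_size) for _ in range(num_layers)]
        # Final LSTM that reads the hidden state of every layer.
        self.summary = LSTMCell(hidden_size * num_layers, hidden_size, rng)

    def embed(self, sequence):
        n = len(self.cells)
        h = [np.zeros(self.hidden_size) for _ in range(n)]
        c = [np.zeros(self.hidden_size) for _ in range(n)]
        h_sum = np.zeros(self.hidden_size)
        c_sum = np.zeros(self.hidden_size)
        update_counts = [0] * n  # how often each layer actually updated

        for x in sequence:
            layer_input = x
            for l in range(n):
                h[l], c[l] = self.cells[l].step(layer_input, h[l], c[l])
                update_counts[l] += 1
                boundary = sigmoid(self.boundary_w[l] @ h[l]) > 0.5
                if not boundary:
                    break  # layers above keep their previous state this step
                layer_input = h[l]
            # Assumption: the final LSTM sees all hidden states at every step.
            h_sum, c_sum = self.summary.step(np.concatenate(h), h_sum, c_sum)

        return h_sum, update_counts


model = HMLSTMSketch(input_size=3, hidden_size=4, num_layers=3)
inputs = np.random.default_rng(1).normal(size=(10, 3))
embedding, counts = model.embed(inputs)
print(embedding.shape, counts)
```

The bottom layer updates on every input, and each higher layer can only update as often as the one below it, which is the "multiscale" part. A trainable version would need a differentiable stand-in for the hard boundary threshold, which this sketch leaves out.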