this post was submitted on 03 Jun 2024
1475 points (98.0% liked)
People Twitter
5225 readers
2438 users here now
People tweeting stuff. We allow tweets from anyone.
RULES:
- Mark NSFW content.
- No doxxing people.
- Must be a tweet or similar
- No bullying or international politcs
- Be excellent to each other.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
LLMs are not a good tool for processing data like this. They would be good for presenting that data though.
Make an LLM convert the data into a standardized format for your traditional algorithm.
There's no way to ensure that data will stay in that standardized format though. A custom model could but they are expensive to train.
Llms are excellent at consuming web data.
Not if you want to ensure the validity of the compiled coupons/discounts. A custom algorithm would be best but data standardization would be the main issue, regardless of how you process it.
What does validity mean in this case? A functionary LLM can follow links and make actions. I'm not saying it's not "work" to develop your personal bot framework, but this is all doable from the home PC, with a self hosted llm
Edit and of course you'll need non LLM code to handle parts of the processing, not discounting that
The LLM doesn't do that though, that the software built around it that does that which is what I'm saying. Its definitely possible to do, but the bulk of the work wouldn't be the task of the LLM.
Edit: forgot to address validity. By that I mean keeping a standard format and ensuring that the output is actually true given the input. Its not impossible, but its something that requires careful data duration and a really good system prompt.
Llms are great for scraping data
LLMs don't scrape data, scrapers scrape data. LLMs predict text.
https://youtu.be/fjP328HN-eY?si=quZeZx57fDjBW5EW
Puppeteer and gpt-vision are decidedly not LLMs
👍