this post was submitted on 12 Jan 2025
1157 points (98.1% liked)

memes

10930 readers
3471 users here now

Community rules

1. Be civilNo trolling, bigotry or other insulting / annoying behaviour

2. No politicsThis is non-politics community. For political memes please go to [email protected]

3. No recent repostsCheck for reposts when posting a meme, you can only repost after 1 month

4. No botsNo bots without the express approval of the mods or the admins

5. No Spam/AdsNo advertisements or spam. This is an instance rule and the only way to live.

Sister communities

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 14 points 3 days ago* (last edited 3 days ago) (1 children)

Sure it does, tool use is huge for actually using this tech to be useful for humans. Which openai and Google seem to have little interest in

Most of the core latest generation models have been focused on this, you can tell them what they have access to and how to use it, the one I have running at home (running on my too old for windows 11 mid-range gaming computer) can search the Web, ingest data into a vector database, and I'm working on a multi-turn system so they can handle more complex tasks with a mix of code and layers of llm evaluation. There's projects out there that give them control of a system or build entire apps on the spot

You can give them direct access to the terminal if you want to... It's very easy, but they're probably just going to trash the system without detailed external guidance

[–] [email protected] 3 points 3 days ago (1 children)
[–] [email protected] 5 points 3 days ago* (last edited 3 days ago)

essentially rather than generating a reply meant for a human, they generate a special reply that the software interprets as “call this tool”. in th same way as the system prompt where the model operator tells the system how to behave, you tell the model what tools and parameters are available to it (for example, load page is a common one)… when the software receives a call for the tool, it calls real code to perform an action, which then responds to the model with the result so that it can continue to process. in this way, the model may kind of request access to limited external resources