memes

10930 readers

3471 users here now

Community rules

1. Be civil

No trolling, bigotry or other insulting / annoying behaviour

2. No politics

This is non-politics community. For political memes please go to [email protected]

3. No recent reposts

Check for reposts when posting a meme, you can only repost after 1 month

4. No bots

No bots without the express approval of the mods or the admins

5. No Spam/Ads

No advertisements or spam. This is an instance rule and the only way to live.

Sister communities

[email protected] : Star Trek memes, chat and shitposts
[email protected] : Lemmy Shitposts, anything and everything goes.
[email protected] : Linux themed memes
[email protected] : for those who love comic stories.

founded 2 years ago

MODERATORS

Tenthrow

The_Picard_Maneuver

[email protected]

1157

I know just the audience for this (lemmy.world)

submitted 3 days ago by General_Effort to c/memes

112 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 14 points 3 days ago* (last edited 3 days ago) (1 children)

Sure it does, tool use is huge for actually using this tech to be useful for humans. Which openai and Google seem to have little interest in

Most of the core latest generation models have been focused on this, you can tell them what they have access to and how to use it, the one I have running at home (running on my too old for windows 11 mid-range gaming computer) can search the Web, ingest data into a vector database, and I'm working on a multi-turn system so they can handle more complex tasks with a mix of code and layers of llm evaluation. There's projects out there that give them control of a system or build entire apps on the spot

You can give them direct access to the terminal if you want to... It's very easy, but they're probably just going to trash the system without detailed external guidance

[–] [email protected] 3 points 3 days ago (1 children)

Tell moarz ?

[–] [email protected] 5 points 3 days ago* (last edited 3 days ago)

essentially rather than generating a reply meant for a human, they generate a special reply that the software interprets as “call this tool”. in th same way as the system prompt where the model operator tells the system how to behave, you tell the model what tools and parameters are available to it (for example, load page is a common one)… when the software receives a call for the tool, it calls real code to perform an action, which then responds to the model with the result so that it can continue to process. in this way, the model may kind of request access to limited external resources