overview for gjhiggins

18

Sam Bleckley's insighful explanation of why "Don't Fire Your Illustrator" (sambleckley.com)

submitted 11 months ago by [email protected] to c/[email protected]

1 comments fedilink

IMO, a very sane and well-informed explanation of the challenges of using generative ai to create specific imagery.

Decoding intermediate activations in llama-2-7b — LessWrong in c/[email protected]

[–] [email protected] 3 points 11 months ago

Informative, even to relatively ignorant me.

Any suggestions for this community? in c/[email protected]

[–] [email protected] 2 points 11 months ago

No, I haven't and I don't intend to because I wouldn't get anything out if the exercise. I don't (yet?) have a deep enough model to inform comparisons with anything other than different parameter sizes of the same pre-trained models of the Meta LLAMA foundation model. What I posted was basically the results of a proof-of-method. Now that I have some confidence that the responses aren't simply random, I guess the next step would be to run the method over the 7B/13B/30B models for i) vicuna and ii) wizard-vicuna which, AFAICT are the only pre-trained models that have been published with all three 7, 13 and 30 sizes.

It's not possible to get the foundation model to respond to OCEAN tests but on such a large and disparate training set, a broad “neural” on everything would be expected, just from the stats. In consequence, the results I posted are likely to be artefacts arising from the pre-training - it's plausible (to me) that the relatively-elevated Agreeableness and Conscientiousness are elevated as a result of explicit training and I can see how Neuroticism, Extroversion and Openness might not be similary affected.

In terms of the comparison between model parameter sizes, I have yet to run those tests and will report back when I have done.

What is wrong with LLM benchmarks, and why are we still using them? in c/[email protected]

[–] [email protected] 1 points 11 months ago

There is something wrong with these benchmarks. They do not relate to real-world performance.

That's true and generally readily admitted but apparently just ignored by many. I suspect your experience of the difference in performance between GPT-3 and much smaller models is an inevitable consequence of the fewer parameters. AUIU, the perplexity scale f'rinstance is non-linear, a fact which adds a lot of punch to this graph of quantization/model size. This is merely speculation on my part, I wish I had the kit to run > 30B models locally, sigh.

Any suggestions for this community? in c/[email protected]

[–] [email protected] 2 points 11 months ago

I edited the post to include a link to the discord image. If there's interest I can make a post with more details (I used Python's pexpect to communicate with a spawned llama.cpp process).

Any suggestions for this community? in c/[email protected]

[–] [email protected] 4 points 11 months ago* (last edited 11 months ago) (4 children)

was thinking of doing a daily quantized models post just for keeping up with the bloke

Wouldn't go amiss.

The best way to grow a community is to share the highest quality information possible. The reason I actually stopped being a lurker is because another Lemmy user told me this.

Okay, point taken. I've been guilty of lurking inappropriately and I can model the consequences of that.

I have a reasonable amount of direct experience of purposeful llama.cpp use with 7B/13B/30B models to offer. And there's a context - I'm exploring its potential role in the generation of content, supporting a sci-fi web comic project - hardly groundbreaking, I know but I'm hoping it'll help me create something outside the box.

For additional context, I'm a cognitive psychologist by discipline and a cognitive scientist by profession (now retired) and worked in classic AI back in the day.

Over on TheBloke's discord server, I've been exposing the results of a small variety of pre-trained LLM models' responses to the 50 questions of the OCEAN personality questionnaire, presented 25 times to each - just curious to see whether there was any kind of a reliable pattern emerging from the pre-training:

OCEAN questionnaire

OCEAN questionnaire full-size jpeg

Looks like the larger models enable a wider range of responses, I guess that's an expected consequence of a smoother manifold.

Happy to answer any questions that people may have and will be posting more in future.

Cheers, Graham