DrakeRichards

joined 1 year ago
[–] DrakeRichards 12 points 1 year ago (1 children)

I can wash clothes just fine; all I have to do is gather them up and throw them in the washer. It’s folding that’s the big problem. We’re basically living out of laundry baskets at this point.

[–] DrakeRichards 1 points 1 year ago* (last edited 1 year ago) (1 children)

I have a local installation of Vladmandic's fork of the Automatic1111 web UI, so these steps are specific to Stable Diffusion. They will also work fine if you have a good colab.

1. Find a good base image

  1. Pick the right model. I use A-Zovya RPG Artist Tools mostly, but there are many other models that are great for more specific styles.
  2. Start with a simple prompt that includes general details about your subject. Don't try to go down the Midjourney-style rabbit hole of crafting the perfect prompt; quantity is far more important than quality with current AI models.
  3. Use txt2img to generate an image that will serve as a good base. You're looking for something that has the right colors, silhouette, and style. Don't worry about the fine details like fingers and faces: you'll clean those up later.
  4. Generate just a few images with your initial prompt to see what sort of results you get: batches of 5-10 should be enough to tell you if you're using the right tokens. If you see something frequently popping up that you don't want, add it to the negative prompt. Change your prompts around until your results start consistently including the details you're looking for.
  5. If you find an image that you like the style of but not the details, you can use that image as an input for ControlNet's Reference model in txt2img.

Here's the image I chose as my base for the tiefling. Generation parameters:

Female tiefling, sorcerer, librarian, goat horns, watercolor, masterpiece, best quality

Negative prompt: bad_prompt_version2, nude, nsfw, explicit, penis, nipples, sex, suggestive, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry

Steps: 10, Sampler: UniPC, CFG scale: 7, Seed: 3178845724, Size: 512x512, Model hash: da5224a242, Model: aZovyaRPGArtistTools_v2, Version: 1dffd11

2. Inpaint

  1. Inpaint any details you don't like. I use the openOutpaint extension since it's much easier to pick out details.
  2. Make sure you use an inpainting model and don't forget to load a VAE! Some checkpoints have a VAE baked in to their base model but not their inpainting model. This will make your inpainting attempts look grey and generally awful. What VAE you choose honestly doesn't matter much: I use Grapefruit's since it makes nice vibrant colors, but if I'm getting faces that look too much like anime then I'll switch to the base 1.5 VAE.
  3. Keep inpainting until you have all the features you want (right number of fingers, right clothes, etc.). Don't worry about the really fine details like eyes and fingernails yet. This will be the longest step; take your time!
  4. OpenOutpaint also lets you outpaint, so do that now if you want.
  5. Once you've got everything right, upscale it 2-4x. Try out all the different upscaling models to see which one works best for this specific image.
  6. Start inpainting again. Your image is now much larger, and you should only inpaint in 512x512 sections, so you'll also need to alter your prompt specifically for what you are inpainting at that time.
  7. Just keep inpainting and upscaling as needed. There's no one way to do things from here; just tinker until you're happy with the results.

Here's most of the images that I generated for the tiefling. This took me about 12 hours and almost 1000 images.

[–] DrakeRichards 1 points 1 year ago

These were all made with A-Zovya RPG Artist Tools v3. Tieflings are really tricky; I've found the most success by specifying what I wanted the horns to look like. For the tiefling in this post I used tiefling and goat horns in the prompt. I've also used the inpaint sketch tool to help define the rough shape, but it usually wasn't too effective.

[–] DrakeRichards 10 points 1 year ago (1 children)

They just want them to pay hundreds of thousands to millions of dollars to do so.

This is the hilarious part to me: some companies might pay these fees, but there will be many more who won’t and will instead use actual web scrapers to get their data anyways. As the number of individuals training LLM models increases in the next couple of years, this will create a much more significant traffic load compared to API calls.

[–] DrakeRichards 25 points 1 year ago (3 children)

I would assume that Lemmy is not very accessible yet, but Lemmy’s mobile apps are under a month old. They are making fast progress and I would expect that to change very soon.

However, Reddit’s app has been out for years and they have been told about its accessibility problems for just as long. The impression I get is that they didn’t prioritize accessibility since third-party apps handled that for them. When they cut off access to these apps, they made it very clear that they have no alternatives in mind; they consider the visually-impaired userbase to be insignificant and simply don’t care about their issues.

[–] DrakeRichards 1 points 1 year ago

This article from April 2022 has some more details. It sounds like Microsoft is planning to implement several different options depending on your use case. This will probably be most commonly used by businesses, not consumers.

[–] DrakeRichards 3 points 1 year ago

$5000 would let me pay off all of our credit card debt and a few medical bills. That would free up about $400 a month just from minimum payments and take off a massive stress load from worrying about all this interest. It would take me around 200 hours to save that up at my current rate.

Instead I feel trapped with what is a comparatively small amount of debt that I’ll be paying off for years if I don’t accrue any other debts.

[–] DrakeRichards 2 points 1 year ago (2 children)

It’s highly unlike that this would replace Windows entirely; it would probably be more like a thin client used to access applications requiring high computing power. The preview they show in the article is like an extra desktop. Think of it like this: you’ve got your normal Windows OS sitting on your physical hardware, but you can also connect to “your” instance of Windows Cloud to do heavy rendering or fire up an AI model.

[–] DrakeRichards 10 points 1 year ago

Apparently not:

The move comes not long after an increase in price for D&D books across the board. Physical copies run $59.95 normally, and the digital + physical bundles are $10 more, putting them around the same price as most newly released video games. The digital + physical bundles work exclusively with D&D Beyond. Meaning other platforms like Roll20 and Fantasy Grounds have their own separate content they do not sell bundled together.

[–] DrakeRichards 3 points 1 year ago (1 children)

That looks stunning! I was trying to figure out if it unfolded or something until I realized it’s an open design; that’s really clever. Do the dice roll out if it at all?

[–] DrakeRichards 3 points 1 year ago (1 children)

Nice! Thank you Biden for giving red-blooded Americans the freedom to choose what they do with their bodies!

[–] DrakeRichards 10 points 1 year ago (1 children)

"One of the big challenges is to find a new way to address what is currently a trial-and-error process so that more people can get better sooner," Williams said. "Bringing in these objective cognitive measures like imaging will make sure we're not using the same treatment on every patient."

It would be incredible if researchers found a way to overcome this hurdle. Going through a long process to find what works is a massive pain that can involve lots of negative side-effects.

view more: ‹ prev next ›