this post was submitted on 06 Jul 2023
16 points (100.0% liked)

Stable Diffusion

4140 readers
1 users here now

Discuss matters related to our favourite AI Art generation technology

Also see

Other communities

founded 1 year ago
MODERATORS
 

Hi there, I'm trying to get (more or less) historically accurate images from the early and high middle ages, but none of the models seems to have a grasp of what "maille armor" or "bucket helmets" are and I either get complete garbage or fantasy armor that mostly resembles the early modern period (the stereotypical shining knight armor).

I assume a Lora, trained on images of the armor and weapons I'd like to include, could fix this problem. I found some neat tutorials for making Lora's and think I'll give it a shot.

Do any of you have experience in making these kinds of style Lora's? What should I take care of? I will be manually downloading images that fit my aestethic and manually tag them - how many images should I use? Any input here is highly appreciated.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] polystruct 1 points 1 year ago (1 children)

Nice answer! Is there a number of concepts, to your knowledge/experience, where a LORA no longer works? For instance, if I want to make a model that understands all car brands and types, assuming that the base model doesn't of course, would a LORA still be sensible here?

Most LORAs I find have a more specialized narrow focus, and I don't know if I would just start with multiple LORAs dealing with individual concepts (a LORA for a "1931 Minerva 8 AL Rollston Convertible Sedan", a LORA for a "Maybach SW 42, 1939", etc.) or if I should try and generate one LORA to ~~rule~~ know them all...

[โ€“] [email protected] 1 points 1 year ago* (last edited 1 year ago)

It would definitely be unwieldy to make LORAs that were so obsessively granular, but it comes down to how specific you want the output to be. It's probably pretty easy to make a Porsche LORA that will spit out credible Porsche silhouettes, but if you want accurate and detailed 911 Turbo S models you have to train it just on those. More accurate means less flexible; you have to pick your poison.

If you want an omni-functional thing then you really want to make your own checkpoint... But LORA are still a potentially good way to approach that because you can merge LORAs into a model, and you can merge LORAs into each other. I have no idea how you do that, but the functionality is there.

My plan is (eventually) to do exactly that with my own niche area of interest, build a model but do it in discrete chunks. So start out with one specific LORA for one specific garment that I have a particular interest in, get that to work well, then make others, then when I have enough that it's tiresome to sort through start merging them together.

That said... I am not an expert, not even slightly.