Link to the generator: https://perchance.org/fusion-ai-image-generator
What is it?
A T2I (text-to-image) generator to which I have added prompt switching and various other modes for both the prompt and the negatives.
Note that the public image gallery for this generator is NSFW.
I have added sampling that can be enabled/disabled or appended to an existing prompt for every new generation.
I have yet to make the HTML for the generator, which is why I am using the t2i-framework plugin for now.
The interface is confusing right now, but the generator is configured so that a user does not have to change any settings (apart from maybe the dataset used) to get things to work.
The big datasets include the 47K CLIP tokens, 63K unique search words for the NSFW dataset (general category), and a randomized selection of 30K unique tags from danbooru sites, taken from the sd-webui-tagcomplete extension.
Sampling
As in the normal image generator, you can press the dice 🎲 to get a number of tokens from the selected dataset.
As an additional feature, tokens can be sampled automatically by selecting the corresponding modes at the bottom.
CLIP token dataset
Sampling includes 47K CLIP tokens, ordered by rarity (how frequently these words appeared in the training data) and by whether they are a prefix token or a suffix token.
Suffix tokens have a trailing `</w>` symbol and are the "normal" tokens, e.g. "car", "banana", "happy".
Prefix tokens lack the trailing `</w>` and will give new properties to existing tokens when placed next to them.
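As a rough sketch of the prefix/suffix split (not the generator's actual code), the snippet below sorts a tiny made-up vocabulary into the two groups. It assumes the trailing symbol is the `</w>` end-of-word marker used in the CLIP BPE vocabulary; `EXAMPLE_VOCAB` and `split_vocab` are hypothetical names used only for illustration.

```python
# Hypothetical mini-vocabulary; the real CLIP vocabulary has roughly 47K entries.
EXAMPLE_VOCAB = ["car</w>", "banana</w>", "happy</w>", "un", "pre", "mega"]

END_OF_WORD = "</w>"  # assumed to be the trailing symbol on suffix tokens


def split_vocab(vocab):
    """Return (prefix_tokens, suffix_tokens) with the end-of-word marker stripped."""
    prefixes, suffixes = [], []
    for token in vocab:
        if token.endswith(END_OF_WORD):
            # Suffix token: a "normal" token that ends a word.
            suffixes.append(token[: -len(END_OF_WORD)])
        else:
            # Prefix token: no end-of-word marker, so it attaches to what follows.
            prefixes.append(token)
    return prefixes, suffixes


prefixes, suffixes = split_vocab(EXAMPLE_VOCAB)
print("prefix tokens:", prefixes)
print("suffix tokens:", suffixes)
```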
If you wish to invoke a single prefix token in the prompt, you can write #prefix#, e.g. "photo of a #prefix#banana", to see how this works. Tokens from other datasets can be invoked in a similar manner.
The #prandom# sampling is done by pairing together a random prefix-token with a random suffix-token. This is useful for randomizing stuff.
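To make the placeholder idea concrete, here is a minimal Python sketch of how #prefix# and #prandom# could be expanded. This is only an illustration under assumptions, not how the generator is implemented: the token lists are tiny made-up samples, and `expand` is a hypothetical helper name.

```python
import random
import re

# Hypothetical token lists; the generator draws from far larger datasets.
PREFIX_TOKENS = ["un", "mega", "cyber"]
SUFFIX_TOKENS = ["car", "banana", "happy"]


def expand(prompt, rng):
    """Replace each #prefix# and #prandom# placeholder with sampled tokens."""

    def pick(match):
        name = match.group(1)
        if name == "prefix":
            return rng.choice(PREFIX_TOKENS)
        if name == "prandom":
            # Pair a random prefix token with a random suffix token.
            return rng.choice(PREFIX_TOKENS) + rng.choice(SUFFIX_TOKENS)
        return match.group(0)  # leave unknown placeholders untouched

    return re.sub(r"#(\w+)#", pick, prompt)


rng = random.Random(0)  # seeded only so repeated runs give the same picks
print(expand("photo of a #prefix#banana", rng))
print(expand("photo of a #prandom#", rng))
```

Because the prefix token is glued directly onto "banana", the result is a single made-up word rather than two separate ones, which is the point of using a prefix token.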
✳️ Prepositions
Prepositions are the small words a user writes between things in a sentence, such as "of" and "on".
Take the prompt "A photo of a cat on a table" and it becomes "photo cat table" without prepositions.
SD 1.5 was trained on text written as sentences, so adding some prepositions can sometimes be good.
Other times (for exotic tokens or for the #artnsfw# dataset), the prepositions can make the image more boring.
You can set the % rate at which a preposition will be placed between sampled tokens.
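The % rate can be pictured with a short Python sketch like the one below. It is an illustration under assumptions rather than the generator's real logic: the preposition list is made up, and `join_with_prepositions` is a hypothetical helper that rolls the dice once for every gap between two sampled tokens.

```python
import random

# Hypothetical preposition list; the generator's actual list is not shown in this post.
PREPOSITIONS = ["of", "on", "in", "with", "under"]


def join_with_prepositions(tokens, rate_percent, rng):
    """Join tokens, inserting a preposition between neighbours at the given % rate."""
    if not tokens:
        return ""
    words = [tokens[0]]
    for token in tokens[1:]:
        # rng.random() is in [0, 1), so 0 never inserts and 100 always inserts.
        if rng.random() * 100 < rate_percent:
            words.append(rng.choice(PREPOSITIONS))
        words.append(token)
    return " ".join(words)


rng = random.Random(0)  # seeded only so repeated runs give the same picks
print(join_with_prepositions(["photo", "cat", "table"], 0, rng))    # rate 0: tokens only
print(join_with_prepositions(["photo", "cat", "table"], 50, rng))   # rate 50: some gaps filled
print(join_with_prepositions(["photo", "cat", "table"], 100, rng))  # rate 100: every gap filled
```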
Getting started
For new users: just write something in the Main field and hit generate.
Should things get too confusing/overwhelming you can open the generator in incognito-mode to get the default settings.
//----//
That's the TLDR of this generator that I made.
If you can ignore the confusing interface, you can have a lot of fun with it.
It works really well on mobile as you don't have to type anything. You can just hit "generate" and it will randomize automatically.
Current status / help
Currently I'm trying to get to grips with the tagcomplete #artnsfw# dataset and the other datasets.
There is a problem where the generator will sometimes reload when on mobile.
I also need some more options for the ✨Base prompts.
Additionally, I'm thinking about changing the image gallery at some point.
Preferably to an option where users don't have to dump everything (NSFW and non-NSFW) into the "Public" folder in order for it to be visible, e.g. have buttons that a user can click to change the shown gallery.
![](https://lemmy.world/pictrs/image/804e627a-d677-4393-9c6a-cee8f1005aa2.jpeg)