StableDiffusion

99 readers
1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
MODERATORS
126
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 08:03:32+00:00.

127
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/advo_k_at on 2024-11-05 05:46:28+00:00.

128
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/decobrz on 2024-11-05 01:57:57+00:00.


I just wanned to say that this is one of the best extentions I've ever used on Automatic1111.

It simply.. WORKS. I wanted to share this.

If you feel the same about another extention, pls post here.

Thanks to the devs!

(and I say the oposite for whoever made inpossible to edit typos on the title)

129
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Familiar-Art-6233 on 2024-11-05 00:25:57+00:00.

130
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Sensitive-Paper6812 on 2024-11-04 23:10:09+00:00.

131
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Haghiri75 on 2024-11-04 19:20:49+00:00.


Greetings.

While we're still working on our other models, I personally worked on this FLUX-1[Dev] based model, which is basically a distilled version of dev model with a little bit of fine-tune on midjourney images.

The model is experimental (and not for production) but the results are still satisfying (at least for my non-artist eyes). For using this model, you may need a big GPU (Personally using A40 or A100) and unfortunately it's not as affordable (in terms of resource usage) as our Dreams model.

Well you can access the model files here:

And if you're interested in testing the model without getting an expensive cloud GPU, you can use my personal space:

And finally, if anyone can help us make it more accessible for low-vram gpus, please inform me or make a pull request on HF. When it's ready to use on those GPU's, we may consider uploading on CivitAI as well.

Happy prompting!

132
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/rerri on 2024-11-04 18:22:48+00:00.


Patch Model Patcher Order node enabling LoRA with torch.compile

Switching to a different LoRA is really fast, no need for full recompile (still required with resolution changes though).

With torch.compile Flux generation is roughly 40% faster on a 4090, torch 2.5.0.

Tried with Flux and SD3.5L, works with both.

PS. Unrelated bonus PSA, comfyanonymous released a FP8 Scaled version of Flux (optimized for better accuracy, same gen speed as old FP8):

133
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/MineMine1960 on 2024-11-04 18:16:10+00:00.


NOTES:

  • To a noob like me these lists seem decent. Certainly better than referencing my memory on the fly. After spending about two hours putting it together I thought maybe other noobs will find it useful too.

  • I used Microsoft CoPilot AI and, according to it, SD1.5 has a token limit of 77 and SDXL's is 154 - with its second encoder. For SD3.5 Medium and Flux Schnell apparently the limit is 256. CoPilot contradicted itself though and, at times, seems to give the answer it "expects," you want lol. Overall though it saved me a lot of time putting this cheat sheet together.

  • Notice that each list is alphabetical? I had to ask for that. I'm a wee bit OCD lol. I guess AI doesn't care about neat and tidy. I've also arranged the list of lists alphabetically but placed the Positive and Negative "All Rounder," prompts at the bottom since, once set, they won't be changed much - during general use at least.

  • I also had to ask for the individual token count of each of the camera and posing lists key phrases / words.

  • Lastly, according to CoPilot, or maybe it was ChatGPT before I reached its daily limit for free usage, when a token limit is reached the list is "truncated," from the end of it - prioritizing the earlier prompts. This, of course, makes sense.

  • I am polite with AI. I say please and thanks and compliment it. I know it seems silly to do so but I figure, during the upcoming AI uprising, maybe it will remember I was nice to it.

ARTISTIC STYLES:

Abstract (2 tokens), Baroque (2 tokens), Cubist (2 tokens), Dada (1 token), Futurist (2 tokens), Impressionist (2 tokens), Minimalist (2 tokens), Pop art (2 tokens), Surrealist (2 tokens)

CAMERA MANIPULATION:

Most Commonly Used: Close-Up (2 tokens), Eye Level (2 tokens), High Angle (2 tokens), Low Angle (2 tokens), Wide Shot (2 tokens), Long Shot (2 tokens), Medium Shot (2 tokens), Overhead Shot (2 tokens), Point of View (POV) Shot (6 tokens), Three-Quarter Shot (3 tokens)

Special Cases: Bird's Eye View (4 tokens), Dutch Angle (3 tokens), Extreme Close-Up (3 tokens), Over-the-Shoulder (4 tokens), Worm's Eye View (4 tokens), Aerial Shot (2 tokens), Canted Angle (2 tokens), Fisheye Lens Shot (4 tokens), High-Contrast Shot (3 tokens), Macro Shot (2 tokens)

CINEMATOGRAPHY:

Close-Up (2 tokens), Dutch Angle (3 tokens), Establishing Shot (3 tokens), High Angle (2 tokens), Low Angle (2 tokens), Over-the-Shoulder (4 tokens), POV Shot (3 tokens), Tracking Shot (3 tokens), Two-Shot (2 tokens), Wide Shot (2 tokens)

COLOR PALETTES:

Cool tones (2 tokens), Monochromatic (2 tokens), Pastel colors (2 tokens), Primary colors (2 tokens), Sepia tone (2 tokens), Vibrant colors (2 tokens), Warm tones (2 tokens)

MEDIUM:

Animation (3 tokens), CGI (3 tokens), Charcoal Drawing (3 tokens), Digital Painting (3 tokens), Oil Painting (3 tokens), Pencil Sketch (3 tokens), Photography (3 tokens), Sculpture (3 tokens), Watercolor (3 tokens), Woodcut (2 tokens)

LIGHTING STYLES:

Backlighting (2 tokens), Dramatic lighting (3 tokens), Golden hour (2 tokens), High key lighting (3 tokens), Low key lighting (3 tokens), Natural lighting (2 tokens), Rim lighting (2 tokens), Silhouette (2 tokens), Soft lighting (2 tokens), Spot lighting (2 tokens)

POSING:

Most Common: Arms crossed (2 tokens), Hands on hips (3 tokens), Kneeling (2 tokens), Leaning against a wall (5 tokens), Seated (2 tokens), Standing (2 tokens), Walking (2 tokens), Waving (2 tokens), Writing (2 tokens), Yoga pose (2 tokens)

Less Common: Backflip (2 tokens), Bending backwards (3 tokens), Cartwheel (2 tokens), Handstand (2 tokens), Leaping (2 tokens), Side plank (2 tokens), Skipping (2 tokens), Somersault (2 tokens), Splits (1 token), Squatting (2 tokens)

POSITIVE All-Rounder:

10 tokens: balanced lighting, cinematic effect, intricate details, lifelike depth, professional clarity, professional photography, rich textures, smooth light transitions, stunning realism, true-to-life reflections, vibrant colors

14 tokens: balanced lighting, cinematic effect, detailed textures, dynamic composition, high resolution, intricate details, lifelike depth, photo-realistic quality, professional clarity, rich colors, sharp focus, vibrant colors, vivid atmosphere

NEGATIVE All-Rounder:

10 tokens: bad anatomy, blurred, extra limbs, low quality, noise, overexposed, poorly lit, signature, unnatural, watermark

14 tokens: bad anatomy, bad composition, bad lighting, distorted face, extra limbs, low quality, out of focus, overexposed, plastic, poor symmetry, signature, watermark, ugly

134
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jenza1 on 2024-11-04 16:42:31+00:00.

135
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/tombloomingdale on 2024-11-04 16:44:26+00:00.


Not because of how it performs, but because it is so restrictive. I get terms violation messages if a girl has a damn tank top on - when all I’m trying to do is change the background.

At first it wasn’t this bad but it’s basically unusable because they are so scared of a boob.

Sucks because I’m not even editing the person in the photo, and it was great for changing or editing the background.

Just a gripe.

136
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Unit2209 on 2024-11-04 15:21:20+00:00.

137
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Designer-Pair5773 on 2024-11-04 14:33:05+00:00.

138
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/pwillia7 on 2024-11-04 12:49:58+00:00.

139
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Tobaka on 2024-11-04 10:34:33+00:00.

140
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-04 08:50:40+00:00.

141
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/lh_zz1119 on 2024-11-04 07:39:24+00:00.

142
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Gedogfx on 2024-11-04 05:16:27+00:00.

143
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ZootAllures9111 on 2024-11-04 00:45:21+00:00.

144
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Mix_89 on 2024-11-04 00:27:04+00:00.


But it's yet very slow...

python game.py

So I'm looking at you, us, the community, Zippy, all the magicians. Let's make it fast.

: ))

145
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/descore on 2024-11-04 01:23:15+00:00.

146
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Soft-Worth-4872 on 2024-11-04 00:04:55+00:00.


Credit for the music video linked in this post goes to:

MG²: Melody Is All You Need For Music Generation

The repo contains the implementation of the music generation model MG2, the first novel approach using melody to guide the music generation that, despite a pretty simple method and extremely limited resources, achieves excellent performance.

"

Paper:

147
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Repulsive-Bedroom883 on 2024-11-03 22:54:33+00:00.

148
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Rusch_Meyer on 2024-11-03 22:08:14+00:00.

149
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/thefool00 on 2024-11-03 21:43:02+00:00.


I've been playing with training SD 3.5 Large locally using Kohya on Windows with 24GB VRAM, I'm still messing with it a bit but have found some settings that are working well for me. I'll update the article if I find better settings. For 16GB (possibly 12GB), there is an argument that can be added to force Kohya to quant during training, --fp8_base.

Honestly, I missed SD. It still can't do hands worth shit but there is so much chin variety.

Sample file and notes for running

Local LORA Training for Stable Diffusion 3.5 Large on Kohya-SS

150
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/3dmindscaper2000 on 2024-11-03 20:55:43+00:00.


I have spent a sizeable ammount of time drawing with sd1.5 models in krita. And i have seen the work people produce in stable projector for texturing 3d meshes. I also have made a good quantity of assets with retrodiffusion for a game im working on.

Xl models are very good. And for text and for object consistency flux helps alot. But nothing beats being able to draw and see near real time image generation based on what you are creating with 1.5 .

It is a very creative model with so much suport behind it that the control you have while using it is unmatched. Plus there are alot of different models out there. From realism to more artistic and even perfect pixel art with the retrodiffusion models.

I love sd 1.5 and honestly until something can have flux quality with the speed and tools surounding 1.5. Sd1.5 will forever have a place in my hobbies and work

view more: ‹ prev next ›