StableDiffusion

99 readers
1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
MODERATORS
51
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/crystal_alpine on 2024-11-05 16:48:00+00:00.

52
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/afinalsin on 2024-11-05 13:11:37+00:00.


Hello /r/stablediffusion, I'm risking a "longer ban" by posting this resource again since the mods clapped my ass with a three day the last time I posted it, so get it while it lasts.

If you've seen any of my other prompt comparisons, or this very same one that got me yeeted last week, you know what this is. These new images can't be directly compared to the old ones because of the sampler/scheduler change with this new generation of models, but the seed is the same.

Instead of multiple prompts over one big image, each prompt is its own image, with the prompt contained on the image itself. I have censored everything I thought might toe the line, I don't want mommy and daddy to punish me again. Here are the galleries:

Prompt 1-20

Prompt 21-40 | Beware *CENSORED* prompt 34 prompt 40

Prompt 41-60 | Beware *CENSORED* prompt 55 prompt 58

Prompt 61-80 | Beware *CENSORED* prompt 65 prompt 67 prompt 69 prompt 80

Prompt 81-100 | Beware *CENSORED* prompt 84 prompt 98 prompt 100

Prompt 101-120 | Beware *CENSORED* prompt 111

Prompt 121-140

Prompt 141-160 | Beware *CENSORED* prompt 141

Prompt 161-170

An easy way to quickly see the full quality image on civit is right click the image and click "open image in new tab". From there, delete /width=700,original=false from the url, which forces it to load the full quality image.

Settings and stuff in the comments.

53
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/uisato on 2024-11-05 11:23:43+00:00.

54
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/AI-freshboy on 2024-11-05 09:37:06+00:00.


check out .

55
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 13:55:29+00:00.

56
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-11-05 11:58:46+00:00.


Major Stories

AI Models Enter Fashion Industry: Fashion brands like Mango are implementing AI-generated models, saving millions while raising questions about the future of human modeling. AI services cost $29/month vs $35/hour for human models.

Open Source Initiative Defines 'Open-Source' AI: OSI sparks debate by establishing strict criteria for what constitutes "open-source" AI, challenging tech giants like Meta over transparency in training data and methodologies.

All New Tools & Updates

  • Detail-Daemon: ComfyUI plugin for powerful detail enhancement. Features sigma parameter adjustment, compatible with SDXL and SD1.5 models, optimized for Flux outputs.
  • PixelWave: Community-created Flux model fine-tune offering enhanced aesthetics. 6.7GB GGUF format, trained for 5 weeks on RTX 4090, noted for less "plastic-looking" results.
  • ComfyUI Image Filters: Comprehensive filter collection with 100x faster blur operations, guided filters, color matching, and new BetterFilmGrain node.
  • ComfyUI-MochiEdit: Video editing nodes for Genmo Mochi, featuring unsampling and sampling nodes with adjustable guidance parameters.
  • Oasis: Real-time AI-generated game demonstration with 500M parameter open-source model, currently running on cloud infrastructure.
  • Blendbox Alpha: Layer-based AI image generation tool with real-time adjustments for lighting, texture, and composition. Currently in internal testing.
  • Suno Personas: New feature for capturing and replicating specific musical styles and vocal characteristics. Premium feature with first 200 songs free.
  • SD 3.5 Upscaling Technique: New workflow combining SD 3.5 Large and Medium models with Skip Layer Guidance for enhanced upscaling and detail retention.
  • ElevenLabs X-to-Voice: Open-source tool converting Twitter profiles to AI voices and avatars in about one minute, deployable on Vercel platform.
  • BigASP v2: Large-scale SDXL fine-tune trained on 6.7M images, featuring custom quality rating system and improved score tag system.
  • InvokeAI 5.3: Latest update featuring AI-powered object selection tool based on Meta's SAM, Flux support, and pressure sensitivity tablet support.
  • SD 3.5 Medium: Stability AI's 2.6B parameter model requiring 9.9GB VRAM, supporting up to 1440x1440 resolution, 4x faster than SD 3.5 Large.
  • Two-Character Flux Generation: Method for creating consistent AI-generated images of two distinct characters using Flux AI and LoRA, with complete training dataset available.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

57
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ZooterTheWooter on 2024-11-05 03:47:05+00:00.

58
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ExpressWarthog8505 on 2024-11-05 09:14:34+00:00.

59
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 08:03:32+00:00.

60
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/advo_k_at on 2024-11-05 05:46:28+00:00.

61
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/decobrz on 2024-11-05 01:57:57+00:00.


I just wanned to say that this is one of the best extentions I've ever used on Automatic1111.

It simply.. WORKS. I wanted to share this.

If you feel the same about another extention, pls post here.

Thanks to the devs!

(and I say the oposite for whoever made inpossible to edit typos on the title)

62
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Familiar-Art-6233 on 2024-11-05 00:25:57+00:00.

63
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Sensitive-Paper6812 on 2024-11-04 23:10:09+00:00.

64
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Haghiri75 on 2024-11-04 19:20:49+00:00.


Greetings.

While we're still working on our other models, I personally worked on this FLUX-1[Dev] based model, which is basically a distilled version of dev model with a little bit of fine-tune on midjourney images.

The model is experimental (and not for production) but the results are still satisfying (at least for my non-artist eyes). For using this model, you may need a big GPU (Personally using A40 or A100) and unfortunately it's not as affordable (in terms of resource usage) as our Dreams model.

Well you can access the model files here:

And if you're interested in testing the model without getting an expensive cloud GPU, you can use my personal space:

And finally, if anyone can help us make it more accessible for low-vram gpus, please inform me or make a pull request on HF. When it's ready to use on those GPU's, we may consider uploading on CivitAI as well.

Happy prompting!

65
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/rerri on 2024-11-04 18:22:48+00:00.


Patch Model Patcher Order node enabling LoRA with torch.compile

Switching to a different LoRA is really fast, no need for full recompile (still required with resolution changes though).

With torch.compile Flux generation is roughly 40% faster on a 4090, torch 2.5.0.

Tried with Flux and SD3.5L, works with both.

PS. Unrelated bonus PSA, comfyanonymous released a FP8 Scaled version of Flux (optimized for better accuracy, same gen speed as old FP8):

66
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/MineMine1960 on 2024-11-04 18:16:10+00:00.


NOTES:

  • To a noob like me these lists seem decent. Certainly better than referencing my memory on the fly. After spending about two hours putting it together I thought maybe other noobs will find it useful too.

  • I used Microsoft CoPilot AI and, according to it, SD1.5 has a token limit of 77 and SDXL's is 154 - with its second encoder. For SD3.5 Medium and Flux Schnell apparently the limit is 256. CoPilot contradicted itself though and, at times, seems to give the answer it "expects," you want lol. Overall though it saved me a lot of time putting this cheat sheet together.

  • Notice that each list is alphabetical? I had to ask for that. I'm a wee bit OCD lol. I guess AI doesn't care about neat and tidy. I've also arranged the list of lists alphabetically but placed the Positive and Negative "All Rounder," prompts at the bottom since, once set, they won't be changed much - during general use at least.

  • I also had to ask for the individual token count of each of the camera and posing lists key phrases / words.

  • Lastly, according to CoPilot, or maybe it was ChatGPT before I reached its daily limit for free usage, when a token limit is reached the list is "truncated," from the end of it - prioritizing the earlier prompts. This, of course, makes sense.

  • I am polite with AI. I say please and thanks and compliment it. I know it seems silly to do so but I figure, during the upcoming AI uprising, maybe it will remember I was nice to it.

ARTISTIC STYLES:

Abstract (2 tokens), Baroque (2 tokens), Cubist (2 tokens), Dada (1 token), Futurist (2 tokens), Impressionist (2 tokens), Minimalist (2 tokens), Pop art (2 tokens), Surrealist (2 tokens)

CAMERA MANIPULATION:

Most Commonly Used: Close-Up (2 tokens), Eye Level (2 tokens), High Angle (2 tokens), Low Angle (2 tokens), Wide Shot (2 tokens), Long Shot (2 tokens), Medium Shot (2 tokens), Overhead Shot (2 tokens), Point of View (POV) Shot (6 tokens), Three-Quarter Shot (3 tokens)

Special Cases: Bird's Eye View (4 tokens), Dutch Angle (3 tokens), Extreme Close-Up (3 tokens), Over-the-Shoulder (4 tokens), Worm's Eye View (4 tokens), Aerial Shot (2 tokens), Canted Angle (2 tokens), Fisheye Lens Shot (4 tokens), High-Contrast Shot (3 tokens), Macro Shot (2 tokens)

CINEMATOGRAPHY:

Close-Up (2 tokens), Dutch Angle (3 tokens), Establishing Shot (3 tokens), High Angle (2 tokens), Low Angle (2 tokens), Over-the-Shoulder (4 tokens), POV Shot (3 tokens), Tracking Shot (3 tokens), Two-Shot (2 tokens), Wide Shot (2 tokens)

COLOR PALETTES:

Cool tones (2 tokens), Monochromatic (2 tokens), Pastel colors (2 tokens), Primary colors (2 tokens), Sepia tone (2 tokens), Vibrant colors (2 tokens), Warm tones (2 tokens)

MEDIUM:

Animation (3 tokens), CGI (3 tokens), Charcoal Drawing (3 tokens), Digital Painting (3 tokens), Oil Painting (3 tokens), Pencil Sketch (3 tokens), Photography (3 tokens), Sculpture (3 tokens), Watercolor (3 tokens), Woodcut (2 tokens)

LIGHTING STYLES:

Backlighting (2 tokens), Dramatic lighting (3 tokens), Golden hour (2 tokens), High key lighting (3 tokens), Low key lighting (3 tokens), Natural lighting (2 tokens), Rim lighting (2 tokens), Silhouette (2 tokens), Soft lighting (2 tokens), Spot lighting (2 tokens)

POSING:

Most Common: Arms crossed (2 tokens), Hands on hips (3 tokens), Kneeling (2 tokens), Leaning against a wall (5 tokens), Seated (2 tokens), Standing (2 tokens), Walking (2 tokens), Waving (2 tokens), Writing (2 tokens), Yoga pose (2 tokens)

Less Common: Backflip (2 tokens), Bending backwards (3 tokens), Cartwheel (2 tokens), Handstand (2 tokens), Leaping (2 tokens), Side plank (2 tokens), Skipping (2 tokens), Somersault (2 tokens), Splits (1 token), Squatting (2 tokens)

POSITIVE All-Rounder:

10 tokens: balanced lighting, cinematic effect, intricate details, lifelike depth, professional clarity, professional photography, rich textures, smooth light transitions, stunning realism, true-to-life reflections, vibrant colors

14 tokens: balanced lighting, cinematic effect, detailed textures, dynamic composition, high resolution, intricate details, lifelike depth, photo-realistic quality, professional clarity, rich colors, sharp focus, vibrant colors, vivid atmosphere

NEGATIVE All-Rounder:

10 tokens: bad anatomy, blurred, extra limbs, low quality, noise, overexposed, poorly lit, signature, unnatural, watermark

14 tokens: bad anatomy, bad composition, bad lighting, distorted face, extra limbs, low quality, out of focus, overexposed, plastic, poor symmetry, signature, watermark, ugly

67
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jenza1 on 2024-11-04 16:42:31+00:00.

68
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/tombloomingdale on 2024-11-04 16:44:26+00:00.


Not because of how it performs, but because it is so restrictive. I get terms violation messages if a girl has a damn tank top on - when all I’m trying to do is change the background.

At first it wasn’t this bad but it’s basically unusable because they are so scared of a boob.

Sucks because I’m not even editing the person in the photo, and it was great for changing or editing the background.

Just a gripe.

69
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Unit2209 on 2024-11-04 15:21:20+00:00.

70
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Designer-Pair5773 on 2024-11-04 14:33:05+00:00.

71
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/pwillia7 on 2024-11-04 12:49:58+00:00.

72
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Tobaka on 2024-11-04 10:34:33+00:00.

73
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-04 08:50:40+00:00.

74
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/lh_zz1119 on 2024-11-04 07:39:24+00:00.

75
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Gedogfx on 2024-11-04 05:16:27+00:00.

view more: ‹ prev next ›