StableDiffusion


/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
101
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/haofanw on 2024-11-06 10:37:39+00:00.

102
 
 
The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-06 06:29:27+00:00.

103
 
 
The original was posted on /r/stablediffusion by /u/Hybridx21 on 2024-11-05 16:14:02+00:00.

104
 
 
The original was posted on /r/stablediffusion by /u/jonesaid on 2024-11-06 00:25:18+00:00.

105
 
 
The original was posted on /r/stablediffusion by /u/Competitive-War-8645 on 2024-11-05 13:06:49+00:00.


As part of my master's thesis on Stable Diffusion and artificial imagery, I rendered almost all tokens from vocab.json. I filtered out duplicates and empty spaces and rendered 4 images per token with an SDXL Lightning model. It took a while on my shitty hardware. This experiment is still from the pre-Flux era, so it reflects CLIP's understanding of the tokens; the biases of T5-XXL could be different.

But it might help with prompting a bit, since it is a visual dictionary rather than just guessing what a token does.

The website is also a bit educational, so if you have additions, I can add them on the fly.

Thanks to lostinspaz for the inspiration and for sparking the urge to understand the token space more deeply.
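For anyone who wants to try something similar, here is a minimal sketch of the filtering step described above. It is not the script used for the thesis: the sample entries are made up, the function names are hypothetical, and treating "cat" and "cat</w>" as the same token is an assumption about what counts as a duplicate. The only fact relied on is that CLIP's vocab.json marks end-of-word tokens with a `</w>` suffix.

```python
import json


def unique_tokens(vocab):
    """Return cleaned, de-duplicated token strings from a CLIP-style vocab mapping."""
    seen = set()
    tokens = []
    for raw in vocab:
        # CLIP's BPE vocab marks end-of-word tokens with a "</w>" suffix.
        # Assumption: a token with and without the suffix counts as a duplicate.
        text = raw.removesuffix("</w>").strip()
        if not text or text in seen:
            continue  # skip whitespace-only entries and repeats
        seen.add(text)
        tokens.append(text)
    return tokens


def render_jobs(tokens, images_per_token=4):
    """Pair every token with one seed per image, giving a flat list of render jobs."""
    return [(tok, seed) for tok in tokens for seed in range(images_per_token)]


# Tiny stand-in for the real vocab.json (hypothetical entries).
sample_vocab = json.loads('{"cat</w>": 0, "cat": 1, " ": 2, "dog</w>": 3}')
tokens = unique_tokens(sample_vocab)
jobs = render_jobs(tokens)
print(tokens)
print(len(jobs))
```

Each (token, seed) pair would then be handed to whatever image pipeline you use; that part is left out here because it depends entirely on the setup.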

106
 
 
The original was posted on /r/stablediffusion by /u/lostinspaz on 2024-11-05 22:18:53+00:00.

107
 
 
The original was posted on /r/stablediffusion by /u/Lost_Artichoke_4909 on 2024-11-05 18:24:26+00:00.

108
 
 
The original was posted on /r/stablediffusion by /u/Choidonhyeon on 2024-11-05 17:34:43+00:00.

109
 
 
The original was posted on /r/stablediffusion by /u/miaoshouai on 2024-11-05 16:56:31+00:00.


I’m deeply grateful for the feedback from this community. After much effort, PromptGen v2.0 has officially launched! Here’s what to expect in this new version (if you want to know what PromptGen is, please read this post):

  • Enhanced image caption quality across all instructions
  • Better recognition of explicit content
  • Improved image composition abilities
  • A new "analyze" mode designed to complement mixed_caption

With the new analyze capability, PromptGen can understand more details and image compositions in the picture.

Comparison with analyze on and analyze off

v2.0 has a better understanding of character positions in the image

Here are some comparisons between image generation using PromptGen v2.0 and Joy Caption Alpha 2

With v2.0 you still get the same fast speed, and it is the perfect model for batch image captioning.

So, please give the new version a try, I'm looking forward to getting your feedback and working more on the model.

Huggingface Page:

Github Page for ComfyUI MiaoshouAI Tagger:

Flux workflow download:

110
 
 
The original was posted on /r/stablediffusion by /u/FrequentTrick9441 on 2024-11-05 16:15:05+00:00.

111
 
 
The original was posted on /r/stablediffusion by /u/Dacrikka on 2024-11-05 17:09:41+00:00.

112
 
 
The original was posted on /r/stablediffusion by /u/crystal_alpine on 2024-11-05 16:48:00+00:00.

113
 
 
The original was posted on /r/stablediffusion by /u/afinalsin on 2024-11-05 13:11:37+00:00.


Hello /r/stablediffusion, I'm risking a "longer ban" by posting this resource again since the mods clapped my ass with a three-day ban the last time I posted it, so get it while it lasts.

If you've seen any of my other prompt comparisons, or this very same one that got me yeeted last week, you know what this is. These new images can't be directly compared to the old ones because of the sampler/scheduler change with this new generation of models, but the seed is the same.

Instead of multiple prompts over one big image, each prompt is its own image, with the prompt contained on the image itself. I have censored everything I thought might toe the line, I don't want mommy and daddy to punish me again. Here are the galleries:

Prompt 1-20

Prompt 21-40 | Beware *CENSORED* prompt 34 prompt 40

Prompt 41-60 | Beware *CENSORED* prompt 55 prompt 58

Prompt 61-80 | Beware *CENSORED* prompt 65 prompt 67 prompt 69 prompt 80

Prompt 81-100 | Beware *CENSORED* prompt 84 prompt 98 prompt 100

Prompt 101-120 | Beware *CENSORED* prompt 111

Prompt 121-140

Prompt 141-160 | Beware *CENSORED* prompt 141

Prompt 161-170

An easy way to quickly see the full quality image on Civitai is to right-click the image and click "open image in new tab". From there, delete /width=700,original=false from the URL, which forces it to load the full quality image.
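If you have a lot of image links to go through, the same trick can be scripted. This is just a rough sketch: the example URL is a made-up placeholder, not a real image link, and the function name is hypothetical. It only removes the exact segment mentioned above, so links with a different width value would need that string adjusted.

```python
# The path segment to delete, exactly as described above.
SIZE_SEGMENT = "/width=700,original=false"


def full_quality_url(url):
    """Drop the resizing segment so the full quality image is requested."""
    # Only the first occurrence is removed; URLs without the segment are unchanged.
    return url.replace(SIZE_SEGMENT, "", 1)


# Placeholder URL for illustration only (not a real image link).
example = "https://example.com/abc123/width=700,original=false/image.jpeg"
print(full_quality_url(example))
```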

Settings and stuff in the comments.

114
 
 
The original was posted on /r/stablediffusion by /u/uisato on 2024-11-05 11:23:43+00:00.

115
 
 
The original was posted on /r/stablediffusion by /u/AI-freshboy on 2024-11-05 09:37:06+00:00.


check out .

116
 
 
The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 13:55:29+00:00.

117
 
 
The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-11-05 11:58:46+00:00.


Major Stories

AI Models Enter Fashion Industry: Fashion brands like Mango are implementing AI-generated models, saving millions while raising questions about the future of human modeling. AI services cost $29/month vs $35/hour for human models.

Open Source Initiative Defines 'Open-Source' AI: OSI sparks debate by establishing strict criteria for what constitutes "open-source" AI, challenging tech giants like Meta over transparency in training data and methodologies.

All New Tools & Updates

  • Detail-Daemon: ComfyUI plugin for powerful detail enhancement. Features sigma parameter adjustment, compatible with SDXL and SD1.5 models, optimized for Flux outputs.
  • PixelWave: Community-created Flux model fine-tune offering enhanced aesthetics. 6.7GB GGUF format, trained for 5 weeks on RTX 4090, noted for less "plastic-looking" results.
  • ComfyUI Image Filters: Comprehensive filter collection with 100x faster blur operations, guided filters, color matching, and new BetterFilmGrain node.
  • ComfyUI-MochiEdit: Video editing nodes for Genmo Mochi, featuring unsampling and sampling nodes with adjustable guidance parameters.
  • Oasis: Real-time AI-generated game demonstration with 500M parameter open-source model, currently running on cloud infrastructure.
  • Blendbox Alpha: Layer-based AI image generation tool with real-time adjustments for lighting, texture, and composition. Currently in internal testing.
  • Suno Personas: New feature for capturing and replicating specific musical styles and vocal characteristics. Premium feature with first 200 songs free.
  • SD 3.5 Upscaling Technique: New workflow combining SD 3.5 Large and Medium models with Skip Layer Guidance for enhanced upscaling and detail retention.
  • ElevenLabs X-to-Voice: Open-source tool converting Twitter profiles to AI voices and avatars in about one minute, deployable on Vercel platform.
  • BigASP v2: Large-scale SDXL fine-tune trained on 6.7M images, featuring custom quality rating system and improved score tag system.
  • InvokeAI 5.3: Latest update featuring AI-powered object selection tool based on Meta's SAM, Flux support, and pressure sensitivity tablet support.
  • SD 3.5 Medium: Stability AI's 2.5B parameter model requiring 9.9GB VRAM, supporting up to 1440x1440 resolution, 4x faster than SD 3.5 Large.
  • Two-Character Flux Generation: Method for creating consistent AI-generated images of two distinct characters using Flux AI and LoRA, with complete training dataset available.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

118
 
 
The original was posted on /r/stablediffusion by /u/ZooterTheWooter on 2024-11-05 03:47:05+00:00.

119
 
 
The original was posted on /r/stablediffusion by /u/ExpressWarthog8505 on 2024-11-05 09:14:34+00:00.

120
 
 
The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 08:03:32+00:00.

121
 
 
The original was posted on /r/stablediffusion by /u/advo_k_at on 2024-11-05 05:46:28+00:00.

122
 
 
The original was posted on /r/stablediffusion by /u/decobrz on 2024-11-05 01:57:57+00:00.


I just wanted to say that this is one of the best extensions I've ever used on Automatic1111.

It simply... WORKS. I wanted to share this.

If you feel the same about another extension, please post here.

Thanks to the devs!

(And I say the opposite for whoever made it impossible to edit typos in the title.)

123
 
 
The original was posted on /r/stablediffusion by /u/Familiar-Art-6233 on 2024-11-05 00:25:57+00:00.

124
 
 
The original was posted on /r/stablediffusion by /u/Sensitive-Paper6812 on 2024-11-04 23:10:09+00:00.

125
 
 
The original was posted on /r/stablediffusion by /u/Haghiri75 on 2024-11-04 19:20:49+00:00.


Greetings.

While we're still working on our other models, I personally worked on this FLUX.1 [dev]-based model, which is basically a distilled version of the dev model with a little bit of fine-tuning on Midjourney images.

The model is experimental (and not for production), but the results are still satisfying (at least to my non-artist eyes). To use this model, you may need a big GPU (I'm personally using an A40 or A100), and unfortunately it's not as affordable (in terms of resource usage) as our Dreams model.

You can access the model files here:

And if you're interested in testing the model without getting an expensive cloud GPU, you can use my personal space:

And finally, if anyone can help us make it more accessible for low-VRAM GPUs, please let me know or make a pull request on HF. When it's ready to use on those GPUs, we may consider uploading it to CivitAI as well.

Happy prompting!
