StableDiffusion

99 readers
1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
MODERATORS
76
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Robo420- on 2024-11-06 18:27:22+00:00.

77
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ryanontheinside on 2024-11-06 18:23:51+00:00.

78
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/WingsOfPhoenix on 2024-11-06 11:48:15+00:00.


YouTube video:

Note:

If you are experienced and familiar with IP Adapter, you are unlikely to find anything new. However, I do have a 'quality of life' contribution in the form of a 12 GB archive.

A1111:

๐Ÿ“ฆstable-diffusion-webui
 โ”ฃ ๐Ÿ“‚extensions
 โ”ƒ โ”— ๐Ÿ“‚sd-webui-controlnet
 โ”ƒ โ”ƒ โ”— ๐Ÿ“‚annotator
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚downloads
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚clip_vision
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œclip_g.pth
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œclip_h.pth
 โ”— ๐Ÿ“‚models
 โ”ƒ โ”ฃ ๐Ÿ“‚ControlNet
 โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“‚SD1.5
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sd15.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid_sd15.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-full-face_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus-face_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sd15_light_v11.bin
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter_sd15_vit-G.safetensors
 โ”ƒ โ”ƒ โ”— ๐Ÿ“‚SDXL
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sdxl.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid_sdxl.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus-face_sdxl_vit-h.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus_sdxl_vit-h.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sdxl.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter_sdxl_vit-h.safetensors
 โ”ƒ โ”— ๐Ÿ“‚Lora
 โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“‚SD1.5
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚faceid
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sd15_lora.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter-faceid_sd15_lora.safetensors
 โ”ƒ โ”ƒ โ”— ๐Ÿ“‚SDXL
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚faceid
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sdxl_lora.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter-faceid_sdxl_lora.safetensors

ComfyUI:

๐Ÿ“ฆComfyUI
 โ”— ๐Ÿ“‚models
 โ”ƒ โ”ฃ ๐Ÿ“‚clip_vision
 โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œCLIP-ViT-bigG-14-laion2B-39B-b160k.safetensors
 โ”ƒ โ”ƒ โ”— ๐Ÿ“œCLIP-ViT-H-14-laion2B-s32B-b79K.safetensors
 โ”ƒ โ”ฃ ๐Ÿ“‚ipadapter
 โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“‚SD1.5
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sd15.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid_sd15.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-full-face_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus-face_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sd15.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sd15_light_v11.bin
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter_sd15_vit-G.safetensors
 โ”ƒ โ”ƒ โ”— ๐Ÿ“‚SDXL
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sdxl.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid_sdxl.bin
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus-face_sdxl_vit-h.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-plus_sdxl_vit-h.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter_sdxl.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter_sdxl_vit-h.safetensors
 โ”ƒ โ”— ๐Ÿ“‚loras
 โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“‚SD1.5
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚faceid
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sd15_lora.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter-faceid_sd15_lora.safetensors
 โ”ƒ โ”ƒ โ”— ๐Ÿ“‚SDXL
 โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“‚faceid
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”ฃ ๐Ÿ“œip-adapter-faceid-plusv2_sdxl_lora.safetensors
 โ”ƒ โ”ƒ โ”ƒ โ”ƒ โ”— ๐Ÿ“œip-adapter-faceid_sdxl_lora.safetensors```

All LoRAs, IP Adapter models, FaceID models and ClipVision in one download. But, then again, if you are experienced with this, you likely know precisely which model/LoRA/ViT to download.

Nonetheless, I do hope this helps the newcomers learning this for the first time in 2024 and onwards. ๐Ÿ‘

  • Bundle link:
  • Input images:
79
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Gedogfx on 2024-11-06 17:27:02+00:00.

80
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/HornyMetalBeing on 2024-11-06 14:21:05+00:00.

81
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/descore on 2024-11-06 14:05:09+00:00.

82
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/FortranUA on 2024-11-06 12:31:03+00:00.

83
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/haofanw on 2024-11-06 10:37:39+00:00.

84
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-06 06:29:27+00:00.

85
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Hybridx21 on 2024-11-05 16:14:02+00:00.

86
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jonesaid on 2024-11-06 00:25:18+00:00.

87
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Competitive-War-8645 on 2024-11-05 13:06:49+00:00.


As part of my masterthesis on stable diffusion and artificial imagery I rendered almost all tokens from vocab.json. I filtered doubles and empty spaces and rendered each 4 images per token with an sdxl lightning model. It took a bit on my shitty hardware, and as this particular experiment still is from the preflux era and thus represent also clip understanding the biases from t5xxl could be different.

But it might help prompting a bit, as it is a visual dictionary instead just guessing the token.

The website is also a bit educational, so if you have additions, i can add them on the fly

thanks to lostinspaz for the inspiration of getting the urge for a deeper understanding for the tokenspace.

88
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/lostinspaz on 2024-11-05 22:18:53+00:00.

89
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Lost_Artichoke_4909 on 2024-11-05 18:24:26+00:00.

90
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Choidonhyeon on 2024-11-05 17:34:43+00:00.

91
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/miaoshouai on 2024-11-05 16:56:31+00:00.


Iโ€™m deeply grateful for the feedback from this community. After much effort, PromptGen v2.0 has officially launched! Hereโ€™s what to expect in this new version (If you want to know what PromptGen is, please read it from this post):

  • Enhanced image caption quality across all instructions
  • Better recognition of explicit content
  • Improved image composition abilities
  • A new "analyze" mode designed to complement mixed_caption

With the new analyze capbility, PromptGen is able to understand more details and image composistions in the picture.

compare with analyze on and analyze off

v2.0 understands better on character positions in the image

Here's some comparesons between the image generation using PromptGen v2.0 vs Joy Caption Alpha 2

with V2.0 you still get the same fast speed and it is the prefect model to do image captioning in batch.

So, please give the new version a try, I'm looking forward to getting your feedback and working more on the model.

Huggingface Page:

Github Page for ComfyUI MiaoshouAI Tagger:

Flux workflow download:

92
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/FrequentTrick9441 on 2024-11-05 16:15:05+00:00.

93
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Dacrikka on 2024-11-05 17:09:41+00:00.

94
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/crystal_alpine on 2024-11-05 16:48:00+00:00.

95
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/afinalsin on 2024-11-05 13:11:37+00:00.


Hello /r/stablediffusion, I'm risking a "longer ban" by posting this resource again since the mods clapped my ass with a three day the last time I posted it, so get it while it lasts.

If you've seen any of my other prompt comparisons, or this very same one that got me yeeted last week, you know what this is. These new images can't be directly compared to the old ones because of the sampler/scheduler change with this new generation of models, but the seed is the same.

Instead of multiple prompts over one big image, each prompt is its own image, with the prompt contained on the image itself. I have censored everything I thought might toe the line, I don't want mommy and daddy to punish me again. Here are the galleries:

Prompt 1-20

Prompt 21-40 | Beware *CENSORED* prompt 34 prompt 40

Prompt 41-60 | Beware *CENSORED* prompt 55 prompt 58

Prompt 61-80 | Beware *CENSORED* prompt 65 prompt 67 prompt 69 prompt 80

Prompt 81-100 | Beware *CENSORED* prompt 84 prompt 98 prompt 100

Prompt 101-120 | Beware *CENSORED* prompt 111

Prompt 121-140

Prompt 141-160 | Beware *CENSORED* prompt 141

Prompt 161-170

An easy way to quickly see the full quality image on civit is right click the image and click "open image in new tab". From there, delete /width=700,original=false from the url, which forces it to load the full quality image.

Settings and stuff in the comments.

96
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/uisato on 2024-11-05 11:23:43+00:00.

97
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/AI-freshboy on 2024-11-05 09:37:06+00:00.


check out .

98
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 13:55:29+00:00.

99
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-11-05 11:58:46+00:00.


Major Stories

AI Models Enter Fashion Industry:ย Fashion brands like Mango are implementing AI-generated models, saving millions while raising questions about the future of human modeling. AI services cost $29/month vs $35/hour for human models.

Open Source Initiative Defines 'Open-Source' AI:ย OSI sparks debate by establishing strict criteria for what constitutes "open-source" AI, challenging tech giants like Meta over transparency in training data and methodologies.

All New Tools & Updates

  • Detail-Daemon:ย ComfyUI plugin for powerful detail enhancement. Features sigma parameter adjustment, compatible with SDXL and SD1.5 models, optimized for Flux outputs.
  • PixelWave:ย Community-created Flux model fine-tune offering enhanced aesthetics. 6.7GB GGUF format, trained for 5 weeks on RTX 4090, noted for less "plastic-looking" results.
  • ComfyUI Image Filters:ย Comprehensive filter collection with 100x faster blur operations, guided filters, color matching, and new BetterFilmGrain node.
  • ComfyUI-MochiEdit:ย Video editing nodes for Genmo Mochi, featuring unsampling and sampling nodes with adjustable guidance parameters.
  • Oasis:ย Real-time AI-generated game demonstration with 500M parameter open-source model, currently running on cloud infrastructure.
  • Blendbox Alpha:ย Layer-based AI image generation tool with real-time adjustments for lighting, texture, and composition. Currently in internal testing.
  • Suno Personas:ย New feature for capturing and replicating specific musical styles and vocal characteristics. Premium feature with first 200 songs free.
  • SD 3.5 Upscaling Technique:ย New workflow combining SD 3.5 Large and Medium models with Skip Layer Guidance for enhanced upscaling and detail retention.
  • ElevenLabs X-to-Voice:ย Open-source tool converting Twitter profiles to AI voices and avatars in about one minute, deployable on Vercel platform.
  • BigASP v2:ย Large-scale SDXL fine-tune trained on 6.7M images, featuring custom quality rating system and improved score tag system.
  • InvokeAI 5.3:ย Latest update featuring AI-powered object selection tool based on Meta's SAM, Flux support, and pressure sensitivity tablet support.
  • SD 3.5 Medium:ย Stability AI's 2.6B parameter model requiring 9.9GB VRAM, supporting up to 1440x1440 resolution, 4x faster than SD 3.5 Large.
  • Two-Character Flux Generation:ย Method for creating consistent AI-generated images of two distinct characters using Flux AI and LoRA, with complete training dataset available.

๐Ÿ“ฐ Full newsletter with relevant links, context, and visuals available in the original document.

๐Ÿ”” If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

100
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ZooterTheWooter on 2024-11-05 03:47:05+00:00.

view more: โ€น prev next โ€บ