If you are experienced and familiar with IP Adapter, you are unlikely to find anything new. However, I do have a 'quality of life' contribution in the form of a 12 GB archive.

A1111:

📦stable-diffusion-webui
 ┣ 📂extensions
 ┃ ┗ 📂sd-webui-controlnet
 ┃ ┃ ┗ 📂annotator
 ┃ ┃ ┃ ┗ 📂downloads
 ┃ ┃ ┃ ┃ ┗ 📂clip_vision
 ┃ ┃ ┃ ┃ ┃ ┣ 📜clip_g.pth
 ┃ ┃ ┃ ┃ ┃ ┗ 📜clip_h.pth
 ┗ 📂models
 ┃ ┣ 📂ControlNet
 ┃ ┃ ┣ 📂SD1.5
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sd15.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid_sd15.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-full-face_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus-face_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sd15_light_v11.bin
 ┃ ┃ ┃ ┗ 📜ip-adapter_sd15_vit-G.safetensors
 ┃ ┃ ┗ 📂SDXL
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sdxl.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid_sdxl.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus-face_sdxl_vit-h.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus_sdxl_vit-h.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sdxl.safetensors
 ┃ ┃ ┃ ┗ 📜ip-adapter_sdxl_vit-h.safetensors
 ┃ ┗ 📂Lora
 ┃ ┃ ┣ 📂SD1.5
 ┃ ┃ ┃ ┗ 📂faceid
 ┃ ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sd15_lora.safetensors
 ┃ ┃ ┃ ┃ ┗ 📜ip-adapter-faceid_sd15_lora.safetensors
 ┃ ┃ ┗ 📂SDXL
 ┃ ┃ ┃ ┗ 📂faceid
 ┃ ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sdxl_lora.safetensors
 ┃ ┃ ┃ ┃ ┗ 📜ip-adapter-faceid_sdxl_lora.safetensors

ComfyUI:

📦ComfyUI
 ┗ 📂models
 ┃ ┣ 📂clip_vision
 ┃ ┃ ┣ 📜CLIP-ViT-bigG-14-laion2B-39B-b160k.safetensors
 ┃ ┃ ┗ 📜CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors
 ┃ ┣ 📂ipadapter
 ┃ ┃ ┣ 📂SD1.5
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sd15.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid_sd15.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-full-face_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus-face_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sd15.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sd15_light_v11.bin
 ┃ ┃ ┃ ┗ 📜ip-adapter_sd15_vit-G.safetensors
 ┃ ┃ ┗ 📂SDXL
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sdxl.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-faceid_sdxl.bin
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus-face_sdxl_vit-h.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter-plus_sdxl_vit-h.safetensors
 ┃ ┃ ┃ ┣ 📜ip-adapter_sdxl.safetensors
 ┃ ┃ ┃ ┗ 📜ip-adapter_sdxl_vit-h.safetensors
 ┃ ┗ 📂loras
 ┃ ┃ ┣ 📂SD1.5
 ┃ ┃ ┃ ┗ 📂faceid
 ┃ ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sd15_lora.safetensors
 ┃ ┃ ┃ ┃ ┗ 📜ip-adapter-faceid_sd15_lora.safetensors
 ┃ ┃ ┗ 📂SDXL
 ┃ ┃ ┃ ┗ 📂faceid
 ┃ ┃ ┃ ┃ ┣ 📜ip-adapter-faceid-plusv2_sdxl_lora.safetensors
 ┃ ┃ ┃ ┃ ┗ 📜ip-adapter-faceid_sdxl_lora.safetensors

All LoRAs, IP Adapter models, FaceID models and ClipVision in one download. But, then again, if you are experienced with this, you likely know precisely which model/LoRA/ViT to download.
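
For newcomers who want to double-check that everything landed in the right place, here is a minimal sketch that looks for a handful of the files from the ComfyUI tree above. The function name `find_missing` and the choice of which files to check are my own assumptions for illustration; the relative paths are copied from the tree, and the list is deliberately only a subset.

```python
from pathlib import Path

# A subset of the relative paths from the ComfyUI tree above.
# Extend this list with the remaining files if you want a full check.
EXPECTED_COMFYUI_FILES = [
    "models/clip_vision/CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors",
    "models/clip_vision/CLIP-ViT-bigG-14-laion2B-39B-b160k.safetensors",
    "models/ipadapter/SD1.5/ip-adapter_sd15.safetensors",
    "models/ipadapter/SDXL/ip-adapter_sdxl.safetensors",
    "models/loras/SD1.5/faceid/ip-adapter-faceid_sd15_lora.safetensors",
    "models/loras/SDXL/faceid/ip-adapter-faceid_sdxl_lora.safetensors",
]


def find_missing(root, expected=EXPECTED_COMFYUI_FILES):
    """Return the expected relative paths that are not files under `root`."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]
```

Calling `find_missing("ComfyUI")` from the folder that contains your ComfyUI install returns an empty list when all of the listed files are present. The same idea should work for A1111 if you swap in the paths from the A1111 tree.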

Nonetheless, I do hope this helps the newcomers learning this for the first time in 2024 and onwards. 👍

Bundle link:
Input images:

79

1

FLUX DEV CAN DO ANY STYLE WITH LORA (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Gedogfx on 2024-11-06 17:27:02+00:00.

80

1

What is the best way to get a model from an image? (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/HornyMetalBeing on 2024-11-06 14:21:05+00:00.

81

1

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/descore on 2024-11-06 14:05:09+00:00.

82

1

UltraRealistic LoRa v2 - Flux (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/FortranUA on 2024-11-06 12:31:03+00:00.

83

1

A new IP-Adapter for FLUX is coming soon. (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/haofanw on 2024-11-06 10:37:39+00:00.

84

1

Reference Adapter (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-06 06:29:27+00:00.

85

1

GenXD: Generating Any 3D and 4D Scenes (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/Hybridx21 on 2024-11-05 16:14:02+00:00.

86

1

61 frames (2.5 seconds) Mochi gen on 3060 12GB! (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/jonesaid on 2024-11-06 00:25:18+00:00.

87

1

I rendered almost all tokens from vocab.json (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/Competitive-War-8645 on 2024-11-05 13:06:49+00:00.

As part of my master's thesis on Stable Diffusion and artificial imagery, I rendered almost all tokens from vocab.json. I filtered out duplicates and empty entries and rendered 4 images per token with an SDXL Lightning model. It took a while on my shitty hardware. This experiment is still from the pre-Flux era, so it reflects CLIP's understanding; the biases from T5-XXL could be different.
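
For anyone who wants to try something similar, here is a rough sketch of the filtering step. It is only one plausible reading of "duplicates and empty entries", not the actual script used for the thesis. It assumes vocab.json is a CLIP-style token-to-id mapping where word-final tokens carry a `</w>` suffix, so `cat` and `cat</w>` count as doubles. The function names are made up for illustration.

```python
import json

# Assumption: CLIP-style BPE vocab, where "</w>" marks a token that ends a word.
END_OF_WORD = "</w>"


def unique_tokens(vocab):
    """Collapse a token->id mapping into a sorted list of distinct, non-empty strings."""
    seen = set()
    for token in vocab:
        # Drop the end-of-word marker so "cat" and "cat</w>" become one entry.
        if token.endswith(END_OF_WORD):
            token = token[: -len(END_OF_WORD)]
        text = token.strip()
        if text:  # skip entries that are empty or whitespace only
            seen.add(text)
    return sorted(seen)


def load_unique_tokens(path="vocab.json"):
    """Read a vocab.json file and return its filtered token list."""
    with open(path, encoding="utf-8") as fh:
        return unique_tokens(json.load(fh))
```

Each string in the returned list could then be used as a one-word prompt, rendered a few times with a fixed set of seeds.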

But it might help with prompting a bit, since it is a visual dictionary rather than a guess at what a token does.

The website is also a bit educational, so if you have additions, I can add them on the fly.

Thanks to lostinspaz for inspiring the urge to understand the token space more deeply.

88

1

That feeling when the main google answer to a question is your own post from months ago :-/ (i.redd.it)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/lostinspaz on 2024-11-05 22:18:53+00:00.

89

1

LORA suggestions for images like this? (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/Lost_Artichoke_4909 on 2024-11-05 18:24:26+00:00.

90

1

ComfyUI : PulID - Image Inpainting (i.redd.it)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/Choidonhyeon on 2024-11-05 17:34:43+00:00.

91

1

PromptGen just gets BETTER! v2.0 is here!! (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/miaoshouai on 2024-11-05 16:56:31+00:00.

I’m deeply grateful for the feedback from this community. After much effort, PromptGen v2.0 has officially launched! Here’s what to expect in this new version (if you want to know what PromptGen is, please read this post):

Enhanced image caption quality across all instructions
Better recognition of explicit content
Improved image composition abilities
A new "analyze" mode designed to complement mixed_caption

With the new analyze capability, PromptGen is able to understand more details and image compositions in the picture.

Comparison with analyze on and analyze off

v2.0 understands character positions in the image better

Here are some comparisons between image generation using PromptGen v2.0 and Joy Caption Alpha 2

With v2.0 you still get the same fast speed, and it is the perfect model for batch image captioning.

So please give the new version a try. I'm looking forward to getting your feedback and working more on the model.

Huggingface Page:

Github Page for ComfyUI MiaoshouAI Tagger:

Flux workflow download:

92

1

Official Code and Demo release - ConsiStory: Training-Free Consistent Text-to-Image Generation (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/FrequentTrick9441 on 2024-11-05 16:15:05+00:00.

93

1

I used SDXL on Krita to create detailed maps for RPG, tutorial first comment! (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/Dacrikka on 2024-11-05 17:09:41+00:00.

94

1

Run Mochi natively in Comfy (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/crystal_alpine on 2024-11-05 16:48:00+00:00.

95

1

170 Prompt Comparison: SD3.5 Large VS Turbo VS Medium VS Medium /w SLG VS Flux.1 Dev VS Flux.1 Schnell CENSORED VERSION (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/afinalsin on 2024-11-05 13:11:37+00:00.

Hello /r/stablediffusion, I'm risking a "longer ban" by posting this resource again since the mods clapped my ass with a three-day ban the last time I posted it, so get it while it lasts.

If you've seen any of my other prompt comparisons, or this very same one that got me yeeted last week, you know what this is. These new images can't be directly compared to the old ones because of the sampler/scheduler change with this new generation of models, but the seed is the same.

Instead of multiple prompts over one big image, each prompt is its own image, with the prompt contained on the image itself. I have censored everything I thought might toe the line; I don't want mommy and daddy to punish me again. Here are the galleries:

Prompt 1-20

Prompt 21-40 | Beware *CENSORED* prompt 34 prompt 40

Prompt 41-60 | Beware *CENSORED* prompt 55 prompt 58

Prompt 61-80 | Beware *CENSORED* prompt 65 prompt 67 prompt 69 prompt 80

Prompt 81-100 | Beware *CENSORED* prompt 84 prompt 98 prompt 100

Prompt 101-120 | Beware *CENSORED* prompt 111

Prompt 121-140

Prompt 141-160 | Beware *CENSORED* prompt 141

Prompt 161-170

An easy way to quickly see the full-quality image on civit is to right-click the image and click "open image in new tab". From there, delete /width=700,original=false from the URL, which forces it to load the full-quality image.
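
If you want to do that for a whole batch of links, the same trick can be sketched in a couple of lines. The function name `full_quality_url` and the example URL are made up for illustration. The segment string is exactly the one quoted above, and other pages might use a different width value.

```python
# The URL segment quoted above; other images may use a different width.
SIZE_SEGMENT = "/width=700,original=false"


def full_quality_url(url, segment=SIZE_SEGMENT):
    """Return `url` with the first occurrence of the resize segment removed."""
    return url.replace(segment, "", 1)


# Hypothetical URL, only to show the shape of the transformation.
example = "https://example.com/abc/width=700,original=false/image.jpeg"
print(full_quality_url(example))
```

URLs that do not contain the segment are returned unchanged, so it is safe to run over a mixed list.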

Settings and stuff in the comments.

96

1

Spectral Analysis - [More info in comments ✨️] (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/uisato on 2024-11-05 11:23:43+00:00.

97

1

Regional Prompting for FLUX is out! (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/AI-freshboy on 2024-11-05 09:37:06+00:00.

check out .

98

1

Tested Hunyuan3D-1, newest SOTA Text-to-3D and Image-to-3D model, thoroughly on Windows, works great and really fast on 24 GB GPUs - tested on RTX 3090 TI (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 13:55:29+00:00.

99

1

This week in SD - all the major developments in a nutshell (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-11-05 11:58:46+00:00.

Major Stories

AI Models Enter Fashion Industry: Fashion brands like Mango are implementing AI-generated models, saving millions while raising questions about the future of human modeling. AI services cost $29/month vs $35/hour for human models.

Open Source Initiative Defines 'Open-Source' AI: OSI sparks debate by establishing strict criteria for what constitutes "open-source" AI, challenging tech giants like Meta over transparency in training data and methodologies.

All New Tools & Updates

Detail-Daemon: ComfyUI plugin for powerful detail enhancement. Features sigma parameter adjustment, compatible with SDXL and SD1.5 models, optimized for Flux outputs.
PixelWave: Community-created Flux model fine-tune offering enhanced aesthetics. 6.7GB GGUF format, trained for 5 weeks on RTX 4090, noted for less "plastic-looking" results.
ComfyUI Image Filters: Comprehensive filter collection with 100x faster blur operations, guided filters, color matching, and new BetterFilmGrain node.
ComfyUI-MochiEdit: Video editing nodes for Genmo Mochi, featuring unsampling and sampling nodes with adjustable guidance parameters.
Oasis: Real-time AI-generated game demonstration with 500M parameter open-source model, currently running on cloud infrastructure.
Blendbox Alpha: Layer-based AI image generation tool with real-time adjustments for lighting, texture, and composition. Currently in internal testing.
Suno Personas: New feature for capturing and replicating specific musical styles and vocal characteristics. Premium feature with first 200 songs free.
SD 3.5 Upscaling Technique: New workflow combining SD 3.5 Large and Medium models with Skip Layer Guidance for enhanced upscaling and detail retention.
ElevenLabs X-to-Voice: Open-source tool converting Twitter profiles to AI voices and avatars in about one minute, deployable on Vercel platform.
BigASP v2: Large-scale SDXL fine-tune trained on 6.7M images, featuring custom quality rating system and improved score tag system.
InvokeAI 5.3: Latest update featuring AI-powered object selection tool based on Meta's SAM, Flux support, and pressure sensitivity tablet support.
SD 3.5 Medium: Stability AI's 2.6B parameter model requiring 9.9GB VRAM, supporting up to 1440x1440 resolution, 4x faster than SD 3.5 Large.
Two-Character Flux Generation: Method for creating consistent AI-generated images of two distinct characters using Flux AI and LoRA, with complete training dataset available.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

100

1

Is it possible to take old game screenshots like this and bring them to life with a realistic model? If so how would I go about doing this? (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

The original was posted on /r/stablediffusion by /u/ZooterTheWooter on 2024-11-05 03:47:05+00:00.
