A new IP-Adapter for FLUX is coming soon. (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/haofanw on 2024-11-06 10:37:39+00:00.

102

1

Reference Adapter (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-11-06 06:29:27+00:00.

103

1

GenXD: Generating Any 3D and 4D Scenes (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Hybridx21 on 2024-11-05 16:14:02+00:00.

104

1

61 frames (2.5 seconds) Mochi gen on 3060 12GB! (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jonesaid on 2024-11-06 00:25:18+00:00.

105

1

I rendered almost all tokens from vocab.json (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Competitive-War-8645 on 2024-11-05 13:06:49+00:00.

As part of my master's thesis on Stable Diffusion and artificial imagery, I rendered almost all tokens from vocab.json. I filtered out duplicates and empty spaces and rendered 4 images per token with an SDXL Lightning model. It took a while on my shitty hardware. This particular experiment is still from the pre-Flux era, so it reflects CLIP's understanding of the tokens; the biases of T5-XXL could be different.
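The filtering step mentioned above could be sketched roughly as follows. This is only an illustration, not the script used for the thesis: it assumes vocab.json maps token strings to ids and that word-final tokens carry a `</w>` marker, as in CLIP-style BPE vocabularies. The function name `unique_tokens` and the sample data are hypothetical.

```python
import json


def unique_tokens(vocab):
    """Return cleaned, de-duplicated tokens in vocabulary order."""
    seen = set()
    result = []
    for token in vocab:
        # Assumption: "</w>" marks the end of a word and is not part of the prompt text.
        word = token.replace("</w>", "").strip()
        # Skip empty or whitespace-only entries and anything already collected.
        if not word or word in seen:
            continue
        seen.add(word)
        result.append(word)
    return result


# Hypothetical stand-in for the contents of a real vocab.json file.
sample_vocab = json.loads('{"cat</w>": 0, "cat": 1, " ": 2, "dog</w>": 3}')
print(unique_tokens(sample_vocab))  # ['cat', 'dog']
```

Each surviving token would then be used as a one-word prompt for the image model.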

But it might help with prompting a bit, as it is a visual dictionary instead of just guessing the token.

The website is also a bit educational, so if you have additions, I can add them on the fly.

Thanks to lostinspaz for inspiring the urge for a deeper understanding of the token space.

106

1

That feeling when the main google answer to a question is your own post from months ago :-/ (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/lostinspaz on 2024-11-05 22:18:53+00:00.

107

1

LORA suggestions for images like this? (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Lost_Artichoke_4909 on 2024-11-05 18:24:26+00:00.

108

1

ComfyUI : PulID - Image Inpainting (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Choidonhyeon on 2024-11-05 17:34:43+00:00.

109

1

PromptGen just gets BETTER! v2.0 is here!! (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/miaoshouai on 2024-11-05 16:56:31+00:00.

I’m deeply grateful for the feedback from this community. After much effort, PromptGen v2.0 has officially launched! Here’s what to expect in this new version (if you want to know what PromptGen is, please read this post):

Enhanced image caption quality across all instructions
Better recognition of explicit content
Improved image composition abilities
A new "analyze" mode designed to complement mixed_caption

With the new analyze capability, PromptGen is able to understand more details and image compositions in the picture.

Comparison with analyze on and analyze off

v2.0 understands character positions in the image better

Here are some comparisons between image generation using PromptGen v2.0 vs Joy Caption Alpha 2

With v2.0 you still get the same fast speed, and it is the perfect model for image captioning in batch.

So, please give the new version a try, I'm looking forward to getting your feedback and working more on the model.

Huggingface Page:

Github Page for ComfyUI MiaoshouAI Tagger:

Flux workflow download:

110

1

Official Code and Demo release - ConsiStory: Training-Free Consistent Text-to-Image Generation (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/FrequentTrick9441 on 2024-11-05 16:15:05+00:00.

111

1

I used SDXL on Krita to create detailed maps for RPG, tutorial first comment! (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Dacrikka on 2024-11-05 17:09:41+00:00.

112

1

Run Mochi natively in Comfy (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/crystal_alpine on 2024-11-05 16:48:00+00:00.

113

1

170 Prompt Comparison: SD3.5 Large VS Turbo VS Medium VS Medium /w SLG VS Flux.1 Dev VS Flux.1 Schnell CENSORED VERSION (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/afinalsin on 2024-11-05 13:11:37+00:00.

Hello /r/stablediffusion, I'm risking a "longer ban" by posting this resource again since the mods clapped my ass with a three-day ban the last time I posted it, so get it while it lasts.

If you've seen any of my other prompt comparisons, or this very same one that got me yeeted last week, you know what this is. These new images can't be directly compared to the old ones because of the sampler/scheduler change with this new generation of models, but the seed is the same.

Instead of multiple prompts over one big image, each prompt is its own image, with the prompt contained on the image itself. I have censored everything I thought might toe the line, I don't want mommy and daddy to punish me again. Here are the galleries:

Prompt 1-20

Prompt 21-40 | Beware *CENSORED* prompt 34 prompt 40

Prompt 41-60 | Beware *CENSORED* prompt 55 prompt 58

Prompt 61-80 | Beware *CENSORED* prompt 65 prompt 67 prompt 69 prompt 80

Prompt 81-100 | Beware *CENSORED* prompt 84 prompt 98 prompt 100

Prompt 101-120 | Beware *CENSORED* prompt 111

Prompt 121-140

Prompt 141-160 | Beware *CENSORED* prompt 141

Prompt 161-170

An easy way to quickly see the full-quality image on Civitai is to right-click the image and click "open image in new tab". From there, delete /width=700,original=false from the URL, which forces it to load the full-quality image.
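If you have many image links, the same URL edit can be scripted. A minimal sketch, assuming the resize segment appears in the URL exactly as quoted above; the function name `full_quality_url` and the example URL are made-up placeholders, not a real image address.

```python
def full_quality_url(url, segment="/width=700,original=false"):
    """Remove the resize segment so the full-quality image is requested."""
    # Only the first occurrence is removed; URLs without the segment pass through unchanged.
    return url.replace(segment, "", 1)


# Placeholder URL for illustration only.
example = "https://example.invalid/abc123/width=700,original=false/image.jpeg"
print(full_quality_url(example))  # https://example.invalid/abc123/image.jpeg
```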

Settings and stuff in the comments.

114

1

Spectral Analysis - [More info in comments ✨️] (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/uisato on 2024-11-05 11:23:43+00:00.

115

1

Regional Prompting for FLUX is out! (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/AI-freshboy on 2024-11-05 09:37:06+00:00.

check out .

116

1

Tested Hunyuan3D-1, newest SOTA Text-to-3D and Image-to-3D model, thoroughly on Windows, works great and really fast on 24 GB GPUs - tested on RTX 3090 TI (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 13:55:29+00:00.

117

1

This week in SD - all the major developments in a nutshell (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-11-05 11:58:46+00:00.

Major Stories

AI Models Enter Fashion Industry: Fashion brands like Mango are implementing AI-generated models, saving millions while raising questions about the future of human modeling. AI services cost $29/month vs $35/hour for human models.

Open Source Initiative Defines 'Open-Source' AI: OSI sparks debate by establishing strict criteria for what constitutes "open-source" AI, challenging tech giants like Meta over transparency in training data and methodologies.

All New Tools & Updates

Detail-Daemon: ComfyUI plugin for powerful detail enhancement. Features sigma parameter adjustment, compatible with SDXL and SD1.5 models, optimized for Flux outputs.
PixelWave: Community-created Flux model fine-tune offering enhanced aesthetics. 6.7GB GGUF format, trained for 5 weeks on RTX 4090, noted for less "plastic-looking" results.
ComfyUI Image Filters: Comprehensive filter collection with 100x faster blur operations, guided filters, color matching, and new BetterFilmGrain node.
ComfyUI-MochiEdit: Video editing nodes for Genmo Mochi, featuring unsampling and sampling nodes with adjustable guidance parameters.
Oasis: Real-time AI-generated game demonstration with 500M parameter open-source model, currently running on cloud infrastructure.
Blendbox Alpha: Layer-based AI image generation tool with real-time adjustments for lighting, texture, and composition. Currently in internal testing.
Suno Personas: New feature for capturing and replicating specific musical styles and vocal characteristics. Premium feature with first 200 songs free.
SD 3.5 Upscaling Technique: New workflow combining SD 3.5 Large and Medium models with Skip Layer Guidance for enhanced upscaling and detail retention.
ElevenLabs X-to-Voice: Open-source tool converting Twitter profiles to AI voices and avatars in about one minute, deployable on Vercel platform.
BigASP v2: Large-scale SDXL fine-tune trained on 6.7M images, featuring custom quality rating system and improved score tag system.
InvokeAI 5.3: Latest update featuring AI-powered object selection tool based on Meta's SAM, Flux support, and pressure sensitivity tablet support.
SD 3.5 Medium: Stability AI's 2.6B parameter model requiring 9.9GB VRAM, supporting up to 1440x1440 resolution, 4x faster than SD 3.5 Large.
Two-Character Flux Generation: Method for creating consistent AI-generated images of two distinct characters using Flux AI and LoRA, with complete training dataset available.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

118

1

Is it possible to take old game screenshots like this and bring them to life with a realistic model? If so how would I go about doing this? (i.redd.it)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ZooterTheWooter on 2024-11-05 03:47:05+00:00.

119

1

Regional Prompting for Flux (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ExpressWarthog8505 on 2024-11-05 09:14:34+00:00.

120

1

Tencent / Hunyuan3D-1 published with Codes Weights and Gradio app - repo link in oldest comment (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-11-05 08:03:32+00:00.

121

1

Smooth Detailed version of NoobAI v1 (Illustrious fine-tune) (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/advo_k_at on 2024-11-05 05:46:28+00:00.

122

1

Automatic1111 exention appreciation post (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/decobrz on 2024-11-05 01:57:57+00:00.

I just wanted to say that this is one of the best extensions I've ever used on Automatic1111.

It simply.. WORKS. I wanted to share this.

If you feel the same about another extension, please post here.

Thanks to the devs!

(And I say the opposite to whoever made it impossible to edit typos in the title.)

123

1

Just released a LoRA version of my RPG Maps model for Stable Diffusion Large! (www.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Familiar-Art-6233 on 2024-11-05 00:25:57+00:00.

124

1

ComfyCanvas for easy canvas use in ComfyUI (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Sensitive-Paper6812 on 2024-11-04 23:10:09+00:00.

125

1

Mann-E FLUX[Dev] Edition released. You can test it now! (old.reddit.com)

submitted 1 week ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Haghiri75 on 2024-11-04 19:20:49+00:00.

Greetings.

While we're still working on our other models, I personally worked on this FLUX-1[Dev] based model, which is basically a distilled version of the dev model with a little bit of fine-tuning on Midjourney images.

The model is experimental (and not for production), but the results are still satisfying (at least to my non-artist eyes). To use this model, you may need a big GPU (I personally use an A40 or A100), and unfortunately it's not as affordable (in terms of resource usage) as our Dreams model.

Well you can access the model files here:

And if you're interested in testing the model without getting an expensive cloud GPU, you can use my personal space:

And finally, if anyone can help us make it more accessible for low-VRAM GPUs, please let me know or make a pull request on HF. When it's ready to use on those GPUs, we may consider uploading it to CivitAI as well.

Happy prompting!

