StableDiffusion


/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

1
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ScarletEnthusiast on 2024-09-18 17:21:57+00:00.

2
 
 
The original was posted on /r/stablediffusion by /u/Old_Reach4779 on 2024-09-18 16:18:06+00:00.


Hugging Face:

Hugging Face Space:

GitHub:

ComfyUI node: (kijai just added an I2V example workflow 😍)

License: Apache-2.0!

3
 
 
The original was posted on /r/stablediffusion by /u/jerrydavos on 2024-09-18 16:16:21+00:00.

4
 
 
The original was posted on /r/stablediffusion by /u/Old_Reach4779 on 2024-09-18 11:55:02+00:00.


source: https://github.com/THUDM/CogVideo/tree/CogVideoX_dev

edit2:

they released it!

edit:

Hugging Face model just released! link

(the GitHub main branch is still not merged)

Today will be a long loooong loooooooooooooooooooooong day!

5
 
 
The original was posted on /r/stablediffusion by /u/stableee on 2024-09-18 11:51:24+00:00.

6
 
 
The original was posted on /r/stablediffusion by /u/hkunzhe on 2024-09-18 11:39:09+00:00.


Alibaba PAI has used the EasyAnimate framework to fine-tune CogVideoX and has open-sourced the result as CogVideoX-Fun, which includes both 5B and 2B models. Compared to the original CogVideoX, we have added I2V and V2V functionality and support for video generation at any resolution from 256x256x49 to 1024x1024x49.

HF Space:

Code:

ComfyUI node:

Models: &

Discord: 

7
 
 
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-18 10:25:17+00:00.

8
 
 
The original was posted on /r/stablediffusion by /u/Kinda-Brazy on 2024-09-18 09:15:50+00:00.

9
 
 
The original was posted on /r/stablediffusion by /u/oodelay on 2024-09-18 01:56:52+00:00.

10
 
 
The original was posted on /r/stablediffusion by /u/jmbirn on 2024-09-17 20:46:27+00:00.

11
 
 
The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-17 20:02:23+00:00.

12
 
 
The original was posted on /r/stablediffusion by /u/tom83_be on 2024-09-17 18:21:14+00:00.


Update: Now runs with about 7 GB VRAM, see bold text on updated settings below!

I posted a guide (basically working settings) for OneTrainer LoRA/DoRA training here. There was a question concerning support for 8 GB VRAM. I tried a few settings and it seems to run at just below 8 GB VRAM. Since I do not own such a card I need people with these cards to validate it (maybe there are spikes that I do not see).

Please do the following:

  • Use the settings provided here:
  • EMA OFF (training tab) => maybe not needed, see update below
  • Rank = 16, Alpha = 16 (LoRA tab)
  • Activate "fused back pass" in the optimizer settings (training tab); this seems to save another 100 MB of VRAM => maybe not needed, see update below
  • Set "LoRA weight data type" (LoRA tab) to bfloat16; this again saves some VRAM => maybe not needed, see update below
  • Update: You can also set "gradient checkpointing" to "CPU_OFFLOADED" in the "training" tab. After that it runs with less than 7 GB VRAM, but a bit slower for me (3.7 s/it vs. 3.4 s/it). Thanks to u/setothegreat for that idea! If you keep EMA enabled, still use float32 as the "LoRA weight data type", and do not activate "fused back pass", it still runs at 7.2 GB VRAM and 3.9 s/it for me. So it might be enough to

It now trains with just below 7.8 to 7.9 GB of VRAM. I would like feedback from 8 GB VRAM users on whether this works.

I also cannot guarantee the quality or success of the training! Let's find out together!

PS: I am using my card for training/AI only; the operating system is using the internal GPU, so all of my VRAM is free. For 8 GB VRAM users this might be crucial to get it to work...
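To put the numbers above side by side, here is a minimal sketch in plain Python that computes VRAM headroom against a nominal 8 GB budget and the slowdown relative to the 3.4 s/it run. The configuration labels, the `summarize` helper, and the 8.0 GB budget are assumptions made for illustration; the VRAM values are the upper bounds quoted in the post ("just below 7.9 GB", "less than 7 GB", "7.2 GB"), not fresh measurements.

```python
# Figures quoted in the post; the VRAM values are upper bounds, not exact peaks.
VRAM_BUDGET_GB = 8.0      # assumption: nominal capacity of an "8 GB" card
BASELINE_S_PER_IT = 3.4   # speed without CPU-offloaded gradient checkpointing

# Hypothetical labels -> (VRAM in GB, seconds per iteration)
configs = {
    "all savings, no CPU offload": (7.9, 3.4),
    "all savings + CPU_OFFLOADED": (7.0, 3.7),
    "CPU_OFFLOADED only (EMA on, float32, no fused back pass)": (7.2, 3.9),
}


def summarize(configs, budget_gb, baseline_s_per_it):
    """Return (name, headroom in GB, slowdown in percent) for each config."""
    rows = []
    for name, (vram_gb, s_per_it) in configs.items():
        headroom_gb = round(budget_gb - vram_gb, 2)
        slowdown_pct = round((s_per_it / baseline_s_per_it - 1) * 100, 1)
        rows.append((name, headroom_gb, slowdown_pct))
    return rows


rows = summarize(configs, VRAM_BUDGET_GB, BASELINE_S_PER_IT)
for name, headroom_gb, slowdown_pct in rows:
    print(f"{name}: {headroom_gb} GB headroom, {slowdown_pct}% slower")
```

On a card that also drives the display, the usable budget is lower than the nominal capacity, so the headroom column is the number to watch.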

13
 
 
The original was posted on /r/stablediffusion by /u/Secure-Message-8378 on 2024-09-18 00:52:17+00:00.

14
 
 
The original was posted on /r/stablediffusion by /u/RalFingerLP on 2024-09-17 21:11:07+00:00.

15
 
 
The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-17 20:26:24+00:00.

16
 
 
The original was posted on /r/stablediffusion by /u/Equal_Couple6552 on 2024-09-17 20:01:11+00:00.

17
 
 
The original was posted on /r/stablediffusion by /u/FugueSegue on 2024-09-17 19:19:17+00:00.


The rule the angry moderator cited was: "Your post/comment has been removed because it contains content created with closed source tools. OP has stated they used Photoshop and Topaz on some elements."

This is the message I just sent to all the moderators of this subreddit:

Why did you delete my post? According to the message I received:

"Your post/comment has been removed because it contains content created with closed source tools. OP has stated they used Photoshop and Topaz on some elements."

THERE IS NO RULE ABOUT THAT. If you're referring to rule #1:

"All posts must be Open-Source / Local AI image generation related. All tools used to create post content must be open source/local AI image generation. Comparisons with other AI generation platforms are accepted."

You're saying I violated that rule?!?!? THAT'S INSANE! Is one of your moderators really THAT vindictive? Almost EVERYONE uses Photoshop or another image processor to get their work done! This includes everything from preparing datasets and inpainting with SD plugins to final presentation. ALL of the work that was done to create that image was done with Stable Diffusion models and LoRAs! I use Photoshop to do my inpainting with ComfyUI! ALMOST ALL WORKING DIGITAL ARTISTS USE PHOTOSHOP! It's a standard tool! I use Topaz whenever I need to enlarge an element that I send through img2img!

Are you really going to be THAT dogmatic about rule #1? Because if you do, then you'll have to delete half the images posted here! You'll have to start a massive, ugly inquisition.

Did it ever occur to you to ASK me about these things? Or to ask whether I used Adobe's generative fill? Because I didn't! Did you consider making even the SLIGHTEST inquiry? Instead of just deleting the post about a painting I worked on? On my cake day, no less.

Do you want generative AI art accepted in the rest of the art world? Because this isn't the way to do it.

18
 
 
The original was posted on /r/stablediffusion by /u/Patient-Librarian-33 on 2024-09-17 17:00:20+00:00.

19
 
 
The original was posted on /r/stablediffusion by /u/EcoPeakPulse on 2024-09-17 18:03:11+00:00.

20
 
 
The original was posted on /r/stablediffusion by /u/Disastrous-Hope-2537 on 2024-09-17 11:32:21+00:00.


Paper:

21
 
 
The original was posted on /r/stablediffusion by /u/TheArchivist314 on 2024-09-17 03:20:07+00:00.

22
 
 
The original was posted on /r/stablediffusion by /u/DustWorlds on 2024-09-17 00:28:12+00:00.

23
 
 
The original was posted on /r/stablediffusion by /u/StelfieTT on 2024-09-17 08:03:44+00:00.

24
 
 
The original was posted on /r/stablediffusion by /u/tom83_be on 2024-09-17 07:37:21+00:00.

25
 
 
The original was posted on /r/stablediffusion by /u/ToastersRock on 2024-09-16 23:54:21+00:00.


Here it is. It is not perfect and does require writing prompts that describe a scene with miniature people. Check some of the sample images for examples.
