StableDiffusion


/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

101
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-13 09:22:22+00:00.


  • Open-source of Qwen2-VL (VLM) coming soon (GITHUB) via NielsRogge on X
  • FineVideo: 66M words across 43K videos spanning 3.4K hours - CC-BY licensed video understanding dataset. It enables advanced video understanding, focusing on mood analysis, storytelling, and media editing in multimodal settings (HUGGING FACE)
  • Fluxgym Update: automatically generates sample images during training; use ANY resolution, not just 512 or 1024 (for example 712, etc.) via cocktailpeanut on X (creator)
  • Fish Speech 1.4: text to speech model trained on 700K hours of speech, multilingual (8 languages); voice cloning; low latency; ~1GB model weights (OPEN WEIGHTS) (HUGGING FACE SPACES)
  • Out of Focus v1.0: uses diffusion inversion for prompt-based image manipulation using Gradio UI, requires a high-end GPU for optimal performance (GITHUB)
  • Google NotebookLM launches "Audio Overview" feature: can turn any document into a podcast conversation. Once you upload the document and hit the generate button, two AI moderators will kick off a conversation-like discussion, diving deep into the main takeaways from the document (LINK)
  • Video Model is coming to Adobe Firefly via icreatelife on X
  • Midjourney is pioneering a new 3D exploration format for images, led by Alex Evans, innovator behind Dreams' graphics via MartinNebelong on X
  • FBRC & AWS present Culver Cup GenAI film competition at LA Tech Week via me :) on X
  • Coming soon: Vchitect 2.0 - A new text-to-video and Image-to-video model.
  • UVR5 UI: Ultimate Vocal Remover with Gradio UI (GITHUB)
  • Vidu AI Update: new "Reference to Video" feature, you can now apply consistency to anything—whether real or fictional (LINK)
  • Vchitect 2.0: new image2video/text2video model soon (LINK)
  • and slightly unrelated, but special mention: 🍓!

Wednesday's updates - link

Last week's updates - link

102
 
 
The original was posted on /r/stablediffusion by /u/hackerzcity on 2024-09-13 00:40:59+00:00.


You can now create your own LoRAs using FluxGym, which is easy to install either with a one-click installer or manually.

This step-by-step guide covers installation, configuration, and training your own LoRA models with ease. Learn to generate and fine-tune images with advanced prompts, perfect for personal or professional use in ComfyUI. Create your own AI-powered artwork today!

Just follow the steps to create your own LoRAs. Best of luck!

103
 
 
The original was posted on /r/stablediffusion by /u/DiienOfficial on 2024-09-13 06:43:10+00:00.

104
 
 
The original was posted on /r/stablediffusion by /u/deadlyorobot on 2024-09-13 04:51:32+00:00.

105
 
 
The original was posted on /r/stablediffusion by /u/hudsonreaders on 2024-09-13 03:23:38+00:00.

106
 
 
The original was posted on /r/stablediffusion by /u/theroom_ai on 2024-09-12 18:11:57+00:00.

107
 
 
The original was posted on /r/stablediffusion by /u/ChristinaTreasure on 2024-09-12 16:32:54+00:00.

108
 
 
The original was posted on /r/stablediffusion by /u/phr00t_ on 2024-09-12 23:57:28+00:00.


A commit yesterday to the CogVideo repo added Image2Video support!

Merge pull request #272 from THUDM/CogVideoX_dev · THUDM/CogVideo@87ad61b (github.com)

I added a feature request on the ComfyUI wrapper:

Image2Video Support (CogVideo recent update) · Issue #54 · kijai/ComfyUI-CogVideoXWrapper (github.com)

EDIT: This isn't Image2Video yet, it is work towards supporting Image2Video. The developer said it will be released within the month:

hope for image to video · Issue #270 · THUDM/CogVideo (github.com)

109
 
 
The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-12 21:40:22+00:00.

110
 
 
The original was posted on /r/stablediffusion by /u/Z3ROCOOL22 on 2024-09-12 21:18:04+00:00.


111
 
 
The original was posted on /r/stablediffusion by /u/CyberEcho777 on 2024-09-12 22:23:33+00:00.

112
 
 
The original was posted on /r/stablediffusion by /u/EndlessSeaofStars on 2024-09-12 16:25:30+00:00.


I know a lot of people here poke gentle fun at r/restofthefuckingowl, but Flux actually did a decent job of it :)

numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch

Steps: 24, Sampler: Euler, Schedule type: Beta, CFG scale: 1, Distilled CFG Scale: 3.5, Seed: 337531687, Size: 832x1280, Model hash: 275ef623d3, Model: flux1-dev-fp8, Template: numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: f2.0.1v1.10.1-previous-528-ge55cde9b, Module 1: ae
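
For anyone who wants to reuse settings like the ones above in a script, here is a minimal sketch of turning that kind of "Key: value, Key: value" line into a dictionary. The function name `parse_infotext` is made up for illustration and is not part of any UI or library. It assumes no value contains a comma followed by a space, which holds for the line above but may not hold for every settings line.

```python
def parse_infotext(line: str) -> dict[str, str]:
    """Split a 'Key: value, Key: value' settings line into a dict.

    Hypothetical helper: assumes values never contain ', ' themselves.
    """
    params = {}
    for field in line.split(", "):
        key, sep, value = field.partition(": ")
        if sep:  # ignore fragments that are not in 'Key: value' form
            params[key.strip()] = value.strip()
    return params


# A subset of the settings quoted in the post above.
settings = parse_infotext(
    "Steps: 24, Sampler: Euler, Schedule type: Beta, CFG scale: 1, "
    "Distilled CFG Scale: 3.5, Seed: 337531687, Size: 832x1280, "
    "Model: flux1-dev-fp8"
)

# 'Size' is stored as 'WIDTHxHEIGHT', so split it into two integers.
width, height = (int(n) for n in settings["Size"].split("x"))
print(settings["Steps"], settings["Sampler"], width, height)
```

All values stay as strings, so numeric fields such as the seed or step count need converting with `int()` or `float()` before use.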

113
 
 
The original was posted on /r/stablediffusion by /u/Patient-Librarian-33 on 2024-09-12 16:09:11+00:00.

114
 
 
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-12 17:54:33+00:00.

115
 
 
The original was posted on /r/stablediffusion by /u/seekingforwhat on 2024-09-12 17:45:53+00:00.


PuLID-FLUX provides a tuning-free ID customization solution for the FLUX.1-dev model.

github link:

description about the model:

visual results:

Showcase of PuLID-FLUX

116
 
 
The original was posted on /r/stablediffusion by /u/RepresentativeJob937 on 2024-09-12 15:04:56+00:00.


Code:

Writeup:

117
 
 
The original was posted on /r/stablediffusion by /u/MooseBoys on 2024-09-12 07:47:07+00:00.

118
 
 
The original was posted on /r/stablediffusion by /u/Howlesh on 2024-09-12 09:14:20+00:00.

119
 
 
The original was posted on /r/stablediffusion by /u/jonbristow on 2024-09-12 07:29:52+00:00.


100% of the top posts are about flux now

120
 
 
The original was posted on /r/stablediffusion by /u/Eveune on 2024-09-12 01:31:28+00:00.

121
 
 
The original was posted on /r/stablediffusion by /u/shootthesound on 2024-09-11 20:56:45+00:00.

122
123
 
 
The original was posted on /r/stablediffusion by /u/terra-incognita68 on 2024-09-11 21:53:28+00:00.

124
 
 
The original was posted on /r/stablediffusion by /u/MonoNova on 2024-09-11 20:07:43+00:00.

125
 
 
The original was posted on /r/stablediffusion by /u/CrasHthe2nd on 2024-09-11 14:21:30+00:00.
