SDForAll

41 readers
1 users here now

We're open again. A subreddit about Stable Diffusion. https://normalcity.life/c/sdforall - If you're on Lemmy or are interested in joining, this...

founded 1 year ago
MODERATORS
1
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/CeFurkan on 2024-11-13 00:55:21+00:00.

Original Title: EasyAnimate Early Testing - It is literally Runway but Open Source and FREE, Text-to-Video, Image-to-Video (both beginning and ending frame), Video-to-Video, Works on 24 GB GPUs on Windows, supports 960px resolution, supports very long videos with Overlap

2
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/CeFurkan on 2024-11-05 13:59:51+00:00.

3
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ArtisMysterium on 2024-11-03 14:08:02+00:00.

4
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ComprehensiveHand515 on 2024-11-03 01:34:43+00:00.

5
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Vegetable_Writer_443 on 2024-10-29 18:06:56+00:00.

6
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/CeFurkan on 2024-10-20 19:33:58+00:00.

Original Title: Official Limp Bizkit song video is fully (entire video) AI and one of my Patreon supporter made it. I am first time seeing an entire AI video for such stuff - I think he used FLUX and even trained to generate images

7
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/uisato on 2024-10-11 14:37:56+00:00.

8
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Vegetable_Writer_443 on 2024-10-08 16:42:50+00:00.

9
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Storybook_Tobi on 2024-10-07 07:56:16+00:00.

10
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Main_Minimum_2390 on 2024-09-29 09:41:04+00:00.

11
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/OkSpot3819 on 2024-09-06 09:09:05+00:00.


  • SKYBOX AI: create 360° worlds with one image ()
  • Text-Guided-Image-Colorization: influence the colorisation of objects in your images using text prompts (uses SDXL and CLIP) (GITHUB)
  • Meta's Sapiens segmentation model is now available on Hugging Faces Spaces (HUGGING FACE DEMO)
  • Anifusion.ai: create comic books using UI via web app ()
  • MiniMax: NEW Chinese text2video model (), they also do free music generation (https://hailuoai.com/music)
  • Viewcrafter: generate high-fidelity novel views from single or sparse input images with accurate camera pose control (GITHUB CODE | HUGGING FACE DEMO)
  • LumaLabsAI released V 6.1 of Dream Machine which now features camera controls
  • RB-Modulation (IP-Adapter alternative by Google): training-free personalization of diffusion models using stochastic optimal control (HUGGING FACE DEMO)
  • New ChatGPT Voices: Fathom, Glimmer, Harp, Maple, Orbit, Rainbow (1, 2 and 3 - not working yet), Reef, Ridge and Vale (X Video Preview)
  • FluxMusic: SOTA open-source text-to-music model (GITHUB | JUPYTER NOTEBOOK | PAPER)
  • P2P-Bridge: remove noise from 3D scans (GITHUB | PAPER)
  • HivisionIDPhoto: uses a set of models and workflows for portrait recognition, image cutout & ID photo generation (HUGGING FACE DEMO | GITHUB)
  • ComfyUI-AdvancedLivePortrait Update (GITHUB)
  • ComfyUI v0.2.0: support for Flux controlnets from Xlab and InstantX; improvement to queue management; node library enhancement; quality of life updates (BLOG POST)
  • A song made by SUNO breaks 100k views on Youtube (LINK)

These will all be covered in the weekly newsletter, check out the most recent issue.

Here are the updates from the previous week:

  • Joy Caption Update: Improved tool for generating natural language captions for images, including NSFW content. Significant speed improvements and ComfyUI integration.
  • FLUX Training Insights: New article suggests FLUX can understand more complex concepts than previously thought. Minimal captions and abstract prompts can lead to better results.
  • Realism Techniques: Tips for generating more realistic images using FLUX, including deliberately lowering image quality in prompts and reducing guidance scale.
  • LoRA Training for Logos: Discussion on training LoRAs of company logos using FLUX, with insights on dataset size and training parameters.

⚓ Links, context, visuals for the section above ⚓

  • FluxForge v0.1: New tool for searching FLUX LoRA models across Civitai and Hugging Face repositories, updated every 2 hours.
  • Juggernaut XI: Enhanced SDXL model with improved prompt adherence and expanded dataset.
  • FLUX.1 ai-toolkit UI on Gradio: User interface for FLUX with drag-and-drop functionality and AI captioning.
  • Kolors Virtual Try-On App UI on Gradio: Demo for virtual clothing try-on application.
  • CogVideoX-5B: Open-weights text-to-video generation model capable of creating 6-second videos.
  • Melyn's 3D Render SDXL LoRA: LoRA model for Stable Diffusion XL trained on personal 3D renders.
  • sd-ppp Photoshop Extension: Brings regional prompt support for ComfyUI to Photoshop.
  • GenWarp: AI model that generates new viewpoints of a scene from a single input image.
  • Flux Latent Detailer Workflow: Experimental ComfyUI workflow for enhancing fine details in images using latent interpolation.

⚓ Links, context, visuals for the section above ⚓

12
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/rupertavery on 2024-08-26 01:47:40+00:00.

13
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/OkSpot3819 on 2024-09-05 08:45:11+00:00.


  • MiniMax: NEW Chinese text2video model (), they also do free music generation (https://hailuoai.com/music)
  • LumaLabsAI released V 6.1 of Dream Machine which now features camera controls
  • RB-Modulation (IP-Adapter alternative by Google): training-free personalization of diffusion models using stochastic optimal control (HUGGING FACE DEMO)
  • New ChatGPT Voices: Fathom, Glimmer, Harp, Maple, Orbit, Rainbow (1, 2 and 3 - not working yet), Reef, Ridge and Vale (X Video Preview)
  • Text-Guided-Image-Colorization: influence the colorisation of objects in your images using text prompts (uses SDXL and CLIP) (GITHUB)
  • Meta's Sapiens segmentation model is now available on Hugging Faces Spaces (HUGGING FACE DEMO)
  • FluxMusic: SOTA open-source text-to-music model (GITHUB | JUPYTER NOTEBOOK | PAPER)
  • SKYBOX AI: create 360° worlds with one image ()
  • P2P-Bridge: remove noise from 3D scans (GITHUB | PAPER)
  • HivisionIDPhoto: uses a set of models and workflows for portrait recognition, image cutout & ID photo generation (HUGGING FACE DEMO | GITHUB)
  • Anifusion.ai: create comic books using UI via web app ()
  • ComfyUI-AdvancedLivePortrait Update (GITHUB)
  • ComfyUI v0.2.0: support for Flux controlnets from Xlab and InstantX; improvement to queue management; node library enhancement; quality of life updates (BLOG POST)
  • A song made by SUNO breaks 100k views on Youtube (LINK)

These will all be covered in the weekly newsletter, check out the most recent issue.

Here are the updates from the previous week:

  • Joy Caption Update: Improved tool for generating natural language captions for images, including NSFW content. Significant speed improvements and ComfyUI integration.
  • FLUX Training Insights: New article suggests FLUX can understand more complex concepts than previously thought. Minimal captions and abstract prompts can lead to better results.
  • Realism Techniques: Tips for generating more realistic images using FLUX, including deliberately lowering image quality in prompts and reducing guidance scale.
  • LoRA Training for Logos: Discussion on training LoRAs of company logos using FLUX, with insights on dataset size and training parameters.

⚓ Links, context, visuals for the section above ⚓

  • FluxForge v0.1: New tool for searching FLUX LoRA models across Civitai and Hugging Face repositories, updated every 2 hours.
  • Juggernaut XI: Enhanced SDXL model with improved prompt adherence and expanded dataset.
  • FLUX.1 ai-toolkit UI on Gradio: User interface for FLUX with drag-and-drop functionality and AI captioning.
  • Kolors Virtual Try-On App UI on Gradio: Demo for virtual clothing try-on application.
  • CogVideoX-5B: Open-weights text-to-video generation model capable of creating 6-second videos.
  • Melyn's 3D Render SDXL LoRA: LoRA model for Stable Diffusion XL trained on personal 3D renders.
  • sd-ppp Photoshop Extension: Brings regional prompt support for ComfyUI to Photoshop.
  • GenWarp: AI model that generates new viewpoints of a scene from a single input image.
  • Flux Latent Detailer Workflow: Experimental ComfyUI workflow for enhancing fine details in images using latent interpolation.

⚓ Links, context, visuals for the section above ⚓

Want updates emailed to you weekly? Subscribe.

14
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/OkSpot3819 on 2024-09-03 09:32:20+00:00.


Hey! 👋 Here are this week's roundup of the latest developments in FLUX, Stable Diffusion, and the broader AI art world.

Click here to read the full article with proper formatting, links, visuals, etc.

🛠️ FLUX: Latest in Realism, LoRAs, and General Updates

FLUX continues to evolve rapidly, with several key developments this week:

  • Joy Caption update: Faster processing (2.5s per image on 3090 GPU)
  • New insights on FLUX training: Minimal captions often lead to better results
  • Realism techniques: Using "low quality" prompts for more natural looks
  • LoRA training: Success with small datasets (< 15 images) for company logos

Full version.

🏛️ California's AI Image Ban: A Potential Game-Changer

California has proposed a new bill (AB 3211) that could dramatically reshape AI-generated imagery:

  • Requires robust, hard-to-remove watermarking for AI-generated images
  • May effectively ban most existing AI image generation tools in California
  • Supported by major tech companies, raising concerns about regulatory capture
  • Significant controversy over technological feasibility and potential impact on innovation

Full version.

📚 Generative AI: A Quick Refresher

For those new to the field or seeking an update:

  • Generative AI creates original content (text, images, video, audio)
  • Works on prediction principles using large language models or GANs
  • Wide-ranging applications from writing assistance to visual content creation
  • Presents risks including job displacement, misinformation, and ethical concerns

Full version.

📡 On Our Radar: Exciting New Tools and Techniques

We're also tracking some emerging tools that could reshape your AI art workflow:

  • Juggernaut XI: Enhanced SDXL model with improved prompt adherence
  • FLUX.1 ai-toolkit UI on Gradio: Simplifies image captioning and processing
  • Kolors Virtual Try-On App: Test clothing styles virtually
  • CogVideoX-5B: New open-weights text-to-video model
  • Melyn's 3D Render SDXL LoRA: Generate detailed 3D-style renders
  • FluxForge v0.1: Search tool for FLUX LoRA models
  • Regional Prompt Support for ComfyUI in Photoshop: Precise control over AI generation
  • GenWarp: Generate new viewpoints from a single image
  • Flux Latent Detailer Workflow: Enhance fine details while avoiding the "overcooked" look

Full version.

Want updates emailed to you weekly? Subscribe.

15
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Chuka444 on 2024-08-31 15:11:12+00:00.

16
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/MrLunk on 2024-08-21 19:58:20+00:00.


(Workflow and links by OpenArt user: CgTopTips)

Workflow + info link:

ENJOY !

NeuraLunk

17
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/cgpixel23 on 2024-08-17 21:05:24+00:00.

18
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ArtisMysterium on 2024-08-17 19:38:25+00:00.

19
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/MrLunk on 2024-08-16 09:28:48+00:00.


A better working version of this workflow:

NEW V.2 workflow:

This workflow takes in 2 images and then Florence2 generates prompts from those images.

These prompts are then averaged by 'ConditioningAvarge' node.

(set the 'ConditioningAvarge' node between 0.45 and .55 to make the result more like one or the other image.

This version is different from the previous one because it doesn't work on an 'Empty Latent' but instead also combines the Latents encoded from the input images witch gives a lot better results then the previous version.

The images used for the previews for this workflow are attached on the right side of this workflow page as 'Assets'.

~I like feedback and comments ! So please leave them below ! :)~

ENJOY !

#NeuraLunk

20
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/MrLunk on 2024-08-15 15:54:23+00:00.


This stuff is just so much fun to play with...

Input 1

Input 2

Result :)

Input 2 images and Florence2 will create prompts from them.

Then the workflow will generate a new image based on a conditioning-avarage of the two generated prompts.

Raise or lower the Conditioning avarage from 0.45 to 0.55 to get a result more shifted towards one of the florence2 generated prompt inputs.

This stuff is just so much fun to play with... Input 2 images and Florence2 will create prompts from them. Then the workflow will generate a new image based on a conditioning-avarage of the two generated prompts. Raise or lower the Conditioning avarage from 0.45 to 0.55 to get a result more shifted towards one of the florence2 generated prompt inputs. Have fun playing with this ! Greetz, #NeuraLunk

Workflow link:

Have fun playing with this !

Greetz, #NeuraLunk

21
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ArtisMysterium on 2024-08-10 20:00:35+00:00.

22
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ArtisMysterium on 2024-08-04 16:01:30+00:00.

23
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/Amoxletsne on 2024-07-29 17:21:43+00:00.

24
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/CAMPFIREAI on 2024-07-27 14:20:27+00:00.

25
 
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/sdforall by /u/ArtisMysterium on 2024-07-07 19:24:50+00:00.

view more: next ›