StableDiffusion

101

1

Friday update for stable diffusion 🥳 - all the major relevant ai tools in a nut shell (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-13 09:22:22+00:00.

Open-source of Qwen2-VL (VLM) coming soon (GITHUB) via NielsRogge on X
FineVideo: 66M words across 43K videos spanning 3.4K hours - CC-BY licensed video understanding dataset. It enables advanced video understanding, focusing on mood analysis, storytelling, and media editing in multimodal settings (HUGGING FACE)
Fluxgym Update: automatically generates sample images during training; use ANY resolution, not just 512 or 1024 (for example 712, etc.) via cocktailpeanut on X (creator)
Fish Speech 1.4: text to speech model trained on 700K hours of speech, multilingual (8 languages); voice cloning; low latency; ~1GB model weights (OPEN WEIGHTS) (HUGGING FACE SPACES)
Out of Focus v1.0: uses diffusion inversion for prompt-based image manipulation using Gradio UI, requires a high-end GPU for optimal performance (GITHUB)
Google NotebookLM launches "Audio Overview" feature: can turn any document into a podcast conversation. Once you upload the document and hit the generate button, two AI moderators will kick off a conversation-like discussion, diving deep into the main takeaways from the document (LINK)
Video Model is coming to Adobe Firefly via icreatelife on X
Midjourney is pioneering a new 3D exploration format for images, led by Alex Evans, innovator behind Dreams' graphics via MartinNebelong on X
FBRC & AWS present Culver Cup GenAI film competition at LA Tech Week via me :) on X
Coming soon: Vchitect 2.0 - A new text-to-video and Image-to-video model.
UVR5 UI: Ultimate Vocal Remover with Gradio UI (GITHUB)
Vidu AI Update: new "Reference to Video" feature, you can now apply consistency to anything—whether real or fictional (LINK)
Vchitect 2.0: new image2video/text2video model soon (LINK)
and slightly unrelated, but special mention: 🍓!

Wednesday's updates - link

Last week's updates - link

102

1

Now With help of FluxGym You can create your Own LoRAs (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/hackerzcity on 2024-09-13 00:40:59+00:00.

Now you Can Create a Own LoRAs using FluxGym that is very easy to install you can do it by one click installation and manually

This step-by-step guide covers installation, configuration, and training your own LoRA models with ease. Learn to generate and fine-tune images with advanced prompts, perfect for personal or professional use in ComfyUI. Create your own AI-powered artwork today!

You just have to follow Step to create Own LoRs so best of Luck

103

1

FLUX generated people always look the same (i.redd.it)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/DiienOfficial on 2024-09-13 06:43:10+00:00.

104

1

No polyamory allowed (i.redd.it)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/deadlyorobot on 2024-09-13 04:51:32+00:00.

105

1

Not going back to this grocery store (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/hudsonreaders on 2024-09-13 03:23:38+00:00.

106

1

Ps1 - Art Cover (Flux) Lora (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/theroom_ai on 2024-09-12 18:11:57+00:00.

107

1

My first attempt at a fantasy character (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ChristinaTreasure on 2024-09-12 16:32:54+00:00.

108

1

CogVideoX: Image2Video support! (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/phr00t_ on 2024-09-12 23:57:28+00:00.

A commit yesterday to the CogVideo repo added Image2Video support!

Merge pull request #272 from THUDM/CogVideoX_dev · THUDM/CogVideo@87ad61b (github.com)

I added a feature request on the ComfyUI wrapper:

Image2Video Support (CogVideo recent update) · Issue #54 · kijai/ComfyUI-CogVideoXWrapper (github.com)

EDIT: This isn't Image2Video yet, it is work towards supporting Image2Video. The developer said it will be released within the month:

hope for image to video · Issue #270 · THUDM/CogVideo (github.com)

109

1

Double IPAdapter with FaceSwap Workflow (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-12 21:40:22+00:00.

110

1

FLUX.1-dev-Controlnet-Inpainting-Alpha (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Z3ROCOOL22 on 2024-09-12 21:18:04+00:00.

111

1

AI 10 years ago: (i.redd.it)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CyberEcho777 on 2024-09-12 22:23:33+00:00.

112

1

Flux made the rest of the F***ing owl (old.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink

This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/EndlessSeaofStars on 2024-09-12 16:25:30+00:00.

I know a lot of people here poke gentle fun of r/restofthefuckingowl but Flux actually did a decent job of it :)

numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch

Steps: 24, Sampler: Euler, Schedule type: Beta, CFG scale: 1, Distilled CFG Scale: 3.5, Seed: 337531687, Size: 832x1280, Model hash: 275ef623d3, Model: flux1-dev-fp8, Template: numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: f2.0.1v1.10.1-previous-528-ge55cde9b, Module 1: ae

113

1

Interesting attention guidance experiments (www.reddit.com)

submitted 6 days ago by bot@lemmit.online to c/stablediffusion@lemmit.online

0 comments fedilink