SDForAll

submitted 2 months ago by bot@lemmit.online to c/sdforall@lemmit.online

The original was posted on /r/sdforall by /u/OkSpot3819 on 2024-09-05 08:45:11+00:00.

MiniMax: new Chinese text-to-video model; they also offer free music generation (https://hailuoai.com/music)
LumaLabsAI released v1.6 of Dream Machine, which now features camera controls
RB-Modulation (IP-Adapter alternative by Google): training-free personalization of diffusion models using stochastic optimal control (HUGGING FACE DEMO)
New ChatGPT Voices: Fathom, Glimmer, Harp, Maple, Orbit, Rainbow (1, 2 and 3 - not working yet), Reef, Ridge and Vale (X Video Preview)
Text-Guided-Image-Colorization: influence the colorisation of objects in your images using text prompts (uses SDXL and CLIP) (GITHUB)
Meta's Sapiens segmentation model is now available on Hugging Face Spaces (HUGGING FACE DEMO)
FluxMusic: SOTA open-source text-to-music model (GITHUB | JUPYTER NOTEBOOK | PAPER)
SKYBOX AI: create 360° worlds with one image
P2P-Bridge: remove noise from 3D scans (GITHUB | PAPER)
HivisionIDPhoto: uses a set of models and workflows for portrait recognition, image cutout & ID photo generation (HUGGING FACE DEMO | GITHUB)
Anifusion.ai: create comic books through a web app UI
ComfyUI-AdvancedLivePortrait Update (GITHUB)
ComfyUI v0.2.0: support for Flux controlnets from Xlab and InstantX; improvement to queue management; node library enhancement; quality of life updates (BLOG POST)
A song made with Suno breaks 100k views on YouTube (LINK)

These will all be covered in the weekly newsletter. Check out the most recent issue.

Here are the updates from the previous week:

Joy Caption Update: Improved tool for generating natural language captions for images, including NSFW content. Significant speed improvements and ComfyUI integration.
FLUX Training Insights: New article suggests FLUX can understand more complex concepts than previously thought. Minimal captions and abstract prompts can lead to better results.
Realism Techniques: Tips for generating more realistic images using FLUX, including deliberately lowering image quality in prompts and reducing guidance scale.
LoRA Training for Logos: Discussion on training LoRAs of company logos using FLUX, with insights on dataset size and training parameters.

FluxForge v0.1: New tool for searching FLUX LoRA models across Civitai and Hugging Face repositories, updated every 2 hours.
Juggernaut XI: Enhanced SDXL model with improved prompt adherence and expanded dataset.
FLUX.1 ai-toolkit UI on Gradio: User interface for FLUX with drag-and-drop functionality and AI captioning.
Kolors Virtual Try-On App UI on Gradio: Demo for virtual clothing try-on application.
CogVideoX-5B: Open-weights text-to-video generation model capable of creating 6-second videos.
Melyn's 3D Render SDXL LoRA: LoRA model for Stable Diffusion XL trained on personal 3D renders.
sd-ppp Photoshop Extension: Brings regional prompt support for ComfyUI to Photoshop.
GenWarp: AI model that generates new viewpoints of a scene from a single input image.
Flux Latent Detailer Workflow: Experimental ComfyUI workflow for enhancing fine details in images using latent interpolation.
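
For anyone wondering what "latent interpolation" means in the last item: the rough idea is blending two latent tensors before decoding. Here is a minimal sketch in plain Python, assuming simple linear interpolation on flattened latents. The function name `lerp_latents` and the toy values are made up for illustration; the actual workflow may use a different scheme (spherical interpolation, for example) and runs as ComfyUI nodes, not as code like this.

```python
def lerp_latents(a, b, t):
    """Linearly interpolate between two equal-length latent vectors.

    t = 0.0 returns a, t = 1.0 returns b, values in between blend the two.
    Hypothetical helper for illustration only; not taken from the workflow.
    """
    if len(a) != len(b):
        raise ValueError("latents must have the same length")
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must be in [0, 1]")
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


# Toy stand-ins for a base latent and a more detailed latent.
base = [0.0, 1.0, -2.0]
detail = [1.0, 3.0, 2.0]

# Pull the base latent a quarter of the way toward the detailed one.
blended = lerp_latents(base, detail, 0.25)
print(blended)
```

A small `t` keeps the overall composition of the base latent while nudging it toward the second one, which is the general intuition behind using interpolation for detail enhancement.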