Thread #108298416
Previous /sdg/ thread : >>108282577

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>mfw Resource news

03/04/2026

>Helios: Real Real-Time Long Video Generation Model
https://pku-yuangroup.github.io/Helios-Page/

>Toward Early Quality Assessment of Text-to-Image Diffusion Models
https://github.com/Guhuary/ProbeSelect

>CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance
https://hanyang-21.github.io/CFG-Ctrl

>SIGMark: Scalable In-Generation Watermark with Blind Extraction for Video Diffusion
https://jeremyzhao1998.github.io/SIGMark-release

>Flimmer: Video LoRA training toolkit for diffusion transformer models
https://github.com/alvdansen/flimmer-trainer

03/03/2026

>Alibaba’s Qwen tech lead steps down after major AI push
https://techcrunch.com/2026/03/03/alibabas-qwen-tech-lead-steps-down-after-major-ai-push

>Adaptive Spectral Feature Forecasting for Diffusion Sampling Acceleration
https://hanjq17.github.io/Spectrum

>Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
https://github.com/showlab/Kiwi-Edit

>Let Your Image Move with Your Motion! -- Implicit Multi-Object Multi-Motion Transfer
https://ethan-li123.github.io/FlexiMMT_page

>Neural Discrimination-Prompted Transformers for Efficient UHD Image Restoration and Enhancement
https://github.com/supersupercong/uhdpromer

>OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
https://openvglab.github.io/OmniLottie

>Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models
https://github.com/X-GenGroup/Flow-Factory

03/02/2026

>Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
https://github.com/Kaiwen-Zhu/MIGM-Shortcut

>Open-sourced a one-click ComfyUI setup for RTX 50-series on Windows
https://github.com/hiroki-abe-58/ComfyUI-Win-Blackwell

>stable-diffusion-webui-codex v0.2.0-alpha
https://github.com/sangoi-exe/stable-diffusion-webui-codex

>ComfyUI SeedVR2 Tiler
https://github.com/BacoHubo/ComfyUI_SeedVR2_Tiler
>>
>mfw Research news

03/04/2026

>BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation
https://arxiv.org/abs/2603.02816

>From "What" to "How": Constrained Reasoning for Autoregressive Image Generation
https://arxiv.org/abs/2603.02712

>TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration
https://arxiv.org/abs/2603.02943

>DREAM: Where Visual Understanding Meets Text-to-Image Generation
https://arxiv.org/abs/2603.02667

>Generative Visual Chain-of-Thought for Image Editing
https://pris-cv.github.io/GVCoT

>SemanticDialect: Semantic-Aware Mixed-Format Quantization for Video Diffusion Transformers
https://arxiv.org/abs/2603.02883

>StepVAR: Structure-Texture Guided Pruning for Visual Autoregressive Models
https://arxiv.org/abs/2603.01757

>Conditioned Activation Transport for T2I Safety Steering
https://arxiv.org/abs/2603.03163

>NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing
https://arxiv.org/abs/2603.02802

>Beyond Language Modeling: An Exploration of Multimodal Pretraining
https://beyond-llms.github.io

>FiDeSR: High-Fidelity and Detail-Preserving One-Step Diffusion Super-Resolution
https://arxiv.org/abs/2603.02692

>Preconditioned Score and Flow Matching
https://arxiv.org/abs/2603.02337

>Kling-MotionControl Technical Report
https://arxiv.org/abs/2603.03160

>Cultural Counterfactuals: Evaluating Cultural Biases in Large Vision-Language Models with Counterfactual Examples
https://arxiv.org/abs/2603.02370

>Semantic Similarity is a Spurious Measure of Comic Understanding: Lessons Learned from Hallucinations in a Benchmarking Experiment
https://arxiv.org/abs/2603.01950

>ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization
https://arxiv.org/abs/2603.02897

>RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolutions
https://arxiv.org/abs/2412.09646
>>
>>
Thanks for baking :)
>>
gn all
>>
>>108298558
gn
>>
>>108298558
gn
>>
>>
>>108298558
Gn anon
>>
>>108298654
>I paid a premium for new RAM but when the box arrived it was filled with quokka??
>>
>>108298672
he was hungry and ate the chips :(
>>
dead general
>>
>>108298680
>*ba-dum-tss*
>>
>>108298680
pushing banana to the max on this one
>>
>banana
he just can't help himself
>>
I'm shocked you guys haven't gotten tired of this shit already
>>
>>108298742
artists don't get bored of art
>>
>tfw when tired
>>
>>108298767
>1girl, portrait
>hit 'generate'
such artists, much wow
>>
>>108298785
you're not an artist, so you'd never be able to understand
>>
>>108298797
many such cases. sad!
>>
>>
>>
>>
say grace :)
https://suno.com/s/xBlXU9lhOP3hBdwM
>>
>>108299232
see ya space cowboy
>>
>>108299222
checked
I was expecting it to sound more russian
maybe the burger theme americanizes everything it touches

>>108299232
gn
>>
i miss schizo anon
>>
>>108300131
Thank you for the page 8 nigbobump schizo-misser-anon
>>
>>
>>108298416
Which model and proompt for OP?
>>
>>108301048
nice

gm anon
>>
>>
>>108300319
It is an automated script, not a person.
>>
>>108301067
z-turbo
>>108301132
thx
>>
>>
>>108301905
>>108301912
Hey nigbo!
Can you be unemployed and lonely in your containment thread? /ldg/ exists because you're an insufferable cripple so feel free to fuck off!
>>
>>
>>108302005
gm
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108303369
ga
>>
>>
>>108298544
Wow, I love the style here, anon. May I ask how you achieved it? Is it a specific artist?
>>
>>
>>
>>108303480
I used this prompt
>"A detailed concept art piece of a futuristic warrior standing in a post-apocalyptic landscape, with towering ruins, distant fires, and a robotic companion by their side."
And changed it a bit to:
>"A detailed concept art piece of a Steampunk redhead female engineer wearing aviator cap and goggles on forehead, she is saluting while holding a jetpack, standing firm on the middle of farm field landscape, with clear blue skies, distant clouds, and a giant zeppelin in the distance"
So I guess a clean "template" version of this prompt would be
>A detailed concept art piece of a <character>, <location> landscape, with <objects in the background>, distant <object in the sky>, <rest (optional)>
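If anyone wants to script it, here is a rough sketch of filling that template in Python. The function name and slot names are made up for illustration; they are not from any UI or library, and the optional tail handling is just one way to do it.

```python
# Hypothetical helper: slot names are illustrative, not tied to any UI or library.
TEMPLATE = (
    "A detailed concept art piece of a {character}, "
    "{location} landscape, with {background}, "
    "distant {sky_object}{rest}"
)


def build_prompt(character, location, background, sky_object, rest=""):
    """Fill the prompt template; `rest` is the optional trailing part."""
    # Append the optional tail after a comma only when it is provided.
    tail = f", {rest}" if rest else ""
    return TEMPLATE.format(
        character=character,
        location=location,
        background=background,
        sky_object=sky_object,
        rest=tail,
    )


prompt = build_prompt(
    character="Steampunk redhead female engineer wearing aviator cap and goggles on forehead",
    location="farm field",
    background="clear blue skies",
    sky_object="clouds",
    rest="and a giant zeppelin in the distance",
)
print(prompt)
```

Swap the slot values and paste the printed string into whatever UI you use.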
>>
>>
>>
>>
>>
>>
>>
>>
>>
