Thread #108282577
Previous /sdg/ thread: >>108268244

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
>>
>mfw Resource news

03/01/2026

>Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
https://github.com/Kaiwen-Zhu/MIGM-Shortcut

>Open-sourced a one-click ComfyUI setup for RTX 50-series on Windows
https://github.com/hiroki-abe-58/ComfyUI-Win-Blackwell

>stable-diffusion-webui-codex v0.2.0-alpha
https://github.com/sangoi-exe/stable-diffusion-webui-codex

>ComfyUI SeedVR2 Tiler
https://github.com/BacoHubo/ComfyUI_SeedVR2_Tiler

02/28/2026

>Anima 2B Style Explorer
https://github.com/ThetaCursed/Anima-Style-Explorer

>LoRWeB: Spanning the Visual Analogy Space with a Weight Basis of LoRAs
https://research.nvidia.com/labs/par/lorweb

>Nexa - Your On-the-Go ComfyUI Companion
https://github.com/Arif-salah/Nexa_comfyui

>Z-Image-Turbo Controlnet Union 2.1 version 2602
https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.1

>FL PixelGen: Pixel-space diffusion text-to-image generation and LoRA training nodes
https://github.com/filliptm/ComfyUI-FL-PixelGen

>ComfyUI NKD Sigmas Curve
https://github.com/Nekodificador/ComfyUI-NKD-Sigmas-Curve

>Capable, Open, and Safe: Combating AI Misuse
https://bfl.ai/blog/capable-open-and-safe-combating-ai-misuse

02/27/2026

>WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
https://github.com/Physicsmile/WISER

>Aligning Few-Step Diffusion Models with Dense Reward Difference Learning
https://github.com/ZiyiZhang27/sdpo

>Huihui-Qwen3.5-27B-abliterated
https://huggingface.co/huihui-ai/Huihui-Qwen3.5-27B-abliterated

>ComfyUI Yedp Action Director
https://github.com/yedp123/ComfyUI-Yedp-Action-Director

>Nano Banana 2: Combining Pro capabilities with lightning-fast speed
https://blog.google/innovation-and-ai/technology/ai/nano-banana-2

>FireRed-Image-Edit-1.0 adapted for ComfyUI: general-purpose image editing
https://huggingface.co/FireRedTeam/FireRed-Image-Edit-1.0-ComfyUI
>>
>mfw Research news

03/01/2026

>Mode Seeking meets Mean Seeking for Fast Long Video Generation
https://primecai.github.io/mmm

>SwitchCraft: Training-Free Multi-Event Video Generation with Attention Controls
https://arxiv.org/abs/2602.23956

>Diffusion Probe: Generated Image Result Prediction Using CNN Probes
https://arxiv.org/abs/2602.23783

>SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
https://arxiv.org/abs/2602.24208

>MSVBench: Towards Human-Level Evaluation of Multi-Shot Video Generation
https://arxiv.org/abs/2602.23969

>Enhancing Spatial Understanding in Image Generation via Reward Modeling
https://dagroup-pku.github.io/SpatialT2I

>DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
https://arxiv.org/abs/2602.23438

>Interpretable Debiasing of Vision-Language Models for Social Fairness
https://arxiv.org/abs/2602.24014

>Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping
https://arxiv.org/abs/2602.23980

>A Difference-in-Difference Approach to Detecting AI-Generated Images
https://arxiv.org/abs/2602.23732

>Fixed Anchors Are Not Enough: Dynamic Retrieval and Persistent Homology for Dataset Distillation
https://arxiv.org/abs/2602.24144

>U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation
https://arxiv.org/abs/2602.23739

>Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution
https://arxiv.org/abs/2602.24240

>NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection
https://arxiv.org/abs/2602.23863

>RAViT: Resolution-Adaptive Vision Transformer
https://arxiv.org/abs/2602.24159

>PixelRush: Ultra-Fast, Training-Free High-Resolution Image Generation via One-step Diffusion
https://arxiv.org/abs/2602.12769

>Diff-Aid: Inference-time Adaptive Interaction Denoising for Rectified T2I Generation
https://arxiv.org/abs/2602.13585
>>
>>
>>
>>
>>
>>108281043
https://suno.com/s/CVdLn8Ay2Acckcy2
>>
i miss schizo anon
>>
>>
>>108284156
better but what does that have to do with /sdg/ lel
also some of those lyrics are a bit...forced
>>
>>108284195
Forced soul vs artificial soul.
>>
>>108284224
how about natural soul
>>
>>
>>
>>
>>
>>
>>
>>
>>
gm
>>
>vibe killed
>>
>>
>>108285562
gm
>>
>>108285562
gm
thumbnail made me think quokka had a zoomer broccoli cut fr fr no cap
>>
>>108284195
i was just fuckin around and kinda liked it, only spent like 10 minutes on it.
>>
>>
Quite the ded shitter
>>
Officer Quokka
>>108285610
Kek, what is that cut even called?
>>
File: 21e.jpg (485.9 KB)
>>108286450
I've only ever heard it called a broccoli cut
>>
>>108286534
we can use AI to derive the next popular vegetable based haircut
>>
>>
>>108286593
>>
>>
ew gross, it replied to me
>>
good pm anon
nice gen up there.
>>
>>
>>
>>
>>
>>
>>108287764
cool alien dude
>>
this site has seedance 2 btw, paid only btw but its real full seedance 2 https://yapper.so/i/CSJG2AYC96
>>
>>
>>
>>
>>
UNIT
>>
>>
is 2048 the max dimension limit for zimg? I've also been noticing many loras are trained at 1024 for some reason
>>
>>108288368
When i was using 2304x1296, it did weird things like repeating stuff on the edges, images being off-center, etc. how much it matters really depends on the scene. even in these you've been doing if you look closely at the right side it's always just kind of... off from the rest of the image. pic rel is a good example: on the right side it started putting in another book for no reason. you can usually spot some artifacting in those sections too. i don't know what the max actually is, it's probably based on total megapixels desu
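rough sketch of the total-megapixels idea, in case anyone wants to test it. the pixel budget below is a made-up placeholder, not a known Z-Image limit, and snapping sides to multiples of 16 is also just an assumption. all it does is compute megapixels and shrink a resolution to fit a budget while roughly keeping the aspect ratio:

```python
import math

# Hypothetical pixel budget; the real limit for Z-Image is not known here.
BUDGET_PX = 2_000_000


def megapixels(width, height):
    """Total pixel count in millions."""
    return width * height / 1_000_000


def fit_to_pixel_budget(width, height, budget_px=BUDGET_PX, multiple=16):
    """Shrink (width, height) to fit budget_px, roughly keeping aspect ratio.

    Each side is rounded down to a multiple of `multiple` (assumed 16).
    Sizes already inside the budget are only snapped, never enlarged.
    """
    scale = min(1.0, math.sqrt(budget_px / (width * height)))
    new_w = max(multiple, int(width * scale) // multiple * multiple)
    new_h = max(multiple, int(height * scale) // multiple * multiple)
    return new_w, new_h


if __name__ == "__main__":
    w, h = 2304, 1296
    print(f"{w}x{h} = {megapixels(w, h):.2f} MP")
    print("fits budget as", fit_to_pixel_budget(w, h))
```

if the megapixel theory holds, lowering the budget until the edge repeats stop would give a rough ceiling. if it doesn't, the limit is probably per-side instead.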
>>
>>
>>108288429
>the right side it's always just kind of... off
yeah, thats where the question came from. I was doing some other gens too that were doing similar things cuz they were too big. I'd gen smaller and upscale but zit isn't great in my upscaling workflow either cuz it artifacts a lot there too

>>108288449
nice
hero or villain?
>>
>>108288485
the weirdest part is that the rest of the image comes out great, it's like it starts another image tiled on the extra width
>>
>>
>>
>>
>>
>>108289268
you're heckin' cute and valid
let no one tell you otherwise
>>
>>
>>108289384
can I also be cute and valid
>>
>>
>>108290133
nice watercolor effect
>>
>>
here I was hoping nigbo was finally hit by a bus

sadly I was wrong
>>
I finally got this prompt doing decent stuff. still tons of errors tho
>>
>>108290550
>still tons of errors tho
that's a signature in every one of your crap gens
>>
>>108290550
where I was before

>>108290556
hahaha you got me xD
>>
gn
>>
I would like to do this.
>>
i miss schizo anon
>>
nano banana 2 is so good, i wish there was a way to jailbreak it
>>
G'mornin Anons, have a great day!
>>
>>
>>
>>
8giglet here, should i even bother trying to gen with wan on this machine or just pay up and use something like runpod
have no interest in actually buying a bigger gpu cause i'll probably get bored in a week
>>
>>108292495
>>>/g/ldg
>>
>>108293210
what exactly is the point of this general these days?
>>
>>
Morning anons
>>108292495
I have 6gb. Sadly there really aren't many options for us, since even the most optimized models take a while to gen. It's not worth it; if you have the money, just pay for it.
>>
>>108292495
z image turbo works great on 8gb
>>
>>108292495
ofc i didn't read the post, with wan you're fucked
>>
>>
>>
>>
>>
>>108294024
Your gens look like shit
Do you really never try out new stuff? You post the same slop all the time and that's super boring
>>
>>
>>
>>
File: file.png (759.3 KB)
>Ask it to make the background transparent
>does this
>>
the official racism dataset

>Large-Scale Dataset and Benchmark for Skin Tone Classification in the Wild
https://arxiv.org/abs/2603.02475
>>
>>108294476
https://github.com/lllyasviel/LayerDiffuse
>>
>>
>>
>>
>>108294494
Holy based
>>
>>
>>
>>108294476
>Pet box
>No breathing holes
Oh no
>>
>>
>>
>>
>>
>>108295955
new lora status?
>>
>>108295653
>>108296113
Hey debo please fuck off from /ldg/ you're not welcome in the real thread
>>
>>108296277
sorry, sdg rules are that you must post a gen for your opinion to be considered
>>
that just sounds like an invitation
>>
>>
>>108296113
idk why the z-turbo lora failed but the z-base is mostly ok, although it turns her asian more often than not
i'm too tired lately to try again
>>
>>
>>108296447
not sure if this is relevant but I saw this comfy node yesterday

Been using Z-Image Turbo and my LoRAs were working but something always felt off. Dug into it and it turns out the issue is architectural: Z-Image Turbo uses fused QKV attention instead of separate to_q/to_k/to_v like most other models. So when you load a LoRA trained with the standard diffusers format, the default loader just can't find matching keys and quietly skips them. Same deal with the output projection (to_out.0 vs just out).

Basically your attention weights get thrown away and you're left with partial patches, which explains why things feel off but not completely broken.

So I made a node that handles the conversion automatically. It detects if the LoRA has separate Q/K/V, fuses them into the format Z-Image actually expects, and builds the correct key map using ComfyUI's own z_image_to_diffusers utility. Drop-in replacement, just swap the node.

Repo: https://github.com/capitan01R/Comfyui-ZiT-Lora-loader
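
for anyone wondering what "fusing" separate Q/K/V LoRA weights could look like, here's a minimal sketch. it is not taken from the linked repo. it assumes the fused layer stacks Q, K, V in that order along the output dimension, uses plain nested lists instead of tensors, and ignores alpha scaling (assume it's already folded into the weights). the function names are made up for illustration:

```python
def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def fuse_qkv_lora(pairs):
    """Fuse separate LoRA pairs into one pair for a fused QKV layer.

    pairs: [(down_q, up_q), (down_k, up_k), (down_v, up_v)] where
    down is rank x in_features and up is out_features x rank.
    Returns (fused_down, fused_up) such that fused_up @ fused_down equals
    the per-projection deltas stacked along the output dimension
    (assumed Q, K, V order).
    """
    # Stack the down matrices along the rank dimension.
    fused_down = [list(row) for down, _ in pairs for row in down]
    total_rank = len(fused_down)

    # Place each up matrix on the block diagonal so it only sees its own
    # slice of the stacked rank dimension.
    fused_up = []
    offset = 0
    for down, up in pairs:
        rank = len(down)
        for row in up:
            padded = ([0.0] * offset + list(row)
                      + [0.0] * (total_rank - offset - rank))
            fused_up.append(padded)
        offset += rank
    return fused_down, fused_up


if __name__ == "__main__":
    # Toy rank-1 LoRAs with in_features = out_features = 2.
    q = ([[1.0, 2.0]], [[1.0], [0.0]])
    k = ([[0.0, 1.0]], [[2.0], [1.0]])
    v = ([[3.0, 0.0]], [[0.0], [1.0]])
    down, up = fuse_qkv_lora([q, k, v])
    stacked = [r for d, u in (q, k, v) for r in matmul(u, d)]
    print(matmul(up, down) == stacked)
```

the fused rank is the sum of the three ranks, so a loader that scales by alpha/rank would need the scale handled separately.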
>>
>>108296561
yah in my case it's more the lora itself than comfy (since i haven't updated in a while). i just have to re-do it more carefully, i rushed it last time and all the samples and images were just an ugly blurry mess. i'll probably try again over the weekend. but the base lora works on z-turbo well enough for now
i did try that node and it made no difference to the z-turbo lora i made
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108297051
i noticed zturbo has a harder time with feet at certain angles compared to klein
>>
>>
>>
>>
>>108297059
Hmm. Perhaps. I didn't try Klein enough to make a feet comparison. The feet in my other z-turbo gens seem ok except for the occasional 6th toe.
>>
>>108297135
yah it's mainly kneeling or feet behind kind of poses that it seems to struggle with
>>
>>
>>
>>108297221
ah. Let me try a few with that pose.
>>
>>108297334
i admit it's difficult for most models, but z-image gives up too early
>>
>>
>>108297367
Yeah, they sometimes turn out weird.
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>battlesuit turns vaguely egyptian
heh
>>
me in the back
>>
>>
>>
>>
>>
>>
Quokka and Wolfuokka
>>
>>108298329
tasmanian wolf?
>>
>>
>>
>>
new
>>108298416
>>108298416
>>108298416
>>
>>
>>
>>
>>
>>
>>
>>
>>108298419
Thanks :)
>>
