Thread #108282577
File: 1772485058350742.jpg (58.6 KB)
58.6 KB JPG
Previous /sdg/ thread : >>108268244
>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix
>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF
>Anima
https://huggingface.co/circlestone-labs/Anima
>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info
>Index of guides and other tools
https://rentry.org/sdg-link
>Related boards
>>>/aco/sdg
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai
OP https://rentry.co/twkuk8tz
169 RepliesView Thread
>>
>mfw Resource news
03/01/2026
>Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
https://github.com/Kaiwen-Zhu/MIGM-Shortcut
>Open-sourced a one-click ComfyUI setup for RTX 50-series on Windows
https://github.com/hiroki-abe-58/ComfyUI-Win-Blackwell
>stable-diffusion-webui-codex v0.2.0-alpha
https://github.com/sangoi-exe/stable-diffusion-webui-codex
>ComfyUI SeedVR2 Tiler
https://github.com/BacoHubo/ComfyUI_SeedVR2_Tiler
02/28/2026
>Anima 2B Style Explorer
https://github.com/ThetaCursed/Anima-Style-Explorer
>LoRWeB: Spanning the Visual Analogy Space with a Weight Basis of LoRAs
https://research.nvidia.com/labs/par/lorweb
>Nexa - Your On-the-Go ComfyUI Companion
https://github.com/Arif-salah/Nexa_comfyui
>Z-Image-Turbo Controlnet Union 2.1 version 2602
https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union- 2.1
>FL PixelGen: Pixel-space diffusion text-to-image generation and LoRA training nodes
https://github.com/filliptm/ComfyUI-FL-PixelGen
>ComfyUI NKD Sigmas Curve
https://github.com/Nekodificador/ComfyUI-NKD-Sigmas-Curve
>Capable, Open, and Safe: Combating AI Misuse
https://bfl.ai/blog/capable-open-and-safe-combating-ai-misuse
02/27/2026
>WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval
https://github.com/Physicsmile/WISER
>Aligning Few-Step Diffusion Models with Dense Reward Difference Learning
https://github.com/ZiyiZhang27/sdpo
>Huihui-Qwen3.5-27B-abliterated
https://huggingface.co/huihui-ai/Huihui-Qwen3.5-27B-abliterated
>ComfyUI Yedp Action Director
https://github.com/yedp123/ComfyUI-Yedp-Action-Director
>Nano Banana 2: Combining Pro capabilities with lightning-fast speed
https://blog.google/innovation-and-ai/technology/ai/nano-banana-2
>ComfyUI适配 FireRed-Image-Edit-1.0: general-purpose image editing
https://huggingface.co/FireRedTeam/FireRed-Image-Edit-1.0-ComfyUI
>>
>mfw Research news
03/01/2026
>Mode Seeking meets Mean Seeking for Fast Long Video Generation
https://primecai.github.io/mmm
>SwitchCraft: Training-Free Multi-Event Video Generation with Attention Controls
https://arxiv.org/abs/2602.23956
>Diffusion Probe: Generated Image Result Prediction Using CNN Probes
https://arxiv.org/abs/2602.23783
>SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
https://arxiv.org/abs/2602.24208
>MSVBench: Towards Human-Level Evaluation of Multi-Shot Video Generation
https://arxiv.org/abs/2602.23969
>Enhancing Spatial Understanding in Image Generation via Reward Modeling
https://dagroup-pku.github.io/SpatialT2I
>DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
https://arxiv.org/abs/2602.23438
>Interpretable Debiasing of Vision-Language Models for Social Fairness
https://arxiv.org/abs/2602.24014
>Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping
https://arxiv.org/abs/2602.23980
>A Difference-in-Difference Approach to Detecting AI-Generated Images
https://arxiv.org/abs/2602.23732
>Fixed Anchors Are Not Enough: Dynamic Retrieval and Persistent Homology for Dataset Distillation
https://arxiv.org/abs/2602.24144
>U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation
https://arxiv.org/abs/2602.23739
>Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution
https://arxiv.org/abs/2602.24240
>NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection
https://arxiv.org/abs/2602.23863
>RAViT: Resolution-Adaptive Vision Transformer
https://arxiv.org/abs/2602.24159
>PixelRush: Ultra-Fast, Training-Free High-Resolution Image Generation via One-step Diffusion
https://arxiv.org/abs/2602.12769
>Diff-Aid: Inference-time Adaptive Interaction Denoising for Rectified T2I Generation
https://arxiv.org/abs/2602.13585
>>
File: deWM_zi_00013_.png (3.4 MB)
3.4 MB PNG
>>
File: paper-zit-2026-03-01_00243_.png (2.2 MB)
2.2 MB PNG
>>
File: deWM_zi_00014_.png (3.6 MB)
3.6 MB PNG
>>
>>
>>
>>
File: 00000003-960835846447843-z-turbo-mt.jpg (1.6 MB)
1.6 MB JPG
>>
File: 00000004-619152000979513-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>108284156
better but what does that have to do with /sdg/ lel
also some of those lyrics are a bit...forced
>>
>>
File: 00000005-393156861681559-z-turbo-mt.jpg (1.1 MB)
1.1 MB JPG
>>108284224
how about natural soul
>>
File: 00000008-131571729879043-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
File: 00023-1189546213.png (1.3 MB)
1.3 MB PNG
>>
File: 00031-1189546219.png (1.3 MB)
1.3 MB PNG
>>
File: 00000009-961643818768097-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000010-398362517945231-z-turbo-mt.jpg (1.6 MB)
1.6 MB JPG
>>
File: 00000013-33193570602728-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000014-461346537788133-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000015-630071646108841-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: deCA_zi_00041_.png (3.4 MB)
3.4 MB PNG
gm
>>
>>
File: 00000016-718566125749057-z-turbo-mt.jpg (1.6 MB)
1.6 MB JPG
>>
Morning anons
>>
File: 00000017-1095254098680946-z-turbo-mt.jpg (2 MB)
2 MB JPG
>>108285562
gm
>>
File: deCA_zi_00042_.png (3.1 MB)
3.1 MB PNG
>>108285562
gm
thumbnail made me think quokka had a zoomer broccoli cut fr fr no cap
>>
File: paper-zit-2026-03-01_00187_.png (1.9 MB)
1.9 MB PNG
>>108284195
i was just fuckin around and kinda liked it, only spent like 10 minutes on it.
>>
File: 00152-2084199451.png (1.7 MB)
1.7 MB PNG
>>
>>
File: 1772564771791_a535b62a-1410-4aff-995f-70314d6c3159.png (831.4 KB)
831.4 KB PNG
Officer Quokka
>>108285610
Kek, how is that cut even called?
>>
File: 21e.jpg (485.9 KB)
485.9 KB JPG
>>108286450
I've only ever heard it called a broccoli cut
>>
File: deCA_zi_00043_.jpg (1.2 MB)
1.2 MB JPG
>>108286534
we can use AI to derive the next popular vegetable based haircut
>>
>>
File: 00011-3744966258.png (1.7 MB)
1.7 MB PNG
>>108286593
>>
File: 00000021-781482003261398-z-turbo-mt.jpg (1.8 MB)
1.8 MB JPG
>>
File: deCA_zi_00045_.png (3.5 MB)
3.5 MB PNG
ew gross, it replied to me
>>
good pm anon
nice gen up there.
>>
>>
File: deCA_zi_00046_.png (3.5 MB)
3.5 MB PNG
>>
File: 00000023-38171207012539-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000025-1124398964603614-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
File: 00000026-154332964805807-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: deCA_zi_00047_.png (3.2 MB)
3.2 MB PNG
>>108287764
cool alien dude
>>
>>
>>
>>
File: deCA_zi_00048_.png (3.5 MB)
3.5 MB PNG
>>
File: models-zit-2026-03-03_00003_.png (2.7 MB)
2.7 MB PNG
>>
File: 1741108368507388.png (1.2 MB)
1.2 MB PNG
UNIT
>>
File: models-zit-2026-03-03_00012_.png (2.5 MB)
2.5 MB PNG
>>
File: deCA_zi_00049_.png (2.7 MB)
2.7 MB PNG
is 2048 the max dimension limit for zimg? I've also been noticing many loras are trained at 1024 for some reason
>>
File: paper-zit-2026-02-05_00098_.png (3.7 MB)
3.7 MB PNG
>>108288368
When i was using 2304x1296, it did weird things like repeating stuff on the edges, images being off-center, etc. how much it matters really depends on the scene. even in these you've been doing if you look closely at the right side it's always just kind of... off from the rest of the image. pic rel is a good example, the right side it started putting another book for no reason. you can usually spot some artifacting in those sections too. i don't know what the max actually is, probably based on total megapixels desu
>>
File: download - 2025-04-22T132729.406.jpg (64.6 KB)
64.6 KB JPG
>>
File: deCA_zi_00050_.png (3.9 MB)
3.9 MB PNG
>>108288429
>the right side it's always just kind of... off
yeah, thats where the question came from. I was doing some other gens too that were doing similar things cuz they were too big. I'd gen smaller and upscale but zit isn't great in my upscaling workflow either cuz it artifacts a lot there too
>>108288449
nice
hero or villain?
>>
File: models-zit-2026-03-03_00126_.png (2.5 MB)
2.5 MB PNG
>>108288485
the weirdest part is that the rest of the image comes out great, it's like it starts another image tiled on the extra width
>>
File: 00000028-891118521741572-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000029-746038305485169-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: models-zit-2026-03-03_00135_.png (2.8 MB)
2.8 MB PNG
>>
File: models-zit-2026-03-03_00144_.png (2.7 MB)
2.7 MB PNG
>>
>>
File: 00000030-806050982211682-z-turbo-mt.jpg (1.2 MB)
1.2 MB JPG
>>
File: deCD_zi_00003_.png (3.4 MB)
3.4 MB PNG
>>108289384
can I also be cute an valid
>>
>>
File: deCD_zi_00005_.png (3.3 MB)
3.3 MB PNG
>>108290133
nice watercolor effect
>>
File: models-zit-2026-03-03_00231_.png (2.7 MB)
2.7 MB PNG
>>
>>
>>
>>
File: deJS_zi_00007_.jpg (968.2 KB)
968.2 KB JPG
>>108290550
where I was before
>>108290556
hahaha you got me xD
>>
File: deCD_zi_00012_.png (3.5 MB)
3.5 MB PNG
gn
>>
File: Ffcfsq4XkAEx3rq.jpg (539 KB)
539 KB JPG
I would like to do this.
>>
>>
>>
File: 000000_60800_.png (3.3 MB)
3.3 MB PNG
G'mornin Anons, have a great day!
>>
File: 00000031-121444365233853-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
>>
File: 00000002-268552492768140-z-turbo-mt.jpg (2.3 MB)
2.3 MB JPG
>>
>>
>>
>>
>>
Morning anons
>>108292495
I have 6gb, sadly there's really not many option for us as even the most optimized models would take a while to gen, it's not worth it, if you have the money, just pay for it.
>>
File: models-zit-2026-03-03_00213_.png (2.6 MB)
2.6 MB PNG
>>108292495
z image turbo works great on 8gb
>>
>>
>>
File: 00087-3294099861.png (1.8 MB)
1.8 MB PNG
>>
File: 00000003-48564749113681-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
>>
>>
>>
File: deCD_zi_00013_.png (3.3 MB)
3.3 MB PNG
>>
>>
>>
File: deCD_zi_00015_.png (3.2 MB)
3.2 MB PNG
the official racism dataset
>Large-Scale Dataset and Benchmark for Skin Tone Classification in the Wild
https://arxiv.org/abs/2603.02475
>>
File: deCD_zi_00016_.png (3.5 MB)
3.5 MB PNG
>>108294476
https://github.com/lllyasviel/LayerDiffuse
>>
File: 00000005-889778250843165-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000007-1035954493876270-z-turbo-mt.jpg (1.8 MB)
1.8 MB JPG
>>
File: deCD_zi_00017_.png (2.8 MB)
2.8 MB PNG
>>
File: 1772651370792_473a1f41-20ae-4d20-8043-64ff57031e8e.png (959.3 KB)
959.3 KB PNG
>>108294494
Holy based
>>
File: 00000008-729384968417940-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: deCD_zi_00019_.png (3.1 MB)
3.1 MB PNG
>>
>>108294476
>Pet box
>No breathing holes
Oh no
>>
File: 00000002-401231644625605-z-turbo-mt.jpg (1.8 MB)
1.8 MB JPG
>>
File: 00000004-978895367907822-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: deCD_zi_00020_.png (3.7 MB)
3.7 MB PNG
>>
File: 00000005-381636874412363-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: deCD_zi_00024_.png (3.2 MB)
3.2 MB PNG
>>108295955
new lora status?
>>
>>108295653
>>108296113
Hey debo please fuck off from /ldg/ you're not welcome in the real thread
>>
File: deCD_zi_00026_.png (3.5 MB)
3.5 MB PNG
>>108296277
sorry, sdg rules is that you must post a gen for your opinion to be considered
>>
File: models-zit-2026-03-03_00106_.png (2.5 MB)
2.5 MB PNG
that just sounds like an invitation
>>
File: 00307-1604700477.png (2.7 MB)
2.7 MB PNG
>>
File: 00000007-883746406175674-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>108296113
idk why the z-turbo lora failed but the z-base is mostly ok, although it turns her asian more often than not
i'm too tired lately to try again
>>
File: 00000008-630630088708709-z-turbo-mt.jpg (1.6 MB)
1.6 MB JPG
>>
File: deCD_zi_00027_.png (3.4 MB)
3.4 MB PNG
>>108296447
not sure if this is relevant but I saw this comfy node yesterdayBeen using Z-Image Turbo and my LoRAs were working but something always felt off. Dug into it and turns out the issue is architectural, Z-Image Turbo uses fused QKV attention instead of separate to_q/to_k/to_v like most other models. So when you load a LoRA trained with the standard diffusers format, the default loader just can't find matching keys and quietly skips them. Same deal with the output projection (to_out.0 vs just out).
Basically your attention weights get thrown away and you're left with partial patches, which explains why things feel off but not completely broken.
So I made a node that handles the conversion automatically. It detects if the LoRA has separate Q/K/V, fuses them into the format Z-Image actually expects, and builds the correct key map using ComfyUI's own z_image_to_diffusers utility. Drop-in replacement, just swap the node.
Repo: https://github.com/capitan01R/Comfyui-ZiT-Lora-loader
>>
File: 00000010-950786313398987-z-turbo-mt.jpg (1.9 MB)
1.9 MB JPG
>>108296561
yah in my case it's more the lora itself than comfy (since i havent updated in a while), i just have to re-do it again more carefully, i just rushed it last time and all samples and images were just ugly blurry mess. i'll probably try again over the weekend. but the base lora works on z-turbo well enough for now
i did try that node and it made no difference to the z-turbo lora i made
>>
File: 00000002-859719912803697-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000003-628188907625487-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00339-1347746357.png (2.6 MB)
2.6 MB PNG
>>
File: deCD_zi_00028_.png (3.3 MB)
3.3 MB PNG
>>
File: 00000005-943555773241792-z-turbo-mt.jpg (1.1 MB)
1.1 MB JPG
>>
File: 00000006-649351654728117-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
File: 00355-1815521286.png (2.6 MB)
2.6 MB PNG
>>
>>
File: 00000007-1046366095584942-z-turbo-mt.jpg (2.5 MB)
2.5 MB JPG
>>108297051
i noticed zturbo has a harder time with feet at certain angles compared to klein
>>
>>
File: 00000009-911784914884595-z-turbo-mt.jpg (1.9 MB)
1.9 MB JPG
>>
File: 00362-1909569381.png (2.8 MB)
2.8 MB PNG
>>
>>108297059
Hmm. Perhaps. I didn't try Klein enough to make a feet comparison. The feet other my z-turbo gens seems ok except for the occasional 6th toe.
>>
File: 00000010-158767495711844-z-turbo-mt.jpg (2.1 MB)
2.1 MB JPG
>>108297135
yah it's mainly kneeling or feet behind kind of poses that it seems to struggle with
>>
File: 00000011-328693428740780-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000012-1117535874054239-z-turbo-mt.jpg (1.6 MB)
1.6 MB JPG
>>
>>108297221
ah. Let me try a few with that pose.
>>
File: 00000013-1004335600395132-z-turbo-mt.jpg (1.1 MB)
1.1 MB JPG
>>108297334
i admit it's difficult for most models, but z-image gives up too early
>>
File: 00000014-1008344006580780-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
>>108297367
Yeah, they sometimes turnout weird.
>>
File: 00000015-639245056490386-z-turbo-mt.jpg (1.5 MB)
1.5 MB JPG
>>
File: 00000016-885029160921449-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000017-644768196736413-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000018-40691918935007-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000019-691086997059095-z-turbo-mt.jpg (1.9 MB)
1.9 MB JPG
>>
File: 00000020-1082728430679269-z-turbo-mt.jpg (1.9 MB)
1.9 MB JPG
>>
File: 00000022-638880985224868-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000023-494169887715204-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000024-1015107865111056-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000025-816407145140650-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000026-655126922835959-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: 00000027-857228317305873-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
File: 00000028-1102400426908865-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>battlesuit turns vaguely egyptian
heh
>>
File: 00000030-330329461752538-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
me in the back
>>
File: 00000033-765182267337815-z-turbo-mt.jpg (1.8 MB)
1.8 MB JPG
>>
File: 00000034-1086956529278273-z-turbo-mt.jpg (1.2 MB)
1.2 MB JPG
>>
File: 00000035-189963806459036-z-turbo-mt.jpg (1.2 MB)
1.2 MB JPG
>>
File: 00000036-899324978216768-z-turbo-mt.jpg (1.1 MB)
1.1 MB JPG
>>
File: 00000038-417623658430953-z-turbo-mt.jpg (1.4 MB)
1.4 MB JPG
>>
Quokka and Wolfuokka
>>
File: 00000039-166727297408429-z-turbo-mt.jpg (1.2 MB)
1.2 MB JPG
>>108298329
tasmanian wolf?
>>
File: deCD_zi_00033_.png (3.5 MB)
3.5 MB PNG
>>
File: 00000040-392299456736850-z-turbo-mt.jpg (1.2 MB)
1.2 MB JPG
>>
File: 00000041-514910521451118-z-turbo-mt.jpg (1.1 MB)
1.1 MB JPG
>>
File: 00000042-983682974714801-z-turbo-mt.jpg (999.4 KB)
999.4 KB JPG
>>
File: 00000043-724605136817317-z-turbo-mt.jpg (989.5 KB)
989.5 KB JPG
>>
File: 00000044-181334590352697-z-turbo-mt.jpg (967.3 KB)
967.3 KB JPG
>>
File: 00000045-407011256986115-z-turbo-mt.jpg (935.5 KB)
935.5 KB JPG
>>
>>
File: 00000047-763939301224903-z-turbo-mt.jpg (1.3 MB)
1.3 MB JPG
>>
File: models-zit-2026-03-03_00207_.png (2.6 MB)
2.6 MB PNG
>>
>>108298419
Thanks :)
>>
File: paper-zit-2026-03-01_00185_.png (2.2 MB)
2.2 MB PNG