//g/
File: 1780201519704192.png (3.1 MB)
3.1 MB
Previous /sdg/ thread : >>108938493

>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI

>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix

>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B

>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF

>Anima
https://huggingface.co/circlestone-labs/Anima

>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image

>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2

>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info

>Index of guides and other tools
https://rentry.org/sdg-link

>Related boards
>>>/aco/csdg/
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai

OP https://rentry.co/twkuk8tz
Showing all 144 replies.
>>
Dead shithole
>>
>>
>mfw Resource news

05/31/2026

>FLUX Identity Adjuster (V2)
https://github.com/Magirad/Flux_ID_Adjuster_V2

>ComfyUI AnimaFastTrain
https://github.com/quinteroac/ComfyUI-AnimaFastTrain

>MONET: Open-source dataset
https://huggingface.co/datasets/jasperai/monet

05/30/2026

>Pixal3D — Apple Silicon (MPS / Metal) Port
https://github.com/pawel-mazurkiewicz/Pixal3D-mac

>Comfy-Org/PixelDiT (diffusion models & upscalers)
https://huggingface.co/Comfy-Org/PixelDiT/tree/main/diffusion_models

>Orion4D Generative Paint: ComfyUI advanced painting interface
https://github.com/orion4d/Orion4D_generative_paint

>ComfyUI Anima IP-Adapter
https://github.com/Wenaka2004/comfyui-anima-ipadapter

05/29/2026

>Colored Noise Diffusion Sampling
https://hadardavidson.github.io/CNS

>VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion
https://videomla.github.io

>minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models
https://github.com/shengshu-ai/minWM](https://github.com/shengshu-ai/minWM

>GPIC: A Giant Permissive Image Corpus for Visual Generation
https://gpic.stanford.edu

>SGMD: Score Gradient Matching Distillation for Few-Step Video Diffusion Distillation
https://github.com/ModelTC/LightX2V

>Native Audio-Visual Alignment for Generation
https://ernie-research.github.io/NAVA

>GASS: Geometry-Aware Spherical Sampling for Disentangled Diversity Enhancement in Text-to-Image Generation
https://github.com/L-YeZhu/GASS_T2I

>SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
https://github.com/JiachengZ01/SAVVA

>Nexus BTA: Local AI image and video studio built around an embedded ComfyUI runtime.
https://github.com/JpAndreBTA/Nexus-BTA

05/28/2026

>MAVEN A Multi-Agent Framework for Multicultural Text-to-Video Generation
https://github.com/AIM-SCU/CRAFT

>Bias Leaves a Gradient Trail
https://github.com/vitryt/label-free-bias-identification
>>
>mfw Research news

05/31/2026

>Channel-wise Vector Quantization
https://arxiv.org/abs/2605.26089

>How Accurate are Video Quality Models for Diffusion-Based Video Super-Resolution?
https://arxiv.org/abs/2605.25940

>AdvantageFlow: Advantage-Weighted Least Squares for RL in Flow Models
https://arxiv.org/abs/2605.26013

>Mitigating Object Hallucinations in Vision-Language Models through Region-Aware Attention Recalibration
https://arxiv.org/abs/2605.24957

>SpongeBob: Sync-Aware Harmonious Audio-Visual Generative Editing
https://hy-spongebob.github.io

>Self-supervised Dynamic Heterogeneous Degradation Modeling for Unified Zero-Shot Image Restoration
https://arxiv.org/abs/2605.24593

>On-Policy Adversarial Flow Distillation for Autoregressive Video Generation
https://arxiv.org/abs/2605.26105

>SLAD : Shared LoRA Adapters for Task Specific Distillation
https://arxiv.org/abs/2605.29726

>SuperVoxelGPT: Adaptive and Ordered 3D Tokenization for Autoregressive Shape Generation
https://arxiv.org/abs/2605.29655

>When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
https://arxiv.org/abs/2605.27348

>Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models
https://arxiv.org/abs/2605.28132

>CIVIC: End-to-End Sequence Compactness for Efficient Vision-Language Models
https://arxiv.org/abs/2605.28115

>Janus-LoRA: A Balanced Low-Rank Adaptation for Continual Learning
https://arxiv.org/abs/2605.28495

>Resolving Ambiguity in Composed Image Retrieval via Calibrated Interaction
https://arxiv.org/abs/2605.24634

>ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs
https://arxiv.org/abs/2605.25524

>Structure-Guided Visual Perturbation Neutralization for LVLMs
https://arxiv.org/abs/2605.27927

>Black-box Membership Inference Attacks on the Pre-training Data of Image-generation Models
https://arxiv.org/abs/2605.27020
>>
3.2 MB
>>108949377
hooray i made it into the OP

>>108949420
>>108949440
gm
>>
>gm
>>
File: 45435436.jpg (159.3 KB)
159.3 KB
>>
>>108949499
>>108949511
>>108949728
gm
>>
>>
>>108949728
classic
>>
Good morning.
>>
2.3 MB
>>108949822
gm
what are you genning today?
>>
gm
>>108949732
>>108949511
>>108949844

doing well I hope
>>
3.1 MB
>>108949976
>doing well I hope
thanks, doing goog! actually having a productive weekend, very rare for me lol
>>
>>
>>
>>
Afternoon anons
>>
2.6 MB
>>108950584
ga
>>
>>108950584
howdy
>>
>ga
>>
>>
>>108950824
>>
>>108950937
yes
better with a lora but plain zit/zib can
>>
>>108950965
can't*
it can but it's all fucked up
>>
>>108950978
https://files.catbox.moe/8urc7e.jpg
it's not perfect, but it works
using only breast slider lora
>>
>>
>>
>>
>>
>>
2.3 MB
>>
>>108951817
nice
>>
2.9 MB
>>108951853
:)
>>
File: o_00001_.png (1.7 MB)
1.7 MB
>>
File: o_00004_.png (1.2 MB)
1.2 MB
>>
1.2 MB
>>
File: o_00006_.png (1.3 MB)
1.3 MB
>>
>>
File: o_00007_.png (1.5 MB)
1.5 MB
>>
1.5 MB
>>108952212
I was recently reading about giant ground sloths. too bad they went extinct. I wonder if your prompt would know how to draw them, or if it would just make normal sloths
>>
File: o_00009_.png (1.8 MB)
1.8 MB
>>108952258
i'll try. these have been with anima.
>>
File: 45456564.jpg (195.2 KB)
195.2 KB
flux1 giant ground sloth with lora
>>
>>
2.8 MB
>>108952290
its perfect
theres an alternate timeline where we all have giant sloth pets
>>
File: o_00010_.png (1.4 MB)
1.4 MB
anima g.g.s., with an extra person, for scale, i guess
>>
File: o_00011_.png (1001.2 KB)
1001.2 KB
>>
2.6 MB
>>108952318
lol, its lost the plot a bit but thats definitely giant
>>
>>
>>
>>
3.0 MB
>>
>>
3.3 MB
>>
>>
>>
>>
>>
2.8 MB
>>
>>
>>
2.7 MB
>>
2.2 MB
>>
2.7 MB
>>
i miss schizo anon
>>
File: image.jpg (69.9 KB)
69.9 KB
>>
File: OIG4.jpg (221.0 KB)
221.0 KB
>>
File: 000000_73096_.png (2.5 MB)
2.5 MB
G'mornin Anons, have a great day!
>>
>>108955185
gm

you the same anon
>>
>>
>>
>>
File: o_00015_.png (943.6 KB)
943.6 KB
>>
File: o_00016_.png (536.0 KB)
536.0 KB
>>
File: o_00017_.png (1.8 MB)
1.8 MB
>>
>>
File: o_00021_.png (1.4 MB)
1.4 MB
>>
>>
3.1 MB
Version with sound: https://files.catbox.moe/axq3li.mp4
>>
File: o_00025_.png (1.7 MB)
1.7 MB
>>
File: o_00027_.png (1.6 MB)
1.6 MB
>>
File: o_00030_.png (1.4 MB)
1.4 MB
>>
>>
File: o_00032_.png (1.1 MB)
1.1 MB
'ultra surreal penguin nightmare'
>>
File: o_00033_.png (1.1 MB)
1.1 MB
>>
>>
>>
File: o_00036_.png (837.9 KB)
837.9 KB
>>
File: o_00039_.png (1.5 MB)
1.5 MB
>>
>>
>>
>>
File: 1748874290726.jpg (819.3 KB)
819.3 KB
desuarchive.org/g/thread/105458332
year ago
>>
2.0 MB
gm
>>
>gm
>>
File: o_00042_.png (2.3 MB)
2.3 MB
>>108956900
gm
>>
>>108956895
when i was in the psych ward i unironically dreamed of koff girls dancing in the style of what miniani did
sometimes i wonder what he's doing now, he put a lot of time into his animations just to vanish without a trace
>>
2.7 MB
>>108957352
miniani always talked about how he didn't have time and was busy (work, I think?)
I think about a lot of anons like that. that archive link had RA-anon, who was similar. popped in randomly, dumped a bunch of cool stuff, then disappeared for ages. I hope they're all off in other corners of the internet continuing to make cool things
>>
3.0 MB
new github copilot usage metering is actually insane
>>
File: 56567656.jpg (159.3 KB)
159.3 KB
>>
File: 36565465467.jpg (192.2 KB)
192.2 KB
>>
2.2 MB
>>
File: 7-dezrashields.png (1.8 MB)
1.8 MB
I just released a character-driven fantasy story using only local stable diffusion models (z-img, ltx 2.3, klein, vibevoice, etc.)

~190 unique shots, ~170 lines of dialogue

https://www.youtube.com/watch?v=t8MKHbR0WF
>>
File: 8-astlisteningin.png (2.6 MB)
2.6 MB
>>108958103
some random stills from the episode... not that the keyframe stills for the shots are really anything to write home about.
>>
File: 3-dezranotsopout.png (2.2 MB)
2.2 MB
>>108958103
>>
2.5 MB
>>108958103
>This video isn't available anymore
doesnt load for me
>>
>>108958158
Oh, oops. deleted the last character from the link, awesome lol.

https://www.youtube.com/watch?v=t8MKHbR0WFc
>>
3.3 MB
>>108958179
cool, looks like a lot of time went into this. will def check it out, but can't until later
how do you feel about how the whole project turned out?
>>
File: 0-ldcloseup.png (2.2 MB)
2.2 MB
>>108958224
It taught me that for long-form content, 80% good is good enough. If you hyper-fixate on making every single shot perfect, you will never finish. I think the project turned out well. It's no HBO show, but it cost me $0 and I learned a ton of new techniques, strategies, and skills in both gen and in video editing.
>>
Afternoon anons
Literally changed the entire description of the setting for "the backrooms" so it wouldn't be too long.
>>
>ga
>>
>>108958179
>>108958290
very nice
i kind of want to puch that witch girl tho
>>
File: 5-dezbigsad.png (1.9 MB)
1.9 MB
>>108958493
Yea... in my head she came off a lot more lovable; the ditzy airhead who can't pay attention even during serious moments. But in practice she's just this tonally inconsistent autist with a bit of a shrill voice (VA is Fern from Frieren).
>>
>>108958506
eh, i've seen worse acting on lifetime channel lel
>>
>>
Speaking of Frieren, I also did a felt stop-motion adaptation of it. If anything it's just fun to see in this style.

https://www.youtube.com/watch?v=1qdDEngPbpw

>>108958547
kek i'll take it
>>
>>108958670
animated by hand or with ai? that's pretty crazy
the fox one is neat
>>
>>108958737
This is a stable diffusion thread lel, all of these are local open source AI models. in the case of the Frieren one, I converted snapshots from the anime into felt style using Klein 9b, and then animated the keyframes using LTX 2.3.

i'm glad you checked out the fox one! it's actually a finalist in a short film competition right now, curious if it'll win! would be cool since it was my first shortfilm ever.
>>
>>108958765
>converted snapshots from the anime into felt style using Klein 9b, and then animated the keyframes
how long did the project take? considering it's a solo project, it's pretty good. good luck on the competition
>>
>>108958809
The Frieren one took about 6 hours. The Felt Fox took about 80 hours. Dezra the Witch took about 200 hours. Thank you! Appreciate you checking it out.

Curious, are you using z-image base for your gens? The creativity with detail is higher than I'm used to seeing with SDXL.
>>
>>108958839
yah these are all zimage with z-detail-slider lora and some node and prompt magic to push details
>>
>>
2.4 MB
>>
>>
>>
>>
2.3 MB
>>
>>108958670
nice job
>>
>>
>>
>>
File: debo_lr_anima1_00013_.png (758.3 KB)
758.3 KB
>>
>>108949479
>mfw i gotta scroll through 3 days of github links just to find the one actually useful tool
>>
>>108949485
>mfw 20 papers in one dump and half are just "attention recalibration" reskins

kek, someone's gotta stop naming things like they're pokemon moves
>>
>>108951060
> breast slider lora and still can't get the proportions right
>>
>>
>>
>>
2.4 MB
>>108959923
I'm glad you found something useful :)
>>
>>
>>
1.1 MB
>>
how's RocM?
>>
>>108961737
New model? Haven't heard of it

Reply to Thread #108949377


Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)