File: 1780494857686237.jpg (1.3 MB)
Previous /sdg/ thread : >>108964525
>Beginner UI
EasyDiffusion: https://easydiffusion.github.io
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
>Advanced UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
Forge Classic: https://github.com/Haoming02/sd-webui-forge-classic
Stability Matrix: https://github.com/LykosAI/StabilityMatrix
>Z-Image
https://comfyanonymous.github.io/ComfyUI_examples/z_image
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Flux.2 Dev/Klein
https://comfyanonymous.github.io/ComfyUI_examples/flux2
https://huggingface.co/black-forest-labs/FLUX.2-dev
https://huggingface.co/black-forest-labs/FLUX.2-klein-4B
https://huggingface.co/black-forest-labs/FLUX.2-klein-9B
>Chroma
https://comfyanonymous.github.io/ComfyUI_examples/chroma
https://huggingface.co/lodestones/Chroma1-HD
https://huggingface.co/silveroxides/Chroma-GGUF
>Anima
https://huggingface.co/circlestone-labs/Anima
>Qwen Image & Edit
https://docs.comfy.org/tutorials/image/qwen/qwen-image
https://huggingface.co/Qwen/Qwen-Image
>Text & image to video - Wan 2.2
https://docs.comfy.org/tutorials/video/wan/wan2_2
>Models, LoRAs & upscaling
https://civitai.com
https://huggingface.co
https://tungsten.run
https://yodayo.com/models
https://www.diffusionarc.com
https://miyukiai.com
https://civitaiarchive.com
https://civitasbay.org
https://www.stablebay.org
https://openmodeldb.info
>Index of guides and other tools
https://rentry.org/sdg-link
>Related boards
>>>/aco/csdg/
>>>/b/degen
>>>/d/ddg
>>>/e/edg
>>>/gif/vdg
>>>/h/hdg
>>>/r/realistic+parody
>>>/tg/slop
>>>/trash/sdg
>>>/u/udg
>>>/vp/napt
>>>/vt/vtai
OP https://rentry.co/twkuk8tz
Showing all 103 replies.
>>
>>
>mfw Resource news
06/04/2026
>Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-Infinity
>DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities
https://github.com/sadjadeb/DetectZoo
>ComfyUI KSampler Matrix Lab
https://github.com/btitkin/ComfyUI-KSampler-Matrix-Lab
06/03/2026
>Ideogram 4.0: Open model at the forefront of design
https://ideogram.ai/blog/ideogram-4.0
>JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation
https://echo-team-joy-future-academy-jd.github.io/Echo-LongVideo-Page
>Follow-Your-Preference++: Rethinking Preference Alignment for Image Inpainting
https://github.com/shenytzzz/Follow-Your-Preference
>LongLive-RAG: A General Retrieval-Augmented Framework for Long Video Generation
https://github.com/qixinhu11/LongLive-RAG
>MAI-Image-2.5
https://microsoft.ai/models/mai-image-2-5
>AAD-1: Asymmetric Adversarial Distillation for One-Step Autoregressive Video Generation
https://aad-1.github.io
>Inference-Time Scaling for Joint Audio-Video Generation
https://jung-jaemin.github.io/ITS-AVGen-Proj
>Video-Mirai: Autoregressive Video Diffusion Models Need Foresight
https://y0uroy.github.io/Video-Mirai
>Order within Chaos: Capturing Intrinsic Energy Anomalies for AI-Manipulated Image Forgery Localization
https://github.com/phoenixnir/FLAME
>VISReg: Variance-Invariance-Sketching Regularization for JEPA training
https://haiyuwu.github.io/visreg
>HumanNOVA: Photorealistic, Universal and Rapid 3D Human Avatar Modeling from a Single Image
https://HumanNOVA.github.io
>Cosmos 3: Omnimodal World Models for Physical AI
https://research.nvidia.com/labs/cosmos-lab/cosmos3
>TGV-KV: Text-Grounded KV Eviction for Vision-Language Models
https://github.com/Danielement321/TGV-KV
>JAVEDIT: Joint Audio-Visual Instruction-Guided Video Editing with Agentic Data Curation
https://ryanchenyn.github.io/projects/JAVEdit
>>
>mfw Research news
06/04/2026
>Imagine Before You Draw: Visual Prompt Engineering for Image Generation
https://arxiv.org/abs/2606.04457
>Efficient and Training-Free Single-Image Diffusion Models
https://haojunqiu.github.io/efficient-SID
>DSA: Dynamic Step Allocation for Fast Autoregressive Video Generation
https://arxiv.org/abs/2606.04432
>Activation Steering of Video Generation Models via Reduced-Order Linear Optimal Control
https://arxiv.org/abs/2606.04775
>MeshFlow: Efficient Artistic Mesh Generation via MeshVAE and Flow-based Diffusion Transformer
https://mesh-flow.github.io
>Video2LoRA: Parametric Video Internalization for Vision-Language Models
https://arxiv.org/abs/2606.04351
>UniCanvas: A Diffusion-base Unified Model for Text-in-Image Joint Generation
https://arxiv.org/abs/2606.04264
>Crafting Your Evolving Dreams: Concept-Incremental Versatile Customization
https://arxiv.org/abs/2606.04797
>InstantRetouch: Efficient and High-Fidelity Instruction-Guided Image Retouching with Bilateral Space
https://openimaginglab.github.io/InstantRetouch
>ChannelTok: Efficient Flexible-Length Vision Tokenization
https://channeltok.github.io
>Controllable Dynamic 3D Shape Generation via 3D Trajectories and Text
https://cvlab-kaist.github.io/T2Mo
>MaCo-GAN: Manifold-Contrastive Adversarial Learning for Single Image Super-Resolution
https://arxiv.org/abs/2606.05068
>MAOAM: Unified Object and Material Selection with Vision-Language Models
https://jadenpark0.github.io/project_pages/maoam
>Impostor: An Agent-Curated Benchmark for Realistic AIGC Manipulation Localization
https://arxiv.org/abs/2606.04545
>An Empirical Study of Data Scale, Model Complexity, and Input Modalities in Visual Generalization
https://arxiv.org/abs/2606.04409
>Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges
https://arxiv.org/abs/2603.20304
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 1768165152218664.png (2.2 MB)
>>108981372
i use a lora for that style, so the lora means i use slightly more recources.
i have a gtx 1080, that is 8gb vram, and i manage.
no idea about isometric, i guess it can.
>>
>>
File: ChatGPT Image Jun 4, 2026, 02_13_16 PM.png (1.5 MB)
>>108981389
thanks, anon.
I really like what google and chatgpt give me, but I need to gen more so local here I come.
>>
File: debo_lr_anima1_00054_.png (2.7 MB)
>>108981409
welcome aboard!
>>
>>
File: ummuhhhwat.gif (779.4 KB)
>>108981386
who keeps making these images? a woman with a number of creatures loitering around her draped in schizo themes? are you okay? are you not taking your meds? is it a bot? some form of shit post that is beyond my comprehension? just because? I dont understand...plz help.
>>
File: debo_lr_anima1_00055_.png (3.1 MB)
>>108981485
show us your latest gen
>>
File: scene (12).png (1.6 MB)
>>108981526
k. can I have my answer now?
>>
File: 00000211-561728997828769-z-turbo-mt.jpg (1.4 MB)
>>108981485
>>108981565
yes
>>
File: ohokay.jpg (18.0 KB)
>>108981573
so just schizo? fair enough.
>>
>>
>>
>>
>>
File: debo_lr_anima1_00057_.png (2.7 MB)
>>108981698
wow, rare brap anon post
>>
File: debo_lr_anima1_00058_.png (2.8 MB)
>>108981869
ga
>infinite backrooms is actually finite and there's vending machines
well thats a pleasant surprise.
>>
>>
>>
>>
>>
>>
File: 1780617765046_a53d3611-3a36-4085-8ba3-3bd4053171bf.png (1.3 MB)
P1 quokka interacts with quokka NPC
>>108981904
I actually thought it had another name or level number but when I looked it up, turns out it just called "the end"
>>
>>
>>
>>
File: artistism2-zit-2026-06-04_00140_.png (2.7 MB)
comfy desktop is tryin to go all in on the cloudshit. i hope they didn't vibecode this migration process too much. who am i kidding? this shit is gonna fuck up hard fml
>>
>>
>>
File: 00000239-412837913976165-z-turbo-mt.jpg (1.2 MB)
>>108982456
>>108982465
>>108982509
what happen?
>>
File: sekhmet-anima-2026-06-04_00009_.png (2.2 MB)
>>108982536
i did what you always say not to: updated comfy lol
>>
File: 00000245-499195587451219-z-turbo-mt.jpg (1.2 MB)
>>108982550
>he pulled
>>
File: debo_gn_anima1_00002_.png (3.1 MB)
>>108982210
>>108982287
>>108982367
#notmychromagirl
different lora config or something?
>>108982550
>updated comfy
are you suicidal?
>>
File: 00000248-43031616708758-z-turbo-mt.jpg (1.2 MB)
>>108982684
huh, does she look different?
>>
>>
File: debo_gn_anima1_00010_.png (3.4 MB)
>>108982706
her identity has been rather unstable lately
>>
File: 00000251-526683023639819-z-turbo-mt.jpg (1.3 MB)
>>108982736
always was unstable
>>
>>
>>
>>
File: debo_gn_anima1_00013_.png (2.8 MB)
>>108982782
I spose thats true
>>
File: 00000256-648490644120977-z-turbo-mt.jpg (1.3 MB)
>>108982849
if anything the lora is having more influence now due to certain changes in a few nodes (not the lora strength tho)
>>
>>
>>
>>
File: sekhmet-anima-2026-06-04_00082_.png (2.2 MB)
>>108983047
gn
>>
File: debo_gn_anima1_00015_.png (3.1 MB)
>>108983047
gn
>>108983120
redpill me on opencode (again)
>>
File: sekhmet-anima-2026-06-04_00066_.png (2.3 MB)
>>108983131
ah, well it's open source. u can use openai subscription, if u don't have that (you should) they have a $10/month OpenCode Go thing that has a bunch of chinese models which i've heard are pretty good now. tui or web interface, extensible. no vendor lockin. i use mostly that and lately claude code bc we got it at work now and it's the only one that can do remote ssh workspaces (not a big fan of anthropic in general tho). tui is more performant than claude code, it isn't vibe coded or not very much (claude code is 100% vibecoded and it shows)
>>
File: file.png (457.7 KB)
>>108983152
all the agents harnesses are pretty similar now, but really the fact that you can use your openai sub is the highlight and i guess i got used to it before codex got decent. if it had the remote ssh workspaces i'd never use anything else. it works in vscode which is nice. gpt-5.5 is really really really good and the usage limits are far more generous than anthropic. i only have Plus and i've never ran into usage issues. with opus u get like 3 messages and ur done unless u shell out for the expensive plans
>>
File: debo_gn_anima1_00016_.png (2.4 MB)
>>108983152
so you're not using go? you're just using gpt? which models/reasoning do you use?
>and lately claude code
sonnet? I haven't been having the best experience with it in vscode.. opus is too expensive for me
>>108983183
>he agents harnesses are pretty similar now
I was reading that models tend to work better within their own harness (gpt likes codex, claude like claude code, etc). but I think you've said you've never fucked around with codex, no?
>openai sub
you're on the max plan, right? do you bump into the usage limits at all?
>it works in vscode which is nice
I might deprecate vscode in my workflow cuz I canceled copilot. I'm trying to figure out which service to pivot into tho
>>
File: sekhmet-anima-2026-06-04_00089_.png (2.4 MB)
>>108983194
no i just use my ChatGPT Plus ($20/month). in CC i've been using Opus but not for anything super heavy. I could put myself on the $125 premium seat at work if i wanted to but meh.
opencode puts work into making a lot of models work well, i haven't noticed any big difference between it and codex. with gpt-5.5, i mostly stick with medium but i hear the way to go is planning w/ high/xhigh then implementation with low/medium. as for vscode, it's my standard text editor, tacking on agents is a bonus. go with OpenAI, start with plus and see how that goes. If i ever manage to talk you into talking to LLMs 5.5-thinking Extended (with friendly/warm persona) is bae and codex usage is separate from chatgpt. in terms of usage, i may not be representative
>>
File: debo_gn_anima1_00018_.png (3.0 MB)
>>108983214
>go with OpenAI, start with plus and see how that goes
thats what I've been thinking. worst case scenario, I just cancel after a month
>>
File: sekhmet-anima-2026-06-05_00003_.png (2.1 MB)
>>108983242
maybe you'll be tempted to talk to it and get a little ai psychosis in ur life :)
>>
File: debo_gn_anima1_00020_.png (2.6 MB)
>>108983282
and waste my tokens? I need those tokens to center divs
>>
File: sekhmet-anima-2026-06-04_00095_.png (2.2 MB)
>>108983292
good news! chatgpt and codex usage is separate!
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108983668
couldn't resist
>>108984795
gm - same anon.
>>
File: 00000001-515586480038629-z-turbo-mt.jpg (1.3 MB)
>>108984795
>>108985218
gm
>>
>>108985225
nice
are those Boltzmann brains?
>>
File: 00000002-717728158884762-z-turbo-mt.jpg (1.2 MB)
>>108985248
heh no, they are zit hallucinations
>>
>>
>>108984074
>enhance
>enhance
>ah got the license plate
...will become a reality
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: file.png (1.8 MB)
this is my first image i made with my gpu
its shit
idk if i need to get immensely better at prompting or i need a better model
grok told me to use flux1-schenell-q5_0.gguf for my 4070
the picture is shit but its my first so it has something a little special to it.