File: collage.jpg (1.3 MB)
Discussion and Development of Local Image and Video Models
Previous: >>108951930
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, & Upscalers
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/tdrussell/diffusion-pipe
https://github.com/kohya-ss/sd-scripts
https://github.com/kohya-ss/musubi-tuner
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
https://animadex.net
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>Wan
https://github.com/Wan-Video/Wan2.2
>LTX-2.3
https://huggingface.co/collections/Lightricks/ltx-23
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/b/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
Showing all 110 replies.
>>
>>
>>
>>108958327
>>Maintain Thread Quality
>https://rentry.org/debo
>https://rentry.org/animanon
This is like a troon tramp stamp. Guaranteed to have melties about "Julien" and "Nik". Debo standing by
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: ss (2026-06-01 at 14.37.51).png (433.0 KB)
>>108958461
Too bad, order placed
>>
>>108958327
Thank you for baking this thread, anon
>>108958345
Thank you for blessing this thread, anon
>>
>mfw Resource news
06/01/2026
>Bernini Latent Semantic Planning for Video Diffusion
https://bernini-ai.github.io
>NVIDIA Launches Cosmos 3, the Open Frontier Foundation Model for Physical AI
https://nvidianews.nvidia.com/news/nvidia-launches-cosmos-3-the-open-f rontier-foundation-model-for-physic al-ai
>LVSA: Training-Free Sparse Attention for Long Video Diffusion
https://github.com/JiusiServe/LongVideoSparseAttention
>RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video
https://compvis.github.io/rayder
>DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory
https://jeffreyyzh.github.io/DecMem-Page
>Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models
https://jiazheng-xing.github.io/nexus-lumos-home
>Envisioning Beyond the Few: Disentangled Semantics and Primitives for Few-Shot Atypical Layout-to-Image Generation
https://github.com/iCVTEAM/DSP
>PEEK: Picking Essential frames via Efficient Knowledge distillation
https://github.com/momentslab/peek
>CameraNoise: Enabling Faithful Camera Control in Video Diffusion through Geometry-Flow-Guided Noise Warping
https://gulucaptain.github.io/CameraNoise
>Nvidia unveils new superchip to bring AI functions into personal computers
https://www.cbc.ca/news/business/nvidia-ai-personal-computer-9.7218820
>Qwen3.7-Plus: Multimodal Agent Intelligence
https://qwen.ai/blog?id=qwen3.7-plus
05/31/2026
>FLUX Identity Adjuster (V2)
https://github.com/Magirad/Flux_ID_Adjuster_V2
>ComfyUI AnimaFastTrain
https://github.com/quinteroac/ComfyUI-AnimaFastTrain
>MONET: Open-source dataset
https://huggingface.co/datasets/jasperai/monet
05/30/2026
>Pixal3D — Apple Silicon (MPS / Metal) Port
https://github.com/pawel-mazurkiewicz/Pixal3D-mac
>Comfy-Org/PixelDiT (diffusion models & upscalers)
https://huggingface.co/Comfy-Org/PixelDiT/tree/main/diffusion_models
>Orion4D Generative Paint
https://github.com/orion4d/Orion4D_generative_paint
>>
>mfw Research news
06/01/2026
>DTG-Restore: Training-Free Diffusion Refinement for Generative Video Super-Resolution
https://arxiv.org/abs/2605.30431
>TunerDiT: Training-free Progressive Steering of Diffusion Transformer for Multi-Event Video Generation
https://arxiv.org/abs/2605.31590
>SlotMemory: Object-Centric KV Memory for Streaming Long-Video Generation
https://tj12323.github.io/SlotMemory
>SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer
https://arxiv.org/abs/2605.30409
>OmniMem: Scalable and Adaptive Memory Retrieval for Long Video Generation
https://wuyushuwys.github.io/OmniMem
>Robust Dreamer: Deviation-Aware Latent Gaussian Memory for Action-Controlled AR Video Generation
https://arxiv.org/abs/2605.30855
>Mitigating Content Shift and Hallucination in GenAI Image Editing via Structural Refinement
https://arxiv.org/abs/2605.30437
>Parallel Tempering Initial Sampling in Inference-Time Reward Alignment
https://arxiv.org/abs/2605.30991
>Benchmarking and Enhancing Text-to-Image Models for Generating Visual Representations in Early Arithmetic Education
https://arxiv.org/abs/2605.31212
>Benchmarking Single-Step Inpainting Methods for Multi-Object 3D Gaussian Splatting Scenes
https://arxiv.org/abs/2605.30987
>MergeTok: Unified Continuous and Discrete Visual Tokenization via Token Merging
https://arxiv.org/abs/2605.30904
>Guidance for Low-Level Perceptual Editing in Unconditional Diffusion Models
https://arxiv.org/abs/2605.31162
>Representation Forcing for Bottleneck-Free Unified Multimodal Models
https://yuqingwang1029.github.io/RepresentationForcing
>A Unifying View of Variational Generative Wasserstein Flows
https://arxiv.org/abs/2605.31369
>Vision-Language Models Suppress Female Representations Under Ambiguous Input
https://arxiv.org/abs/2605.31556
>What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness
https://arxiv.org/abs/2605.30911
>>
>Using comfyUI
>Crashes
>Loads workflow
>Missing model
>Download model
>0% for 20 minutes
>Restart
>Crashes
>Try to download model again
>Stucked at 25%
>You need this extension
>Git clone
>Doesn't work
>Crashes
I sure do fucking love comfyUI, which is not comfy at all
>>
>>
>>
File: civit payouts.png (56.5 KB)
>>108958458
The website is total garbage and it is not worth engaging with the brown userbase for the sake of pennies they are paying.
200$ is a normal salary in India apparently but nothing worthwhile where I live.
You also need to game the system by spamming lots of poorly trained mediocre loras and jeetmixes to farm meaningful amount of buzz.
Not to mention, I have no faith that the website will be around for long, or that they won't arbitrarily suspend caching out.
>>
>>
File: 201637CUI_00002_.png (1.1 MB)
>>108958697
How is Civitai still standing anyway? It must cost millions a month to maintain it. Who is funding it?
>>
>>
>>108958629
>>108958637
thanks!
>>
>>
>>
>>
>>
>>
>>
>>108959079
Let's take a look at a couple famous chinese sayings.
>He who has never been cheated cannot be a good businessman
>If you can cheat, then cheat
>The first time you cheat me, be ashamed. The second time it is I who must be ashamed.
>>
>>
>>108959275
Forgot to redirect the post, sorry >>108959236
>>
>>
>>
>>
>>
>>
I'm trying to setup pixal3d in comfy and I'm becoming insane. There is always something breaking. Is there a guide or something?
I'm so tilted right now and I hate comfy with ally heart and soul I fucking hate it. I just want to use pixal3d.
>>
>started training at 40 epochs
>now extended 4 times to 100 epochs and probably counting because validation STILL keeps fucking dropping and samples STILL keep fucking improving
the things one does to goon in peace
>>
>>
>>
>>
>>
https://huggingface.co/RuneXX/LTX-2.3-Workflows/blob/main/Video-2-Vide o/Extend-Any-Video/LTX-2.3_-_V2V_Ex tend_Any_Video_Multi-Extend_long_vi deo.json
can extend any video and even clone voices, ltx 2.3 is pretty versatile.
https://files.catbox.moe/mmu2it.mp4
>>
File: 1770335662574855.png (5.6 KB)
>>108959965
Mia Yikk won, mikutroons btfo
>>
>>
I managed to make pixal3d work. Despite the model themselves being good generations, the textures are fucking ass, specially the eyes. Any reason what I could be doing wrong? Because the example images and videos I've seen seem pretty accurate to the source image.
>>
>>
>>
>>
>>
>>108960409
>a single term in a joke already got a mikutroon panties up in a bunch to search the entire archives for it and do the raped schizo special of accusing all people who ever used that word to be the same person
most mentally sane and not tranny-like mukutroon behaviour
>>
>>
>>108958647
>>108958666
I haven't used the regular frontend in months. No one should.
>>
File: zit_00003_.png (1.1 MB)
>>108958327
>>108958327
>Discussion and Development of Local Image and Video Models
AND MUSIC!!!
>>
>>
>>
>>
>>
>>
>>
File: FK9B__00003_.png (1.6 MB)
prompt from dalle thread:
>>108935557
>>
>>
>>
>>108961259
that guy has a lot of neat ltx 2.3 workflows for diff tasks (video extend, custom audio, whatever).
then I have a basic workflow for z image turbo, klein edit, and some other stuff. but most of the time I just mess with LTX 2.3 i2v, klein edit, or zimage if I want to make realistic stuff.
>>
>>108961295
what an interesting account https://civitai.red/models/2266799/heavens-gate-lets-start-a-vaporwave -ufo-cult
>>
>>
>>
>>
>>
>>
>>
>>
>>108961168
>>108961148
>>108961134
looks like my "fixes" have been causing crashes. I was using a model unloader, idk, we'll see, but looks like a vanilla launch is working better, and without that node. again, we'll see lol
>>
>>
>>
>>
File: 1764307906992637.png (3.0 MB)
>>108961335
>can we have a yuri thread
what should the 2girls be doing?
>>
>>
>>
>>
>>
>>
File: 1772264355612927.png (199.3 KB)
>>108961421
what ever happened to the non-woke traditional gay men like this?
>>
>>
>>
>>
File: 1771549469468585.png (606.7 KB)
>>108961313
a tale as old as your average npc leftshit (underage)
>"its just a rainbow chuddy, it literally doesnt matter!"
>ok
>makes a client-side mod to remove it
>"REEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE"
almost like its not just a rainbow but a humiliation ritual those who you view as enemies have to accept or else get censored or banned
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
ltx director is also fun. the node is like using premiere to add elements in the timeline:
https://files.catbox.moe/0v6ml7.mp4
>>108961922
im using 2.3 distilled fp8, seems fine
>>
>>
>>
>>
>>108961933
>>108961956
ty.
btw quality of the starfield guy is a bit off.
i saw on leddit star trek tng vids where they sing 90s euro-dance songs. must be higher precision since quality is quite up there (sound is a bit off tho).
>>
>>