Thread #108664784
File: highlights_g_108659074_1776900433_1.jpg (2.9 MB)
2.9 MB JPG
Discussion and Development of Local Image and Video Models
Previous: >>108659074
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
331 RepliesView Thread
>>
File: 186950.jpg (1.1 MB)
1.1 MB JPG
What happened to local?
>>
>>
>>
>>108664800
the worst part is that Alibaba is still here, feeding the llm fags, but they seem to have abandoned us :( >>108664796
>>
>>
>>108664800
>>108664809
>GPT was able to deduce Alibaba's sellout plan 2 years ago, we just didn't listen
holy shit, saas is actually insanely powerful
>>
File: nxyz-2026-04-22 12-47-59-er_sde-3.5-32-0098.jpg (650 KB)
650 KB JPG
Why are the api niggers so upset at local? They mad they can't gen boobies or what?
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 1750235241161732.jpg (239.4 KB)
239.4 KB JPG
>>108664820
>They mad they can't gen boobies or what?
have you not seen the grok threads? we can get our fill of boobs whenever we want
>>
>>
>>
>>108664784
Thank you for baking this thread, anon
>>108664802
Thank you for blessing this thread, anon
>>
>mfw Resource news
04/22/2026
>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC
>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing
>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn
>AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model
https://yutian10.github.io/AnyRecon
>SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing
https://github.com/vivoCameraResearch/SmartPhotoCrafter
>Soft Label Pruning and Quantization for Large-Scale Dataset Distillation
https://github.com/he-y/soft-label-pruning-quantization-for-dataset-di stillation
>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
https://github.com/AMAP-ML/EMF
>Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
https://github.com/YonseiML/dpw
>IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow
https://github.com/fanzh03/IR-Flow
>TRELLIS.2-stableprojectorz: Trellis.2 optimized to fit inside 8GB gpus
https://github.com/IgorAherne/TRELLIS.2-stableprojectorz
>Fizgig — Klein 9B LoRA Studio
https://github.com/shootthesound/Fizgig
>Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items
https://huggingface.co/datasets/TaobaoTmall-AlgorithmProducts/Tstars-V TON
04/21/2026
>MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Style Mapping
https://jeoyal.github.io/MegaStyle
>UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
https://github.com/Yovecent/UDM-GRPO
>Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning
https://github.com/NA-HMC/NA-HMC
>>
File: 1748735612394339.jpg (325.8 KB)
325.8 KB JPG
>>108664868
im not upset, im happy as a clam genning these cool pics with gpt image 2
>>
>mfw Research news
04/22/2026
>Memorize When Needed: Decoupled Memory Control for Spatially Consistent Long-Horizon Video Generation
https://arxiv.org/abs/2604.18215
>Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval
https://arxiv.org/abs/2604.19135
>ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis
https://arxiv.org/abs/2604.19720
>Long-Text-to-Image Generation via Compositional Prompt Decomposition
https://arxiv.org/abs/2604.18258
>HP-Edit: A Human-Preference Post-Training Framework for Image Editing
https://arxiv.org/abs/2604.19406
>Geometric Decoupling: Diagnosing the Structural Instability of Latent
https://arxiv.org/abs/2604.18804
>CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers
https://arxiv.org/abs/2604.19632
>Allo SR $^2$: Rectifying One-Step Super-Resolution to Stay Real via Allomorphic Generative Flows
https://arxiv.org/abs/2604.19238
>Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation
https://arxiv.org/abs/2604.19234
>Deep sprite-based image models: An analysis
https://arxiv.org/abs/2604.19480
>LLM-as-Judge Framework for Evaluating Tone-Induced Hallucination in Vision-Language Models
https://arxiv.org/abs/2604.18803
>Hierarchically Robust Zero-shot Vision-language Models
https://arxiv.org/abs/2604.18867
>Rethinking Dataset Distillation: Hard Truths about Soft Labels
https://arxiv.org/abs/2604.18811
>Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning
https://arxiv.org/abs/2604.19009
>Benign Overfitting in Adversarial Training for Vision Transformers
https://arxiv.org/abs/2604.19724
>BARD: Bridging AutoRegressive and Diffusion Vision-Language Models Via Highly Efficient Progressive Block Merging and Stage-Wise Distillation
https://arxiv.org/abs/2604.16514
>>
>>
File: HGjBeyBbUAAoSnV.jpg (235 KB)
235 KB JPG
Why do people throw a fit about Grok/GPT/NBP gens? They are supported fully in ComfyUI now and integrate well into local workflows. Despite this, the same freetards continue to cry over them. This isn't linux general you boomers.
>>
>>108664901
>>108664887
>>108664862
>This isn't linux general you boomers.
it literally is, it's a local thread, if you want to spam your API garbage you're simply off topic, what's hard to understand about that? get the fuck out >>108653190
>>
>>
File: 65236327326123.jpg (1.3 MB)
1.3 MB JPG
>>
>try Images 2.0 to add some text on an image that just happens to have a shirtless man in the background
>I'm focused on the text don't even notice the man. prompt to "make it a little bigger"
>We’re so sorry, but the image we created may violate our guardrails around nudity, sexuality, or erotic content. If you think we got it wrong, please retry or edit your prompt.
local will still be needed.
>>
File: 1758771180858155.png (2.1 MB)
2.1 MB PNG
>>108664993
i dont think gpt image is that strict, i just genned this a minute ago
try playing around with the prompt a little
>>
>>
>>108664993
>We’re so sorry, but the image we created may violate our guardrails around nudity, sexuality, or erotic content. If you think we got it wrong, please retry or edit your prompt.
>change the prompt to say make the text a little bigger
>just werks
maybe another day local will be needed
>>
>>
File: Flux2-Klein_00101_.jpg (265.5 KB)
265.5 KB JPG
>>108664993
>search for court cases
>chatgpt spitting out text
>deletes reply
>Stopped searching
>ask it the same question
>I have to time it and click stop before it deletes reply
Censorship is cancer.
>>
File: 1772551419712987.png (1.5 MB)
1.5 MB PNG
Realism lora for anima
https://civitai.com/models/1662740/lenovo-ultrareal
>>
>>
>>
>>
>>
>>
File: file.png (160 KB)
160 KB PNG
ai could never
>>108665062
my crystal maidens are all chubby now
>>
>>
>>
>>
>>108665149
Apparently I can't read the civit page
>This version corresponds to 40 epochs (120 passes over the data when considering the 3 resolutions)
120x153 steps.
>>108665161
He sets LR low so it needs more steps I think.
>>
>>
>>
>>
>>108665169
>>108665188
To clarify, I mean when using his default hyperparams. It works quite well.
>>
Has anyone experimented whether anima responds nicely to timestep shifts during training btw?
>>108665188
I am skeptical you actually trained a decent anima lora with the method you preach.
I had a very bad time trying to train lora for anima with high LR and lower step counts.
All epochs looked shit.
>>
>>
>>108665198
>>108665192
I also tried a low LR run (coincidentally similar to his, tried before he published his example lora) with 8-10k steps (I think, I don't remember too well), that also looked bad. Maybe the 18k ballpark figure is needed, I intend to try that.
>>
>>
>>108665198
Anima trains fine with a decent dataset, his sane defaults, and somewhere between 2k and 4k steps. Rarely do I have to go up to 4k. Often around 3k is fine. I have yet to feel the need to change any params from his example LoRA.
>>
>>
>>108665205
Just looked it up.
I tried 8k steps with 0.00004 LR. I am planning to try 0.00002 or 0.00003 with 18k or slightly below that.
Oh btw I just remembered his 18k steps is with gradient_accumulation_steps = 4
So does that loosely equal 4500 "real" steps?
>>
>>
>>108665205
>>108665247
Just use his trainer and the config from his example. All I'm meaning is it just werked for me.
>>108665200
You should already be running Linux desu.
>>
File: file.png (3.7 MB)
3.7 MB PNG
>>108665149
I did about 5k steps with 200 images for this stonetoss lora, using big russ's example config. Prior to that I did a couple with prodigy optimizer and they turned out ok too.
>>108665062
Seemed pretty bad when I tested it.
>>108665126
Have you tried InterpAny-Clearer? It's some stuff built on top of RIFE, I never really noticed blurring that much with RIFE, just awful artifacting with fast motion which InterpAny-Clearer fixed for me. It's a bit slower than RIFE I think.
>>
>>
>>
>>
>>
>>
>>
>>
>>108665304
>I did about 5k steps with 200 images for this stonetoss lora, using big russ's example config. Prior to that I did a couple with prodigy optimizer and they turned out ok too.
Thanks for the reference point anon.
>>
>>
File: _AnimaPreview3_00048_.jpg (367.9 KB)
367.9 KB JPG
OneTrainer support FUCKING when
>>
>>
>>
File: GPT 2 Partner Nodes.png (696.4 KB)
696.4 KB PNG
What we gennin' tonight, GPTgods??
>>
>>
i'm getting astray heads/body parts with the anima turbo lora, is there anything i can do to fix it? already tried editing the prompt and changing steps from 12 to 8 but the problem persists. it doesn't happen without the lora. it also doesn't happen if i use the highres aesthetic boost at a certain value (definitely doesn't on 1) but i dislike the aesthetics of turbo + high res, they remove too much detail together
>>
>>108665451
It comes from the SD3 paper. Shift values above 1 make the model spend more time on higher timesteps/sigmas, which improve composition for flow models (anything SD3 or newer). Going too high makes the image blurry/fucks up details so you can't crank it indefinitely. This is for genning images. If you never touched it, Comfy automatically uses the value 3 for most image models, feel free to experiment with model sampling node sometime.
I am not too well versed about precise impact for lora training, but it might help under some situations I believe.
>>
>>
>>108665477
>I am not too well versed about precise impact for lora training, but it might help under some situations I believe.
you adjust it regarding dataset complexity and training resolution
>>108665473
>>108665490
can you post image, not sure what you mean exactly
>>
File: 1776836990754323.jpg (639.1 KB)
639.1 KB JPG
>>
>>
>>
>>
>>
>>
>>
Is the API diffusion thread a psyop? I tried the new OAI one on openrouter, but it really seems like an inferior NBP that's slightly less fascist about censoring in exchange for sucking. Maybe it has to do with Openrouter? But the effusive praise feels kinda paid for.
>>
>>
>>108665695
It's thinly veiled shitposting yes.
I am not sure what quality level openrouter version is running, I tried it on FAL where you can set it to run at high quality.
Regardless only thing it particularly excels at is cramming a lot of text into the image without fucking it up.
Otherwise its abilities range from "maybe good but NBP can already do this" to shit.
>>
>>
>>
>>
>>
>>
>>108665804
Wan 2.2 for video.
Anima for anime/illustration (SFW and NSFW)
Klein 9b for Edit
Realism still Z-Image Turbo?
NSFW Realism Chroma (broken schizo model but you should be able to gen fast until you luck out enough with seed, if you don't mind the body horror)
>>
File: Anima realism.png (3.8 MB)
3.8 MB PNG
>>108665062
https://civitai.red/models/1862761/nicegirls-ultrareal
https://civitai.red/models/1662740/lenovo-ultrareal
https://github.com/DanrisiUA/ComfyUI-LoRA-Block-Filter
it's actually not bad at all, download his workflow and use those 2 lors to get this kind of quality, it's really long though since in his custom node he's using a very slow sampler, I tried to switch to good old euler but it wasn't as good
it's actually not bad at all, you download one of his images and you have to use those 2 loras, it's also fucking
>>
File: AS15T__00009_.png (370.2 KB)
370.2 KB PNG
fix my Anima prompta woman with her right arm amputated at the elbow holds a garment in her right hand, in front of racks of clothing in a department store.
>>
>>
File: ANIMA_P___00014_.png (954.4 KB)
954.4 KB PNG
>>108665840
oops, anima lol
>>
File: ComfyUI_temp_dmcrj_00017_.jpg (788.3 KB)
788.3 KB JPG
>>108665062
that guy has been milking the same dataset for like 2 years already lol
>>
>>
File: ComfyUI_0001.jpg (1.7 MB)
1.7 MB JPG
anima is great, is what chroma should've been
>>
>>
>>108665871
I'm actually impressed by the realism, like it's just a lora on top of a model heavily trained on anime only, why is it so good? did rusell see something we didn't in the fucking chronos architecture?? kek
>>
>>
>>
>>
>>
>>
>>108665857
>>108665871
Catbox please?
>>
File: Anima_0005.jpg (1.7 MB)
1.7 MB JPG
its the ye-pop dataset, but its better if you train a photorealism lora and combine it with anima
>>108665875
Don't fix what isn't broken mentality, sadly they went the reddit grifter route, but it works for them so its ok I guess, but their loras are very underwhelming since their dataset is so low-quality (noisy 2000s photos), it worked with flux but they never improved
>>
>>
File: look at him go!.png (353.6 KB)
353.6 KB PNG
>>108665881
>>108665923
>Nvdia gave us an hidden gem and it took us more than a year to notice
Jensen sempai...
>>
>>
>>108665828
it's funny that it looks so good when russel is still training it at 512x resolution, remember the chroma days when people were coping about the fact that chroma had shit anatomy because of the low res training, and that everything would be magically fixed once he would've switched to 1024x kek
>>
File: Anima_0007.jpg (1.9 MB)
1.9 MB JPG
>>
daily reminder that kekstone is still training Zeta-Chroma-X0-Dino-Nuggies-PixelPred-FlowInversion-UOOHH-pomf-PTHC-ed ition instead of using his compute to train Anima for like one or two epochs on the Chroma dataset which is all it would take
>>
File: Anima_0009.jpg (1.7 MB)
1.7 MB JPG
>>
>>108665975
>>108665963
Images like this kind of creep me out. It's like a glimpse into an alternate reality but there's no soul involved at all so it just feels like something that could have been but never was. I don't like it. I'm not even talking about your gens in particular just all of these non-chalant/natural instagram type "photos".
>>
File: based.gif (956.7 KB)
956.7 KB GIF
>>108665963
>>108665975
it can do good realism while having gozillions of anime characters and style, while doing good NSFW and the anatomy is actually pretty good too, sasuka trusell...
>>
>>108665969
I like..or liked Chroma, but that model is just too slow and clunky to run, you gotta play with the unet, add some flash loras, play with the loras weights, do a 2-pass, its just too messy, the only reason I like it, is that its dataset its really good and diverse and you can also can use Redux since its flux based, but anima + zturbo is just faster, has better prompt understanding
>>
File: Anima_0002.jpg (1.5 MB)
1.5 MB JPG
>>108665993
oops, forgot gen
>>
>>108665969
I can feel he's gonna try something on anima, make it pixel only and destroys it like he destroyed everything before kek, but even with all those failures I can't hate on the guy, I too want a future without a fucking VAE, that's what it takes to get the perfect edit model
>>
i tell LTX2.3 to walk the camera to a different spot in the scene and then i use an image model to recreate the details. it works better than asking the image model to move since it doesn't have any temporal consistency
>>
File: Anima_0003.jpg (1.5 MB)
1.5 MB JPG
>>108665990
Where are those people who said that training a lora on anima was bad because it started to forget stuff ?
>>
>>
>>108666010
if only it wasn't that long... but it's just the begining, if we can get this quality from an unfinished finetune, we're definitely not ready for the final version + a better realism lora on top of it
>>
File: Anima_0004.jpg (1.5 MB)
1.5 MB JPG
>>108666000
nice digits,
To me its just another cycle of the model training lucky strike, aka, someone trains a model, its very successful but can't replicate it when another new model and another architecture arrives, it happened with
SDXL => Juggernaut and Pony => Couldn't replicate its success
Flux => Chroma => Didn't finish training, went on a schizo path of experiments and crap that no one uses
I think the only sucessful guy was the LEOSAM's HelloWorld trainer that went on working with WAN-AI
and lets not mention ani, who never succeed at anything ai-related
>>
Man, civitai is really sensitive. It's rating one of my images on my model as rated R despite there being more than tame. I like to keep the main images SFW but I don't think it even matters.
Pic unrel.
>>
File: Anima_0010.jpg (1.8 MB)
1.8 MB JPG
>>
File: 2026-04-22-23-34-14_00001_.png (2.9 MB)
2.9 MB PNG
>>108666070
>>
>>
File: Anima_0011.jpg (1.7 MB)
1.7 MB JPG
>>
>>
>>
>>
File: Anima_0013.jpg (1.6 MB)
1.6 MB JPG
>>
>>
>>108666077
>>108666106
Workflow?
>>
File: nxyz-2026-04-22 23-31-02-er_sde-3.0-32-0246.jpg (573.1 KB)
573.1 KB JPG
my 1girl, sire
>>
>>
>>
>>108666167
provide the dataset and have it tagged if you want someone to do that for you. i'm training on anima atm if you provide the dataset and use anima i'll train it on there
>>108666178
This is anima, i'm still tweaking my lora. anima is new to me only been using it for 2 days
>>
>>
>>
>>
>>
>>
>>
File: nxyz-2026-04-22 23-55-06-er_sde-3.0-32-0253.jpg (861.2 KB)
861.2 KB JPG
>>108666208
Damn pardner I'm tryin
>>
>>
>>
>>108666208
just go to danbooru and gather tags and you're good to go, just one thing though, for artists it goes like this
>artist -> @artist
but, for artist tags that have parenthesis, it goes like this
>kaamin_(mariarose753) -> @kaamin \(mariarose753\)
here's how you can easily retrieve tags from an image
https://booruprompt.vercel.app/booru-tag
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 1762012047028957.png (49.2 KB)
49.2 KB PNG
>>108666242
nice workflow anon, btw if you want to use less memory overall, go for DFloat11 for Z-image turbo, it's lossless and it's 30% lighter
https://github.com/mingyi456/ComfyUI-DFloat11-Extended
https://huggingface.co/mingyi456/Z-Image-Turbo-DF11-ComfyUI
>>
File: Anima_0030.jpg (1.5 MB)
1.5 MB JPG
>>
>>
File: oda-non.png (44.6 KB)
44.6 KB PNG
no oda-non in anima, sad
>>
File: 1760497052365822.png (1 MB)
1 MB PNG
>>108666342
Thanks. Gotta experiment with the ratios.
>>
File: 1776096388896084.png (1.8 MB)
1.8 MB PNG
GPT2 is bretty gud but it's not local so it doesn't matter.
>>
>>108666420
>~500 hits on booru
They're probably in the model and that site is out of date. v3 has at least seen artists with less than ~100 hits. https://tagexplorer.github.io/#/artists is the better one desu (oda-non isn't listed in it either tho).
>>
>>108666420
>>108666450
>https://tagexplorer.github.io/#/artists
oda-non is actually in this one just without the dash
>>
>>108664784
>Comfy
why hasn't anyone porked the comfyui before he started going tarded?
keep it neat and clean without the new unnecessary nonsense and especially 'partner nodes' + super-duper-amzing fronted updoots
>you do it
i dont do githubs nor python
>>
File: 1758351277778665.jpg (645.5 KB)
645.5 KB JPG
>>108666242
thanks for the workflow but personally I'm not a big fan, Z-image turbo removes the sovl out of it
>>
>>
File: Anima_0035.jpg (1.7 MB)
1.7 MB JPG
>>
>>
>>
>>
File: file.png (2.8 MB)
2.8 MB PNG
>>108666509
it happens
>>
File: Anima_0038.jpg (1.7 MB)
1.7 MB JPG
>>
File: 1759184137887676.png (738.6 KB)
738.6 KB PNG
>>
File: a-yume04_00055_.png (769.8 KB)
769.8 KB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 3216845213285132858452.png (2.5 MB)
2.5 MB PNG
>>108666789
just make a half-assed one in the meanwhile, it doesn't take long.
>>
>>
>>
>>
File: anima_00092_.png (2.5 MB)
2.5 MB PNG
>>
>>
>>108666841
>>108666841
https://civitai.red/models/1662740/lenovo-ultrareal
https://civitai.red/models/1862761/nicegirls-ultrareal
>>
File: milking-the-cow-2667676549-1746189555.2937.jpg (183.9 KB)
183.9 KB JPG
>>108666854
>those datasets
boy I can't wait for the inevitable Grainscape-ULTRAREAL-Anima.safetensors
>>
File: paper-zit-2026-04-23_00106_.png (3.2 MB)
3.2 MB PNG
>>
>>
>>
it's crazy how you can take an anima gen with a decent, realistic looking penis, send it to zit for a 2nd pass, and all you'll get is body horror, no matter which lora you use. Same with skimpy underwear, cameltoes, bikinis. I hope something comes along soon that can replace that piece of shit
>>
>>
File: a-yume04_00020_.png (1.3 MB)
1.3 MB PNG
>>
>>
>>
File: paper-zit-2026-04-23_00136_.png (3.1 MB)
3.1 MB PNG
>>
>>
use anima for the base gen and get a segmenter. then run z for a background detailer pass and chroma for a subject detailer pass. or just use saas instead of coping with 5+ different models just to generate a basic plastic 1girl
>>
>>
>>
>>
>>
>>108664862
>>108664887
>>108664901
>>108666432
Are these real, what the fuck?
>>
>>
>>108665374
Forgetting persists, and many anime centered checkpoint and lora makers have moved from Anima. That’s why furry and realistic styles are rising along with non anime styles type of loras like him >>108665304
Also locking training the model behind Linux is unfair and elitist. I don’t have time to learn a bloated OS just for “muh open source" snow flake ideology, that mindset is very first world.
>>
File: a-yume04_00069.png (1.6 MB)
1.6 MB PNG
>>
>>
>>
>>
>>
>>108667396
i've trained several loras on windows so far and haven't had a single issue.
it took like 200 steps on my first lora to dial in settings i was happy with, after that it was smooth sailing.
preview 3 has been out for awhile now, we should have lots of examples of "it's catastrophic memory issues." would you care you link some?
>>
>>108667440
>unironically responding to concern troll
lol
let me give you a timeline
>anima releases
>AAAAAAAAH THE LICENSE IS BAD!!!
>oh wow 1M grant TIME TO SHITPOST APACHE ANIMA!!! I messaged all the devs on discord, they didnt reply but theyre 100% onboard:)))))))))
>anima 2 releases
>ARGHHH stop using anima!!! we're totally cooking the new model and its gonna be free! here check this website I totally not made to bitch about anima stealing artists work!!!
>anima 3 releases
>AAGH ANIMA IS UNTRAINABLE! MUH CATASTROPHIC FORGETTERING!!!
>the failed devs go to bitch on HF (btw they have a competing completely melted and shit CLIP SDXL with flux vae model which is complete garbage so they 100% have interest in bringing anima down while propping up their literal shit)
>tdrussel provides a retard proof method to train loras to shut these retards up
>more good loras started getting made
>still tries to gaslight anons about stuff that doesn't exist
>copes daily about being a failed dev and being thrown out by comfy for being a literal psychotic backstabbing retard
ok, next time please just ignore this retard
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 2026-04-21-17h56m42s_seed478690599_the girls sit on each side of the man. realistic,.jpg (468.4 KB)
468.4 KB JPG
>>108667530
i just throw random pics at it
>>
>>108667496
You say that as a realist or non anime genner, talk to me when you have the intention of adding the latest gacha whores post September 2025 and see how the model fails catastrophically and you will come crying at my feet, you and your Linux distros that you used to train that failed model.
>>
>>
File: Flux2-Klein_00225_.png (1.7 MB)
1.7 MB PNG
>>108667575
sounds like a real waste of time to me
>>
>>
>>
>>
>>
>>108667640
>that lora doesn't count because the anime it is from isn't obscure enough.
there are already thousands of anima loras on civitai, no one is complaining about catastrophic forgetting or anything resembling your lies.
>>
>>
>>
>>
>>
File: 1766307597374317.png (2.2 MB)
2.2 MB PNG
>>108667596
>Yeah, it's too strict compared to Gemini. All that trouble doesn't seem worth it for a side-grade.
its not allergic to genning big tits and nice looking bodies so i wouldnt call it a sidegrade
just have to figure out how to prompt it without triggering guardrails
>>
>>
>>108667749
>>108667754
Saar bros we are eating good bitch basterd
>>
File: 16124627468.png (346.8 KB)
346.8 KB PNG
cooking up some ltx2.3 kinos
>>
File: 1759383206572375.png (1.7 MB)
1.7 MB PNG
>>108667754
>>108667761
localcucks are arguing about shitty anime loras lmao
they've completely given up on realism (sad)
>>
>>
>>
>>108665826
>Wan 2.2 for video
I've been out for a few months, trying to catch up.
And this is STILL the best local video model? Nothing else was released that could compete? That sucks man, I hate the constant loading and unloading of using two models.
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108668187
it's i2v no matter what res is 512 or 1024 fps 24 or 60
if you are trying to say skill issue it's not
look at ltx loras descriptions and comments on civitai
you will see 10k+ sometimes 50k+ steps of training
it's just ltx is garbage
>>
File: 748431728931741.png (2.2 MB)
2.2 MB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 1754788813493180.jpg (43.5 KB)
43.5 KB JPG
>>108664784
I love my RTX 5070ti so much bros :) All these degenerate illegal sloppa i gen :-))
>>
>>
>>108668408
https://github.com/gazingstars123/Anima-Standalone-Trainer
>>
>>
>>
>>
>>
File: Ernie-Image-Turbo_00066_.png (1.1 MB)
1.1 MB PNG
Trained an Iryna LoRA on Ernie. This time in Comfy instead of diffusion-pipe. rank 32, 5000 steps https://gofile.io/d/ljv01e
>>
>>
>>
>>
>>108668483
unironically, anima will be the best model to train realistic shit on it, it's surprisignly good at it
>>108665996
>>108665963
>>108665828
>>
>>108668433
He is a coward desu, imagine what the community could accomplish if he debated with Bluvol, Azhnek and Laxar there and became friends with them, but he takes refuge here with his mommy /ldg/ who gives him milk and warm cookies and pats him on the back for whatever crap he does.
>>
>>
>>108668480
elaborate
>>108668488
two digits iq subhuman spotted
>>
>>
File: 1757217129499855.png (377.3 KB)
377.3 KB PNG
>>108668517
>Bluvol, Azhnek and Laxar
who?? see? the fact I'm asking that question says everything, those guys haven't proved anything and you want them to ruin this projct? lmao
>>
>>
>>
>>108668488
>are you retarded?
>>108668525
>yes
that's what I thought
>>
>>108668523
Ipositive or negative in superficial ways. Bluvoll practically lives in noob cord testing Anima things with Aznek and they recently made a VAE of Anima 2d that requires much less VRAM. They spend more time with Anima than tdrusel himself.
>>
Ultimately Comfy was right to give the money to tdrussel, Anima is genuinely an insanely good model, he's probably the most talented guy we ever had on the local diffusion ecosystem when it comes to training stuff and not make them bad
>>
>>
>>
File: animatrashlop.png (3.7 MB)
3.7 MB PNG
how is this thing realistic at all?
>>
>>
>>108668555
https://huggingface.co/Anzhc/Qwen2D-VAE
Faster gens
>>
>>108668559
>how is this thing realistic at all?
did you use this specific workflow? it's the one that makes anima pretty good at realism
https://civitai.red/images/128285430
>>108668568
what about the quality though? if it's worse it's not worth it, I'll give it a try though, thanks for the link
>>
>>
>>
File: Ernie-Image_00016_.png (1 MB)
1 MB PNG
>>108668499
>why comfy
To see what it's like you virgin. I'll try anything once. 5k steps in under 4 hours @ 1024.
>>108668502
idgaf, just dropping something anonymously
>>108668509
Tried a realism LoRA on the first preview, but it didn't go well. Will try again after final release in 2 weeks
>>
>>
>>
>>108665464
go back home and kys retard
>>108653190
>>
File: file.png (3.8 MB)
3.8 MB PNG
>>108664784
Flux 2. Kline 4B - comfyui portable + manager plugin. pre-bundled setup.
[prompt] A bovine moo cow wearing a tiny Christmas hat and a scarf and little hoof-covering-mitten-boots surrounded by Christmas presents. 1green hyena wearing hat and a scarf. early VHS camcorder style, slight noise, flash photography, candid moment, 1990s VHS aesthetic, festive Christmas Eve snowy night holiday celebration atmosphere. happy cow. happy facial expression of joy on cow and green hyena. scene takes place inside comfortable well furnished opulent rustic multistory barn stable converted into home. roaring comfy cozy atmosphere festive beautiful emotional aesthetic. happy joyful smiling animals.
>>
File: 1757440706879373.jpg (673.1 KB)
673.1 KB JPG
>>108668568
>>108668596
>>108668617
it's not much but I'll take it, it's 100% lossless indeed, install that custom node and you can load it with the normal vae loader
https://github.com/Anzhc/anzhc-qwen2d-comfyui
>>
>>
>>
>>108668701
>>108668705
>too retarded to ask a LLM if the script is sus or not
yep, that's how I imagined the intelligence of Anima haters
>>
>>
>>
>>
>>108668690
>lossless
nigga is blind
>>108668711
Unless your shit is total gamechanger it doesn't deserve fiddling with nodes. Other finetuned VAEs have no issues working in native nodes.
>>
>>
>>
File: 1759640827939535.jpg (716.9 KB)
716.9 KB JPG
>>
>>