Thread #108664784
HomeIndexCatalogAll ThreadsNew ThreadReply
H
Discussion and Development of Local Image and Video Models

Previous: >>108659074

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
+Showing all 331 replies.
>>
File: 186950.jpg (1.1 MB)
1.1 MB
1.1 MB JPG
What happened to local?
>>
Blessed thread of frenship
>>
>>108664800
may 2024 you say
>>
>>108664800
the worst part is that Alibaba is still here, feeding the llm fags, but they seem to have abandoned us :( >>108664796
>>
Am I supposed to be upset that a company is no longer supporting local? It's happened before and every time someone else steps up.
>>
>>108664800
>>108664809
>GPT was able to deduce Alibaba's sellout plan 2 years ago, we just didn't listen
holy shit, saas is actually insanely powerful
>>
Why are the api niggers so upset at local? They mad they can't gen boobies or what?
>>
>>108664820
at this point its instigating or participating in a flame war desu
>>
>armchair jannying as if jannies give a single fuck about AI threads
>>
>>108664817
>It's happened before and every time someone else steps up.
yeah anyone crying about it is either very very new or just trolling
>>
>>108664817
>It's happened before and every time someone else steps up.
what if no one else steps up?
>>
>I am supposed to be upset she slept with another man? It's happened before
they really are localkeks after all!
>>
Are you equating a company with a woman because you have never had sex before
>>
>>108664800
sharty alert
sharty alert
sharty retard
>>
>>108664833
go on then post some nippies or vagene
>>
>>108664820
>They mad they can't gen boobies or what?
have you not seen the grok threads? we can get our fill of boobs whenever we want
>>
Local Diffusion?
>>
>>108664862
Then why is you so upset little nigga
>>
>>108664784
Thank you for baking this thread, anon
>>108664802
Thank you for blessing this thread, anon
>>
>mfw Resource news

04/22/2026

>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC

>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing

>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn

>AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model
https://yutian10.github.io/AnyRecon

>SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing
https://github.com/vivoCameraResearch/SmartPhotoCrafter

>Soft Label Pruning and Quantization for Large-Scale Dataset Distillation
https://github.com/he-y/soft-label-pruning-quantization-for-dataset-distillation

>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
https://github.com/AMAP-ML/EMF

>Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
https://github.com/YonseiML/dpw

>IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow
https://github.com/fanzh03/IR-Flow

>TRELLIS.2-stableprojectorz: Trellis.2 optimized to fit inside 8GB gpus
https://github.com/IgorAherne/TRELLIS.2-stableprojectorz

>Fizgig — Klein 9B LoRA Studio
https://github.com/shootthesound/Fizgig

>Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items
https://huggingface.co/datasets/TaobaoTmall-AlgorithmProducts/Tstars-VTON

04/21/2026

>MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Style Mapping
https://jeoyal.github.io/MegaStyle

>UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models
https://github.com/Yovecent/UDM-GRPO

>Noise-Adaptive Diffusion Sampling for Inverse Problems Without Task-Specific Tuning
https://github.com/NA-HMC/NA-HMC
>>
>>108664868
im not upset, im happy as a clam genning these cool pics with gpt image 2
>>
>mfw Research news

04/22/2026

>Memorize When Needed: Decoupled Memory Control for Spatially Consistent Long-Horizon Video Generation
https://arxiv.org/abs/2604.18215

>Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval
https://arxiv.org/abs/2604.19135

>ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis
https://arxiv.org/abs/2604.19720

>Long-Text-to-Image Generation via Compositional Prompt Decomposition
https://arxiv.org/abs/2604.18258

>HP-Edit: A Human-Preference Post-Training Framework for Image Editing
https://arxiv.org/abs/2604.19406

>Geometric Decoupling: Diagnosing the Structural Instability of Latent
https://arxiv.org/abs/2604.18804

>CreatiParser: Generative Image Parsing of Raster Graphic Designs into Editable Layers
https://arxiv.org/abs/2604.19632

>Allo SR $^2$: Rectifying One-Step Super-Resolution to Stay Real via Allomorphic Generative Flows
https://arxiv.org/abs/2604.19238

>Learning to Credit the Right Steps: Objective-aware Process Optimization for Visual Generation
https://arxiv.org/abs/2604.19234

>Deep sprite-based image models: An analysis
https://arxiv.org/abs/2604.19480

>LLM-as-Judge Framework for Evaluating Tone-Induced Hallucination in Vision-Language Models
https://arxiv.org/abs/2604.18803

>Hierarchically Robust Zero-shot Vision-language Models
https://arxiv.org/abs/2604.18867

>Rethinking Dataset Distillation: Hard Truths about Soft Labels
https://arxiv.org/abs/2604.18811

>Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning
https://arxiv.org/abs/2604.19009

>Benign Overfitting in Adversarial Training for Vision Transformers
https://arxiv.org/abs/2604.19724

>BARD: Bridging AutoRegressive and Diffusion Vision-Language Models Via Highly Efficient Progressive Block Merging and Stage-Wise Distillation
https://arxiv.org/abs/2604.16514
>>
>>108664887
is that GW2?
>>
Why do people throw a fit about Grok/GPT/NBP gens? They are supported fully in ComfyUI now and integrate well into local workflows. Despite this, the same freetards continue to cry over them. This isn't linux general you boomers.
>>
>>108664901
>>108664887
>>108664862
>This isn't linux general you boomers.
it literally is, it's a local thread, if you want to spam your API garbage you're simply off topic, what's hard to understand about that? get the fuck out >>108653190
>>
imagine local living so rent free in your mind that you cant stop lurking and posting here KEK
>>
>>108664901
>>108653190
>>
>>
>try Images 2.0 to add some text on an image that just happens to have a shirtless man in the background
>I'm focused on the text don't even notice the man. prompt to "make it a little bigger"
>We’re so sorry, but the image we created may violate our guardrails around nudity, sexuality, or erotic content. If you think we got it wrong, please retry or edit your prompt.
local will still be needed.
>>
>>108664993
i dont think gpt image is that strict, i just genned this a minute ago
try playing around with the prompt a little
>>
>>108665013
>>108653190
>>
>>108664901
is this really what your life has come down to? Doing le epic troll 8+ hours a day on /ldg/? Surely there are better ways of spending your limited time on this planet?
>>
>>108664993
>We’re so sorry, but the image we created may violate our guardrails around nudity, sexuality, or erotic content. If you think we got it wrong, please retry or edit your prompt.
>change the prompt to say make the text a little bigger
>just werks
maybe another day local will be needed
>>
>>108664901
>grok

Enjoy your 480p vertical videos gens lmao
>>
>>108664993
>search for court cases
>chatgpt spitting out text
>deletes reply
>Stopped searching
>ask it the same question
>I have to time it and click stop before it deletes reply
Censorship is cancer.
>>
Realism lora for anima
https://civitai.com/models/1662740/lenovo-ultrareal
>>
>>108665062
huge
>>
>>108665062
It'll be so easy to pick up a fuck ton of cred by making a very simple style lora for realism with anima. Fuck I should do that.
>>
>>108665062
>Only trained on 30 pics.
No way is that enough. I sleep.
>>
>>108665089
Where does it say 30 images?
>>
are there any good frame interpolators? RIFE just blurs between frames
>>
File: file.png (160 KB)
160 KB
160 KB PNG
ai could never

>>108665062
my crystal maidens are all chubby now
>>
>>108665115
In the lora metadata.
>>
So on tdrussell's Rutowski lora config it says 1000 epochs on a 153 image dataset.
Surely that doesn't mean anima needs 153k steps for a lora?
How many epochs did it actually go through?
>>
>>108665149
Going off epochs is for chumps. It's 2k - 4k steps depending on the style.
>>
>>108665149
Apparently I can't read the civit page
>This version corresponds to 40 epochs (120 passes over the data when considering the 3 resolutions)
120x153 steps.
>>108665161
He sets LR low so it needs more steps I think.
>>
>>108665149
I'm literally overfitting with 250 steps bs4
>>
>>108665149
Anima training is for Linux, retvrun to mugen
>>
>>108665169
Just get your LoRA to 2k steps and start saving every 500, go until 4k and pick which version you think looks the best.
>>
>>108665169
>>108665188
To clarify, I mean when using his default hyperparams. It works quite well.
>>
Has anyone experimented whether anima responds nicely to timestep shifts during training btw?
>>108665188
I am skeptical you actually trained a decent anima lora with the method you preach.
I had a very bad time trying to train lora for anima with high LR and lower step counts.
All epochs looked shit.
>>
Why did tdrusell only release his training on Linux? This is some kind of Triton bad faith move...
>>
>>108665198
>>108665192
I also tried a low LR run (coincidentally similar to his, tried before he published his example lora) with 8-10k steps (I think, I don't remember too well), that also looked bad. Maybe the 18k ballpark figure is needed, I intend to try that.
>>
>>
>>108665198
Anima trains fine with a decent dataset, his sane defaults, and somewhere between 2k and 4k steps. Rarely do I have to go up to 4k. Often around 3k is fine. I have yet to feel the need to change any params from his example LoRA.
>>
>>108665200
He's a piece of shit. Obviously gonna put his training scripts behind Linux since he knows Linux users are less artsy and more sloppers than Windows Mac artists where it just werks.
>>
>>108665205
Just looked it up.
I tried 8k steps with 0.00004 LR. I am planning to try 0.00002 or 0.00003 with 18k or slightly below that.
Oh btw I just remembered his 18k steps is with gradient_accumulation_steps = 4
So does that loosely equal 4500 "real" steps?
>>
anima is really separating the wheat from the chaff
>>
>>108665205
>>108665247
Just use his trainer and the config from his example. All I'm meaning is it just werked for me.
>>108665200
You should already be running Linux desu.
>>
File: file.png (3.7 MB)
3.7 MB
3.7 MB PNG
>>108665149
I did about 5k steps with 200 images for this stonetoss lora, using big russ's example config. Prior to that I did a couple with prodigy optimizer and they turned out ok too.

>>108665062
Seemed pretty bad when I tested it.

>>108665126
Have you tried InterpAny-Clearer? It's some stuff built on top of RIFE, I never really noticed blurring that much with RIFE, just awful artifacting with fast motion which InterpAny-Clearer fixed for me. It's a bit slower than RIFE I think.
>>
what are you guys using to train anima anyway?
>>
>>108665307
my computer
>>
>>108665200
keep seething wintroon
>>
>>108665310
This PC*
>>
>>108665062
He needs bigger dataset if he wants to go this dimly lit grainy shit route with Anima
>>
>>108665307
very funny anon
may your loras overfit and your gens fail
>>
remember all the retards claiming it couldnt be trained due to catastrophic forgetting
good times
>>
>>108665304
>I did about 5k steps with 200 images for this stonetoss lora, using big russ's example config. Prior to that I did a couple with prodigy optimizer and they turned out ok too.
Thanks for the reference point anon.
>>
>>108665374
don't forget the retard claiming it wouldnt be trained due to licensing
>>
OneTrainer support FUCKING when
>>
>>108665129
aaaaiiiieee i'm not paying. fuck the police
>>
>>108665198
I know I can ask a chatbot but I'd rather ask a real anon with real experience. What are the benefits of timestep shifts? Or, what are they used for?
>>
What we gennin' tonight, GPTgods??
>>
doing anima3 -> zit, works pretty well for creating realistic images
but I'm struggling with anima3 -> klein9b, has anyone had any luck getting good outputs from this kind of workflow?
>>
i'm getting astray heads/body parts with the anima turbo lora, is there anything i can do to fix it? already tried editing the prompt and changing steps from 12 to 8 but the problem persists. it doesn't happen without the lora. it also doesn't happen if i use the highres aesthetic boost at a certain value (definitely doesn't on 1) but i dislike the aesthetics of turbo + high res, they remove too much detail together
>>
>>108665451
It comes from the SD3 paper. Shift values above 1 make the model spend more time on higher timesteps/sigmas, which improve composition for flow models (anything SD3 or newer). Going too high makes the image blurry/fucks up details so you can't crank it indefinitely. This is for genning images. If you never touched it, Comfy automatically uses the value 3 for most image models, feel free to experiment with model sampling node sometime.
I am not too well versed about precise impact for lora training, but it might help under some situations I believe.
>>
>>108665473
nevermind, i tested it and still happened with both loras at 1.0. fuck. i guess it's the turbo lora. i like the aesthetics too, i wish i could get them even without dumbing down the model
>>
>>108665477
>I am not too well versed about precise impact for lora training, but it might help under some situations I believe.
you adjust it regarding dataset complexity and training resolution

>>108665473
>>108665490
can you post image, not sure what you mean exactly
>>
>>
>>108665501
So what shift value would you use for full 1024p anima lora training on circa 100 images for a decently complex style?
>>
>>108665524
No clue, I haven't done testing with anima. I'd stick to settings russel guy uses
>>
If v3 isn't the final version, what more is he going to add? Just further highres training?
If v3 is the final version, e621 injection wen?
>>
I got a 5090. What's the best model I can run?
>>
File: 999.jpg (1.9 MB)
1.9 MB
1.9 MB JPG
Ohh, I still have some of my old 1.5 gens on my PC
>>
>>108665644
More hopeful days
>>
Is the API diffusion thread a psyop? I tried the new OAI one on openrouter, but it really seems like an inferior NBP that's slightly less fascist about censoring in exchange for sucking. Maybe it has to do with Openrouter? But the effusive praise feels kinda paid for.
>>
>>108665414
who cares, onetrainer feels old and dated compared to ai-toolkit
>>
>>108665695
It's thinly veiled shitposting yes.
I am not sure what quality level openrouter version is running, I tried it on FAL where you can set it to run at high quality.
Regardless only thing it particularly excels at is cramming a lot of text into the image without fucking it up.
Otherwise its abilities range from "maybe good but NBP can already do this" to shit.
>>
>>108665644
way more sovlfvl compared to current models
>>
>>108665584
you can run pretty much anything that's available
>>
>>108665784
Yeah but what's the best local model for big vram?
>>
>>108665804
Anima
>>
>>108665804
depends on what you want to do.
>>
>>108665804
Wan 2.2 for video.
Anima for anime/illustration (SFW and NSFW)
Klein 9b for Edit
Realism still Z-Image Turbo?
NSFW Realism Chroma (broken schizo model but you should be able to gen fast until you luck out enough with seed, if you don't mind the body horror)
>>
>>108665062
https://civitai.red/models/1862761/nicegirls-ultrareal
https://civitai.red/models/1662740/lenovo-ultrareal
https://github.com/DanrisiUA/ComfyUI-LoRA-Block-Filter
it's actually not bad at all, download his workflow and use those 2 lors to get this kind of quality, it's really long though since in his custom node he's using a very slow sampler, I tried to switch to good old euler but it wasn't as good

it's actually not bad at all, you download one of his images and you have to use those 2 loras, it's also fucking
>>
fix my Anima prompt

a woman with her right arm amputated at the elbow holds a garment in her right hand, in front of racks of clothing in a department store.
>>
>>108665062
What sampler should I take for faster gens?
200+ seconds isn't kino
>>
>>108665840
oops, anima lol
>>
>>108665062
that guy has been milking the same dataset for like 2 years already lol
>>
>>108665845
faster? If you can make it work, lcm lol. but it's like trying to ride a bucking horse. So maybe ice cream? that's what I call ipndm.
>>
anima is great, is what chroma should've been
>>
>>108665857
And it's 30 images? Surely he could have added a few more in that time, lmao.
>>
>>108665871
I'm actually impressed by the realism, like it's just a lora on top of a model heavily trained on anime only, why is it so good? did rusell see something we didn't in the fucking chronos architecture?? kek
>>
>>108665881
>trained on anime only
not only, dataset contains ye-pop and deviantart
>>
>>108665881
maybe the original model was just underbaked? didnt even see much about it
>>
>>108665891
With photographs filtered out.
>>
>>108665891
what the fuck is ye-pop? 808s and heartbreaks?
>>
>>108665881
Cosmos was trained on 200 million videos.
>>
>>108665857
>>108665871
Catbox please?
>>
its the ye-pop dataset, but its better if you train a photorealism lora and combine it with anima

>>108665875
Don't fix what isn't broken mentality, sadly they went the reddit grifter route, but it works for them so its ok I guess, but their loras are very underwhelming since their dataset is so low-quality (noisy 2000s photos), it worked with flux but they never improved
>>
>>108665871
Where's the stump?
>>
>>108665881
>>108665923
>Nvdia gave us an hidden gem and it took us more than a year to notice
Jensen sempai...
>>
>>108665828
you are killing me
>>
>>108665828
it's funny that it looks so good when russel is still training it at 512x resolution, remember the chroma days when people were coping about the fact that chroma had shit anatomy because of the low res training, and that everything would be magically fixed once he would've switched to 1024x kek
>>
>>
daily reminder that kekstone is still training Zeta-Chroma-X0-Dino-Nuggies-PixelPred-FlowInversion-UOOHH-pomf-PTHC-edition instead of using his compute to train Anima for like one or two epochs on the Chroma dataset which is all it would take
>>
>>
>>108665975
>>108665963
Images like this kind of creep me out. It's like a glimpse into an alternate reality but there's no soul involved at all so it just feels like something that could have been but never was. I don't like it. I'm not even talking about your gens in particular just all of these non-chalant/natural instagram type "photos".
>>
File: based.gif (956.7 KB)
956.7 KB
956.7 KB GIF
>>108665963
>>108665975
it can do good realism while having gozillions of anime characters and style, while doing good NSFW and the anatomy is actually pretty good too, sasuka trusell...
>>
>>108665969
I like..or liked Chroma, but that model is just too slow and clunky to run, you gotta play with the unet, add some flash loras, play with the loras weights, do a 2-pass, its just too messy, the only reason I like it, is that its dataset its really good and diverse and you can also can use Redux since its flux based, but anima + zturbo is just faster, has better prompt understanding
>>
>>108665993
oops, forgot gen
>>
>>108665969
I can feel he's gonna try something on anima, make it pixel only and destroys it like he destroyed everything before kek, but even with all those failures I can't hate on the guy, I too want a future without a fucking VAE, that's what it takes to get the perfect edit model
>>
i tell LTX2.3 to walk the camera to a different spot in the scene and then i use an image model to recreate the details. it works better than asking the image model to move since it doesn't have any temporal consistency
>>
>>108665990
Where are those people who said that training a lora on anima was bad because it started to forget stuff ?
>>
>drop the pretense of these anime characters having unique designs and just focus on the hair
>can generate a billion anime characters
shrimple as
>>
>>108666010
if only it wasn't that long... but it's just the begining, if we can get this quality from an unfinished finetune, we're definitely not ready for the final version + a better realism lora on top of it
>>
>>108666000
nice digits,

To me its just another cycle of the model training lucky strike, aka, someone trains a model, its very successful but can't replicate it when another new model and another architecture arrives, it happened with
SDXL => Juggernaut and Pony => Couldn't replicate its success
Flux => Chroma => Didn't finish training, went on a schizo path of experiments and crap that no one uses

I think the only sucessful guy was the LEOSAM's HelloWorld trainer that went on working with WAN-AI

and lets not mention ani, who never succeed at anything ai-related
>>
Man, civitai is really sensitive. It's rating one of my images on my model as rated R despite there being more than tame. I like to keep the main images SFW but I don't think it even matters.

Pic unrel.
>>
>>
>>108666070
>>
>>108666077
is that gpt image 2? the text is perfect behind that's sus lol
>>
>>
>>108666088
nta but.. there's like a billion unique comic con images on reddit. It's probably one of the few logos that generates super well.
>>
>>108666088
just anima > z-image
>>
>>108665975
catbox me up, sensei, please
>>
>>
>>108665934
Do you know how to make a LORA

Will you make a LORA for me

can you make a LORA

(of the Foretrex 801/901)

It's kind of my kink, my kind of LORA
>>
>>108666077
>>108666106
Workflow?
>>
my 1girl, sire
>>
>>108666144
It's alright. Don't see no Foretrex 801/901 tho. I guess it'll do, for losers who are too dumb to figure out how to share a LORA of Foretrex 801/901.
>>
>>108666144
please switch to anima xl looks so godawful
>>
>>108666167
provide the dataset and have it tagged if you want someone to do that for you. i'm training on anima atm if you provide the dataset and use anima i'll train it on there

>>108666178
This is anima, i'm still tweaking my lora. anima is new to me only been using it for 2 days
>>
Anima is slow, but since it doesn't eat that much vram, is there technically a way to load it several time to do parallel processing or some shit?
>>
>>108666186
>anima is new to me only been using it for 2 days
oh damn my b homie srry
>>
>>108666186
Real non-losers tag it for me since cba
>>
AnimaGODS, please teach this lowly peasant how to proompt high quality gens.
>>
>>108665840
USE SOME BOORU TAGS YOU MONSTER
>>
>>108666208
You won't believe this but my kino anima prompts are essentially the same as my kino prompts from previous gen anime models
>>
>>108666208
Damn pardner I'm tryin
>>
>>108666224
can you gen non-bandaged amputees, first starting with the at elbow amputation.

There are different levels of amputation.
>>
>>108666140
https://files.catbox.moe/i6mwte.png
>>
>>108666234
>no amputation

vanilla
>>
>>108666208
just go to danbooru and gather tags and you're good to go, just one thing though, for artists it goes like this
>artist -> @artist
but, for artist tags that have parenthesis, it goes like this
>kaamin_(mariarose753) -> @kaamin \(mariarose753\)
here's how you can easily retrieve tags from an image
https://booruprompt.vercel.app/booru-tag
>>
>>108666224
Yes, yes, HOWEVER consider that I suck with other models too (though my anima gens are definitely way better by default)
>>
>2026
>still no real multi gpu inference for image/video gen

How is this possible?
>>
>>108666248
>>kaamin_(mariarose753) -> @kaamin \(mariarose753\)
clear as mud, use a code tag example and explain it.

RIGHT NOW!

(or ur a LOSER)
>>
>>108666248
Is there any way to rein in extra fingers, arms, legs, and other glaring errors?
>>
>>108666248
>\(mariarose753\)
Is there a plug-in for this? It gets really fucking tedious when you have multiple tags with parenthesis.
>>
Anima negs: source_pony, source_furry
>>
did preview 3 improve artist mixing?
>>
>>108666320
It was never bad. Just use weights.
>>
>>108666328
Can you give an example?
>>
>>108666334
No.
>>
>>108666334
Sure, (@artist1:2.0), (@artist2:1.5). Add as many artists as you'd like.
>>
next you're gonna say
>anima negs: bad
>it just works
>>
help a vramlet nigga out.
how bad would zit nunchaku be for anima upscaling? 1.5 minute for upscaling sucks
>>
>>
>>108666242
nice workflow anon, btw if you want to use less memory overall, go for DFloat11 for Z-image turbo, it's lossless and it's 30% lighter
https://github.com/mingyi456/ComfyUI-DFloat11-Extended
https://huggingface.co/mingyi456/Z-Image-Turbo-DF11-ComfyUI
>>
>>
>>108666408
too old
>>
File: oda-non.png (44.6 KB)
44.6 KB
44.6 KB PNG
no oda-non in anima, sad
>>
>>108666342
Thanks. Gotta experiment with the ratios.
>>
GPT2 is bretty gud but it's not local so it doesn't matter.
>>
>>108666420
>~500 hits on booru
They're probably in the model and that site is out of date. v3 has at least seen artists with less than ~100 hits. https://tagexplorer.github.io/#/artists is the better one desu (oda-non isn't listed in it either tho).
>>
>>108666420
>>108666450
>https://tagexplorer.github.io/#/artists
oda-non is actually in this one just without the dash
>>
>>108664784
>Comfy
why hasn't anyone porked the comfyui before he started going tarded?
keep it neat and clean without the new unnecessary nonsense and especially 'partner nodes' + super-duper-amzing fronted updoots
>you do it
i dont do githubs nor python
>>
>>108666242
thanks for the workflow but personally I'm not a big fan, Z-image turbo removes the sovl out of it
>>
zit
zit
dog shit
>>
>>
>>108666499
mummy
>>
>>108666256
img models cant into sharding
>>
i sharded my pants
>>
File: file.png (2.8 MB)
2.8 MB
2.8 MB PNG
>>108666509
it happens
>>
>>
>>
>>
>>108665960
one more snakeoil ultra experimental 1/10th baked model.... just one more and im sure the furry will deliver!!!!!!!!
>>
>>
for realism with anima3, I've notice that if you feed it two prompts, one using tags and one in natural language, it tends to enhance realism noticeably.
>>
>>108666122
to have woman like that, suicide by fucking like hamster
>>
>>108666730
benchod
>>
>>108666743
what your mother blody saar
>>
release the realism lora, russel
>>
big russ is a coward for not releasing realistic lora
there, i said it
>>
>>108666789
just make a half-assed one in the meanwhile, it doesn't take long.
>>
It's filled with gay porn and he is just ashamed
>>
local is dead
there, i said it
>>
>>108666242
what are lenovo_anima and nicegirls_anima
>>
>>
>>108666841
I read the thread
>>
>>108666841
>>108666841
https://civitai.red/models/1662740/lenovo-ultrareal
https://civitai.red/models/1862761/nicegirls-ultrareal
>>
>>108666841
>>108665828
>>
>>108666854
>those datasets
boy I can't wait for the inevitable Grainscape-ULTRAREAL-Anima.safetensors
>>
>>
how do i get more varied compositions with anima turbo lora? every seed is the same
>>
>>108666909
why'd you need a turbo lora for a tiny model
>>
it's crazy how you can take an anima gen with a decent, realistic looking penis, send it to zit for a 2nd pass, and all you'll get is body horror, no matter which lora you use. Same with skimpy underwear, cameltoes, bikinis. I hope something comes along soon that can replace that piece of shit
>>
>>108666996
wait, i thought local was supposed to be uncensored????
>>
>>
>>108666996
Use ZiB retard
>>
>>108667041
zib is untrainable dogshit that sucks at realism and the fact that no-one has come out with anything approximating a usable finetune of it tells you everything you need to know
>>
>>
>>108667071
use chromer for 2nd pass :)
>>
use anima for the base gen and get a segmenter. then run z for a background detailer pass and chroma for a subject detailer pass. or just use saas instead of coping with 5+ different models just to generate a basic plastic 1girl
>>
>>108667124
not sure saas is the best option for nsfw content
>>
just starting with anima, are there any optimizations for it, like using xformers in sdxl?
>>
>anima
Can I run this shit with RX 9070 XT? I recall I tried Z-Image and run out of VRAM. Illustrious and pony models work fine.
>>
>>108667187
How the hell did you run out of VRAM running 6b model on 16gb card? I run it fine on 12gb. Are you running some piece of shit outdated frontend?
And as for your first question, yes.
>>
>>108664862
>>108664887
>>108664901
>>108666432
Are these real, what the fuck?
>>
>>108667241
bloody bloody
>>
>>108665374
Forgetting persists, and many anime centered checkpoint and lora makers have moved from Anima. That’s why furry and realistic styles are rising along with non anime styles type of loras like him >>108665304

Also locking training the model behind Linux is unfair and elitist. I don’t have time to learn a bloated OS just for “muh open source" snow flake ideology, that mindset is very first world.
>>
>>
>>108667258
********************************** status?
>>
>>108667275
my fucking sides
>>
>>108667258
i don't think anything you said is even remotely true, and you can easily train anima on windows.
>>
>>108667356
You can’t train Anima on Windows, well, you *can*, but not with the presets that Tdrusell recommends, which help Anima mitigate its catastrophic memory issues.
>>
>>108667396
>>108667275
>>
>>108667396
i've trained several loras on windows so far and haven't had a single issue.
it took like 200 steps on my first lora to dial in settings i was happy with, after that it was smooth sailing.
preview 3 has been out for awhile now, we should have lots of examples of "it's catastrophic memory issues." would you care you link some?
>>
>>108667440
>unironically responding to concern troll
lol
let me give you a timeline
>anima releases
>AAAAAAAAH THE LICENSE IS BAD!!!
>oh wow 1M grant TIME TO SHITPOST APACHE ANIMA!!! I messaged all the devs on discord, they didnt reply but theyre 100% onboard:)))))))))
>anima 2 releases
>ARGHHH stop using anima!!! we're totally cooking the new model and its gonna be free! here check this website I totally not made to bitch about anima stealing artists work!!!
>anima 3 releases
>AAGH ANIMA IS UNTRAINABLE! MUH CATASTROPHIC FORGETTERING!!!
>the failed devs go to bitch on HF (btw they have a competing completely melted and shit CLIP SDXL with flux vae model which is complete garbage so they 100% have interest in bringing anima down while propping up their literal shit)
>tdrussel provides a retard proof method to train loras to shut these retards up
>more good loras started getting made
>still tries to gaslight anons about stuff that doesn't exist
>copes daily about being a failed dev and being thrown out by comfy for being a literal psychotic backstabbing retard
ok, next time please just ignore this retard
>>
ani kills himself this year
>>
>>108667460
i know, i just like calling him out on his lies.
anima is pure kino so i can understand why he hates it so much.
>>
>>108667460
> >tdrussel provides a retard proof method to train loras to shut these retards up
> >more good loras started getting made
lol
>>
first result when you search for hf anima loras
https://huggingface.co/Coald/pullupType_diaper_lora_AnimaPreview2_Preview3_V1
>>
i am so crazy about klein 9b...
>>
>>108667524
ok post gens
>>
>>108667440
We should got a lot of examples of good anime baked loras but we haven't also. So your opinion is irrelevant
>>
>>108667460
You took the time for write that but ypu dont shared any good Anima non-existant anime character lora. Your meme is worthless.
>>
english pls
>>
>>108667530
i just throw random pics at it
>>
>>108667496
You say that as a realist or non anime genner, talk to me when you have the intention of adding the latest gacha whores post September 2025 and see how the model fails catastrophically and you will come crying at my feet, you and your Linux distros that you used to train that failed model.
>>
>>108664993
Yeah, it's too strict compared to Gemini. All that trouble doesn't seem worth it for a side-grade.
>>
>>108667575
sounds like a real waste of time to me
>>
>>108667581
>>108667275
>>
>>108667496
>anima is pure kino so
You boldly claim Anima is pure kino, yet I have not seen any gen that makes me say "Anima is kino" and do not come at me with painterly slop.
>>
>>108667581
the irony of your incessant fudding is that it makes anima seem like a fantastic model because you can only ever speak in vagaries and hypotheticals.
>>
>>108667619
>>108667275
>>
>>108667623
You also speak in the same way. you still haven't shown any good anime baked lora and you prefer to argue with me instead of sharing well made loras.
>>
>>108667640
im using anima yume for my new gacha girls thoughever ;)
>>
>>108667640
>that lora doesn't count because the anime it is from isn't obscure enough.
there are already thousands of anima loras on civitai, no one is complaining about catastrophic forgetting or anything resembling your lies.
>>
>>108664993
yeah, adding text to images with topless people is so needed
>>
G'day fellas. What's the "recommended" specs for Anima?
>>
>>108667599
i don't understand how it's possible to be already jaded about these things
>>
>>108667477
>>
>>108667596
>Yeah, it's too strict compared to Gemini. All that trouble doesn't seem worth it for a side-grade.
its not allergic to genning big tits and nice looking bodies so i wouldnt call it a sidegrade
just have to figure out how to prompt it without triggering guardrails
>>
>>108667749
>genning big tits and nice looking bodies
local is FINISHED
>>
>>108667749
>>108667754
Saar bros we are eating good bitch basterd
>>
cooking up some ltx2.3 kinos
>>
>>108667754
>>108667761
localcucks are arguing about shitty anime loras lmao
they've completely given up on realism (sad)
>>
>>108667692
a gpu
>>
>>108667780
realism is for incels
>>
>>108665826
>Wan 2.2 for video
I've been out for a few months, trying to catch up.
And this is STILL the best local video model? Nothing else was released that could compete? That sucks man, I hate the constant loading and unloading of using two models.
>>
>>108667798
no, ltx2.3 is good
>>
>>108667921
no it's not
>>
>>108667937
why?
>>
>>108667964
because it has terrible quality
it's like chroma but for video and even worse
chroma at least can do decent 1girl standing cowboy shot 1 in 10 gens
>>
>>108667581
lol

Give me the character you want, I'll do it.
>>
>>108668058
turn the quality up then
>>
>>108668071
if it was that simple we would still use sd1.5
>>
>>108668111
what resolution are you generating in?
>>
Good morning, Anima is shit.
Have a good day ;)
>>
>>108668206
how can you just show up out of the blue and make one single post in this general and then vanish?
>>
>>108668187
it's i2v no matter what res is 512 or 1024 fps 24 or 60
if you are trying to say skill issue it's not
look at ltx loras descriptions and comments on civitai
you will see 10k+ sometimes 50k+ steps of training
it's just ltx is garbage
>>
>>
>>
>>
>>108667780
/ldg/ was infinitely better back when those anime fags remained confined to their containment general.
>>
anime fags go
tdrussell stays
>>
>>108668312
Based
>>
>>108665960
>remember the chroma days
God chromafags were insufferable. The model was pure shit they kept shilling and crying.
>>
has the faggot really locked down training behind diffusion pipe? lol
>>
>>108668408
just implement it yourself, retard
>>
>>108664784
I love my RTX 5070ti so much bros :) All these degenerate illegal sloppa i gen :-))
>>
>>108668312
Tdrusell should be in Noob cord with people of his category, talking and discussing with the greats, not here being a rat head arguing with schizos and civitai tier 3d sloppers
>>
>>108668408
https://github.com/gazingstars123/Anima-Standalone-Trainer
>>
>>108668433
>not here being a rat head arguing with schizos and civitai tier 3d sloppers
which one are you then? a schizo or a civitai tier 3d slopper?
>>
>>108668433
if tdrussel is so cool
why can't he train controlnets
>>
>>108668474
Because it's not sdxl era anymore.
>>
>>108668457
>only 84 stars
that's not a lot of stars, is this trustworthy?
>>
Trained an Iryna LoRA on Ernie. This time in Comfy instead of diffusion-pipe. rank 32, 5000 steps https://gofile.io/d/ljv01e
>>
>>108668474
>why can't he train controlnets
because he needs to finish the model first before implementing controlnets on top of it, are you retarded?
>>
>>108668483
>training in comfy
why
>>
>>108668483
looks like dogshit
>>
>>108668483
unironically, anima will be the best model to train realistic shit on it, it's surprisignly good at it
>>108665996
>>108665963
>>108665828
>>
>>108668433
He is a coward desu, imagine what the community could accomplish if he debated with Bluvol, Azhnek and Laxar there and became friends with them, but he takes refuge here with his mommy /ldg/ who gives him milk and warm cookies and pats him on the back for whatever crap he does.
>>
>>108668517
>his mommy /ldg/ who gives him milk and warm cookies and pats him on the back for whatever crap he does.
it's not true, you're shitting on him right now, proving that /ldg/ can be harsh on him
>>
>>108668480
elaborate

>>108668488
two digits iq subhuman spotted
>>
>>108668517
He already got money working quietly in his own, doesn’t need to prove himself to failed jealous devs
>>
>>108668517
>Bluvol, Azhnek and Laxar
who?? see? the fact I'm asking that question says everything, those guys haven't proved anything and you want them to ruin this projct? lmao
>>
>>108668433
> not here
never was
>>
>>108668534
they are a BIG deal on discord, you wouldn't get it.
>>
>>108668488
>are you retarded?
>>108668525
>yes
that's what I thought
>>
>>108668523
Ipositive or negative in superficial ways. Bluvoll practically lives in noob cord testing Anima things with Aznek and they recently made a VAE of Anima 2d that requires much less VRAM. They spend more time with Anima than tdrusel himself.
>>
Ultimately Comfy was right to give the money to tdrussel, Anima is genuinely an insanely good model, he's probably the most talented guy we ever had on the local diffusion ecosystem when it comes to training stuff and not make them bad
>>
>>108668548
>they recently made a VAE of Anima 2d that requires much less VRAM.
>VRAM
dude it's a 2b model, and you want to save VRAM? are you living on a shithole or something? get a better card nigga
>>
>>108668549
Money and time makes miracles
>>
how is this thing realistic at all?
>>
>>108668548
>VAE of Anima 2d that requires much less VRAM
indian priorities
>>
>>108668555
https://huggingface.co/Anzhc/Qwen2D-VAE
Faster gens
>>
>>108668559
>how is this thing realistic at all?
did you use this specific workflow? it's the one that makes anima pretty good at realism
https://civitai.red/images/128285430
>>108668568
what about the quality though? if it's worse it's not worth it, I'll give it a try though, thanks for the link
>>
>>108668578
The quality its 1:1
>>
>>108668568
this shit doesn't work
>>
>>108668499
>why comfy
To see what it's like you virgin. I'll try anything once. 5k steps in under 4 hours @ 1024.

>>108668502
idgaf, just dropping something anonymously

>>108668509
Tried a realism LoRA on the first preview, but it didn't go well. Will try again after final release in 2 weeks
>>
>>108668568
>RuntimeError: Error(s) in loading state_dict for AutoencoderKL:
doesn't work
>>
>>108668600
/ldg/ has standards
we dont just post useless junk in the threads
>>
>>108665464
go back home and kys retard
>>108653190
>>
File: file.png (3.8 MB)
3.8 MB
3.8 MB PNG
>>108664784
Flux 2. Kline 4B - comfyui portable + manager plugin. pre-bundled setup.
[prompt] A bovine moo cow wearing a tiny Christmas hat and a scarf and little hoof-covering-mitten-boots surrounded by Christmas presents. 1green hyena wearing hat and a scarf. early VHS camcorder style, slight noise, flash photography, candid moment, 1990s VHS aesthetic, festive Christmas Eve snowy night holiday celebration atmosphere. happy cow. happy facial expression of joy on cow and green hyena. scene takes place inside comfortable well furnished opulent rustic multistory barn stable converted into home. roaring comfy cozy atmosphere festive beautiful emotional aesthetic. happy joyful smiling animals.
>>
>>108668568
>>108668596
>>108668617
it's not much but I'll take it, it's 100% lossless indeed, install that custom node and you can load it with the normal vae loader
https://github.com/Anzhc/anzhc-qwen2d-comfyui
>>
>umm install my custom n..
no
>>
>>108668701
based
>>
>>108668701
>>108668705
>too retarded to ask a LLM if the script is sus or not
yep, that's how I imagined the intelligence of Anima haters
>>
>2 mins to gen with anima
what the fuck are you guys doing lmao
>>
>>108666432
Slop
>>108653190
>>
>>108668711
>implying
bro im not install shoddily coded garbage that will inevitabily break to save what, 1-2 seconds off a gen? literally fuck off retarded poorshit, go be poor somewhere else
>>
>>108668690
>lossless
nigga is blind
>>108668711
Unless your shit is total gamechanger it doesn't deserve fiddling with nodes. Other finetuned VAEs have no issues working in native nodes.
>>
>>108668720
>that will inevitabily break to save
implying
>>
>>108668723
>nigga is blind
there's literally 0 difference, you're trolling
>>
>>
thanks to whoever sugggested the Flux 9b consistency lora last thread, upscales are spot on with it, this shit is fucking magic
>>
Fresh

>>108668921
>>108668921
>>108668921
>>108668921
>>
>>108665490
i managed to fix it by using a lower resolution
>>
>>108668600
Thanks anon. I was planning on experimenting to see how ernie responds to lora training.
Now I know the answer is not good.
Not sarcastic btw, thanks for saving me time.

Reply to Thread #108664784


Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)