Thread #108668921
File: highlights_g_108664784_1776956890_1.jpg (2.5 MB)
Discussion and Development of Local Image and Video Models
Previous: >>108664784
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
File: 1755007656454961.jpg (392.1 KB)
>>
File: ComfyUI_10703_.png (367.9 KB)
>>
Why is civitai full of new accounts literally named "abc123abc" commenting on every single z-image lora asking for an Ernie version? For fuck's sake, just take a look at the Commodore64 lora for Ernie, it's disgusting, makes me puke just to stare at the images.
>>
>>108668948
get out! >>108653190
>>
File: image.png (32.1 KB)
>>108668972
chinks shill army nothing new
they are also shilling chink models in r/localllama right now
>>
File: 1760920978918124.png (26.2 KB)
>>108668954
the room was prompted to be bathed in warm light with a dusty color palette because it looks cozy
>>108669037
facts. i really like what it did with grok's coffee cup
>>
>mfw Resource news
04/23/2026
>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control
https://shelley-golan.github.io/ParetoSlider-webpage
>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
https://github.com/Adamlong3/DynamicRad
>Normalizing Flows with Iterative Denoising
https://github.com/apple/ml-itarflow
>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
https://github.com/inclusionAI/LLaDA2.0-Uni
>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer
>AI Model & ‘MAGA’ Influencer Emily Hart Unmasked as Indian Man
https://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html
04/22/2026
>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC
>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing
>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn
>AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model
https://yutian10.github.io/AnyRecon
>SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing
https://github.com/vivoCameraResearch/SmartPhotoCrafter
>Soft Label Pruning and Quantization for Large-Scale Dataset Distillation
https://github.com/he-y/soft-label-pruning-quantization-for-dataset-distillation
>Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
https://github.com/AMAP-ML/EMF
>Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting
https://github.com/YonseiML/dpw
>IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow
https://github.com/fanzh03/IR-Flow
>>
>mfw Research news
04/23/2026
>Image Generators are Generalist Vision Learners
http://vision-banana.github.io
>Camera Control for Text-to-Image Generation via Learning Viewpoint Tokens
https://randdl.github.io/viewtoken_control
>Hallucination Early Detection in Diffusion Models
https://arxiv.org/abs/2604.20354
>Wan-Image: Pushing the Boundaries of Generative Visual Intelligence
https://arxiv.org/abs/2604.19858
>MMCORE: MultiModal COnnection with Representation Aligned Latent Embeddings
https://arxiv.org/abs/2604.19902
>Rethinking Where to Edit: Task-Aware Localization for Instruction-Based Image Editing
https://arxiv.org/abs/2604.20258
>Amodal SAM: A Unified Amodal Segmentation Framework with Generalization
https://arxiv.org/abs/2604.20748
>FluSplat: Sparse-View 3D Editing without Test-Time Optimization
https://arxiv.org/abs/2604.20038
>HumanScore: Benchmarking Human Motions in Generated Videos
https://arxiv.org/abs/2604.20157
>Render-in-the-Loop: Vector Graphics Generation via Visual Self-Feedback
https://arxiv.org/abs/2604.20730
>Mitigating Hallucinations in Large Vision-Language Models without Performance Degradation
https://arxiv.org/abs/2604.20366
>Cognitive Alignment At No Cost: Inducing Human Attention Biases For Interpretable Vision Transformers
https://arxiv.org/abs/2604.20027
>X-Cache: Cross-Chunk Block Caching for Few-Step Autoregressive World Models Inference
https://arxiv.org/abs/2604.20289
>Self-supervised pretraining for an iterative image size agnostic vision transformer
https://arxiv.org/abs/2604.20392
>Efficient INT8 Single-Image Super-Resolution via Deployment-Aware Quantization and Teacher-Guided Training
https://arxiv.org/abs/2604.20291
>From Diffusion to Flow: Efficient Motion Generation in MotionGPT3
https://arxiv.org/abs/2603.26747
>>
>>108669037
that's basically what image 2 is doing.
it's a second pass that projects the text onto the genned image. the easiest way to spot it is on clothing; the X, for example, is just sitting on her dress. it's actually almost pixel-perfect with the X on the laptop.
>>
>>108669088
>>108669090
thanks
>>
File: Untitled-1.png (191.3 KB)
>>108669107
probably because they don't care, it's a parlor trick to impress indians and boomer investors. sorry to pull the curtain back.
case in point, the gen uses the same X, it just has a slight skew on the dress. same with the openAI logo, it's just sitting on her shirt.
>>
>>108669093
Gay
>>108669089
There is no way it's that simple. But now that I think of it, putting tags like "masterpiece" seems to help
>>
File: image.png (44.4 KB)
>>108669137
?
>>
>>108669190
api image thread is here >>108653190
>>
>>108669243
honestly i think a random person could figure out a better implementation in a few days; local has a lot more headroom to fuck around. there are 3d models, i assume they have some kind of texture projection.
you could probably jury-rig something from preexisting nodes. convert a masked area into a plane or 3d topology, project text or an image onto it, then lay it on top of the gen.
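To make the jury-rig concrete, a minimal sketch of the flat-surface version in plain Python with OpenCV, outside ComfyUI (the file names and the four corner points are made-up placeholders; a node-graph version would do the same warp-and-composite):

import cv2
import numpy as np

# Load the finished gen and the text/logo to project (placeholder paths).
gen = cv2.imread("gen.png")
logo = cv2.imread("logo.png")

h, w = logo.shape[:2]
src = np.float32([[0, 0], [w, 0], [w, h], [0, h]])
# Four corners of the target surface in the gen (e.g. from a mask or picked by hand).
dst = np.float32([[420, 310], [590, 330], [580, 480], [410, 455]])

# A homography maps the flat logo onto the skewed surface, giving the
# "sitting on the dress" look described above.
M = cv2.getPerspectiveTransform(src, dst)
warped = cv2.warpPerspective(logo, M, (gen.shape[1], gen.shape[0]))

# Composite only where the warped logo landed.
mask = cv2.warpPerspective(np.full((h, w), 255, np.uint8), M, (gen.shape[1], gen.shape[0]))
out = np.where(mask[..., None] > 0, warped, gen)
cv2.imwrite("composited.png", out)

A light img2img pass on top would blend the lighting; true 3D topology would need depth or a mesh instead of a single quad.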
>>
File: _AnimaPreview3_00155_.jpg (382.3 KB)
>>
File: 1748109684850279.jpg (646.6 KB)
aight, you can now use NAG on Anima
https://github.com/BigStationW/ComfyUI-NAG-Extended
https://github.com/BigStationW/ComfyUI-NAG-Extended/blob/main/workflows/NAG-Anima-ComfyUI-Workflow.json
https://civitai.com/models/2560840/anima-turbo-lora
>>
File: _AnimaPreview3_00156_.jpg (387.4 KB)
>>
File: 1563934765591.png (4.8 KB)
>turbo lora for a 2b model
>>
>>108669455
https://github.com/pamparamm/ComfyUI-ppm
I've been using this for negative weights while at CFG 1.0 and it works great; you just have to get used to putting negative-weighted tags in the positive prompt instead of writing them in the negative prompt. This has worked better for me than NAG ever did.
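For anyone wanting to try it, a minimal sketch of the usage (assuming ppm's NegPip-style node is what's meant here; the tags are placeholders): hook the CLIPNegPip node up to your model and CLIP before the sampler, set CFG to 1.0, leave the negative prompt empty, and put everything in the positive prompt, e.g.
1girl, solo, outdoors, (watermark:-1.0), (blurry:-0.8), (extra fingers:-0.6)
The negative-weighted tags get suppressed during sampling even though there's no CFG negative pass.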
>>
File: 1765410005510165.jpg (33.8 KB)
>>108668921
>my Roll-chan made it in the OP
>>
File: _AnimaPreview3_00162_.jpg (409.6 KB)
>>108669513
probably zimage turbo
>>
File: file.png (17.8 KB)
I wonder if there is a way to automate gemma 4 with its vision capabilities as an agent + whatever model + inpainting tools to approach the result of the gpt autoregressive model.
>>
>>108669503
the issue is that those NAG parameters don't work for CFG > 1. it can be used, yeah, but I'm just too lazy to find the right values again. I mean, if you already have CFG, adding NAG on top of that is kinda useless imo (and it's slower)
>>
File: 1763564705331420.jpg (242.1 KB)
How do I anima with krita?
>>
>>108669544
>>108669563
I've seen some workflows where they use ZIB to do the beginning of the image (like the first 50% of steps), then switch to ZiT to make it look good
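That split is doable with two stock KSamplerAdvanced nodes; a minimal sketch, with placeholder step counts (the field names are ComfyUI's own):
first pass (ZIB): add_noise=enable, steps=20, start_at_step=0, end_at_step=10, return_with_leftover_noise=enable
second pass (ZIT): add_noise=disable, steps=20, start_at_step=10, end_at_step=20, return_with_leftover_noise=disable
Wire the first sampler's LATENT output into the second, with each sampler pointed at its own loaded model.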
>>
>>108669553
Since always. https://civitai.com/models/1662740/lenovo-ultrareal?modelVersionId=2882170 This lora helps a tiny bit.
>>
>>108669555
issue is that edit won't be able to target specific things to enhance
>>108669528
can gemma select a part of an image?
>>
>>108669582
>edit won't be able to target specific things to enhance
yes it can; edit can modify just one specific part of the image. that makes shit easier, because you just have to say "hey, add a hat to that girl's head" instead of trying to automate an inpainting process
>>
File: 646065966145594.png (1.1 MB)
>>108669513
>>108669522
Anima -> ZIT
>>
>>108669528
I don't know what it is, but they did something more than just "look at this image and fix it".
Even SOTA API models don't really have super great visual reasoning.
Again, I don't know precisely what it is, but they are feeding ChatGPT more than a few hundred visual tokens.
>>
File: _AnimaPreview3_00173_.jpg (382.6 KB)
>>108669553
I use loras for photography and interior. Haven't uploaded anywhere yet.
>>
>>108669528
>>108669629
it's probably something like this
>it makes the image -> it uses its visual encoder to see mistakes -> it makes an edit prompt -> it edits the image
a gemma 4 + klein combo could definitely do the trick
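A minimal sketch of that loop in plain Python, assuming hypothetical generate()/vlm_inspect()/edit() wrappers around whatever local backends you pick (none of these names come from a real library):

MAX_ROUNDS = 4

def refine(prompt):
    # generate, vlm_inspect, and edit are placeholders you would wire to
    # your own txt2img model, vision LLM, and edit model respectively.
    image = generate(prompt)
    for _ in range(MAX_ROUNDS):
        critique = vlm_inspect(image, "List any wrong-looking text, hands, or objects. Say OK if none.")
        if critique.strip().upper().startswith("OK"):
            break
        instruction = vlm_inspect(image, "Write one short edit instruction to fix: " + critique)
        image = edit(image, instruction)
    return image

The whole trick is that the critic only ever has to spot local mistakes, and the editor only ever has to fix one thing at a time.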
>>
>>108669503
>>108669531
give me a prompt and a negative prompt, I'll try it out
>>
>>108669629
Anything can be bruteforced with enough tokens, and seeing the prices on the API side, I'm pretty sure it's feeding a whole lot of tokens to refine the image.
The result is good though, and I'd like to see that locally done with the tools we have.
>>
File: t5g_gallery_00125_.png (911 KB)
https://huggingface.co/TheRemixer/ChenkinNoobRF-T5Gemma-adapter
Neat, T5gemma adapter for Chenkin Noob!
>>108669726
Anima is 2x slower than SDXL, and training Anima loras is between 2.3x and 2.5x slower than SDXL
>>
>>108669653
>it uses its visual encoder to see mistakes
this is probably their secret sauce (along with using agents). I think they trained specifically on "wrong-looking text" and details, which means the model is probably very good at spotting that
>>
File: 410146798802683.png (3.9 MB)
>>108669684
Pretty much, except I start with a realistic Anima gen.
>>
File: 968433991998314.png (2.1 MB)
>>
>>108669731
I guess you can take your time trying to min-max llama.cpp params and see if it scales up well enough? I wouldn't be too hopeful but worth a shot.
Maybe 3.6 works better for this; that's also worth experimenting with.
>>
File: 1747808016100949.jpg (258.8 KB)
>>108669653
>>108669747
you need a very good model that doesn't use a vae in order to do what gpt 2 is doing
don't waste your time trying to squeeze water out of a stone with these outdated latent diffusion models
>>
File: absolute gpt image 2 slop.png (3 MB)
>>108670003
>you need a very good model that doesn't use a vae in order to do what gpt 2 is doing
the thing is, it's obvious that gpt 2 is still using a vae. when you go for very complex images, it gets slopped fast and there's more and more noise and artifacts; it's probably the result of the model doing like 10 edits, and at that point the vae issues start to get really amplified
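If you want to see that amplification for yourself, a minimal sketch with diffusers (the VAE repo id and image path are placeholders; repeated encode/decode stands in for repeated edit passes):

import numpy as np
import torch
from diffusers import AutoencoderKL
from diffusers.utils import load_image

device = "cuda" if torch.cuda.is_available() else "cpu"
vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse").to(device)

img = load_image("test.png").convert("RGB").resize((512, 512))
x = torch.from_numpy(np.array(img)).permute(2, 0, 1).float()[None].to(device)
x = x / 127.5 - 1.0  # scale to [-1, 1] as the VAE expects
orig = x.clone()

# each encode/decode round stands in for one edit pass; watch the error grow
with torch.no_grad():
    for i in range(10):
        z = vae.encode(x).latent_dist.sample()
        x = vae.decode(z).sample.clamp(-1, 1)
        print(i, torch.mean((x - orig) ** 2).item())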
>>
File: woman 2026 04 23 1.png (1.5 MB)
>>
File: 1770711519026006.png (2.3 MB)
>use pear-shaped figure tag
>turns her into a literal fucking pear
Kek
>>
File: _AnimaPreview3_00224_.jpg (370 KB)
>>
>>108669856
The problem with this method is that the anatomy more or less sucks.
Speaking of anatomy, what model produces the most anatomically accurate gens? I have been using Virt-a-Mate + ZIT to make my gens look realistic, but that's kind of a hassle.
>>
File: 921750962661488.png (2.3 MB)
>>
File: 1750158164106180.png (2.3 MB)
>>108670022
that mainly happens to posters and cartoons; i haven't seen it happen to real images yet. it could be using a different model for realism
>>108670060
it's basically magic compared to local.
you're better off trying to reverse engineer it to gain some understanding than acting like a know-it-all
>>
File: 1767269344628861.jpg (576 KB)
>>108669503
>Would you be so kind as to compare non turbo lora with regular CFG vs NAG?
here you go
>>
File: 575979444952377.png (1.8 MB)
>>
File: 1776968397991114_.png (2.4 MB)
>>108670294
>>
File: _AnimaPreview3_00241_.jpg (386 KB)
>>108670241
>>
File: 1_00027_.jpg (3.3 MB)
also what's cool is that you can use AI nowadays to remove ugly tattoos from women.
>>
File: 869726735467031.png (2.4 MB)
>>
File: zazed.png (168.9 KB)
>>108670396
>I can fix her!
and you did
>>
File: 238674633911580.png (2.1 MB)
>>
File: 430128115435224.png (2.1 MB)
>>
File: 166514189754498.png (2 MB)
>>108670425
He's shy bro.
>>
File: 1759207999061119.jpg (559 KB)
How come anima uses @ for artist tags?
>>
>>108670648
There are also artists with common nouns in their name in the dataset, and on illustrious/noob you will ALWAYS get that common noun in your gen if you prompt for them. Haven't seen that issue on anima yet.
>>
File: 1746837922993034.png (2 MB)
>>108670411
we all can't wait to see your inpainting skills
why don't you show us an example of how gpt 2 handles image editing?
>>
>>108670728
it was already explained at the start of the thread.
it generates the image then does a segm inpaint to project text and logos.
>>108669135
>>
File: 1770353268506318.png (1.2 MB)
Seems like some people want GPT-2 at home, you got your wish granted lol
https://github.com/inclusionAI/LLaDA2.0-Uni
https://huggingface.co/inclusionAI/LLaDA2.0-Uni
>>
>>108670929
Oh somehow I missed that:
# Understand the image
response = model.understand_image(
    image_tokens, h, w,
    question="Describe this image in detail.",
    steps=32, gen_length=2048,
)
Holy hell, it is diffusing text.
>>
File: 2026-04-23153725_stealthmeta.png (658.4 KB)
>>
File: 1761391883384476.jpg (178.3 KB)
>>108670859
>Diffusion Large Language Model
Huh? it's diffusing text too?
also what an awful way of showing perf
>>
>>108670979
yep, it's a diffusion LLM, and those things are like 5x faster than your regular autoregressive LLM. the issue is that, for the moment, no one has managed to make them as smart as autoregressive ones; I hope it'll happen
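The decoding loop is the interesting part; a minimal sketch of the masked-diffusion idea (LLaDA-style), with a stand-in model call and the simplest confidence-based unmasking schedule rather than the repo's actual sampler:

import torch

def diffusion_decode(model, prompt_ids, gen_length=64, steps=8, mask_id=0):
    # start from an all-[MASK] completion appended to the prompt
    x = torch.cat([prompt_ids, torch.full((gen_length,), mask_id)])
    per_step = gen_length // steps
    for _ in range(steps):
        logits = model(x.unsqueeze(0))[0]      # (seq, vocab); stand-in call
        conf, pred = logits.softmax(-1).max(-1)
        conf[x != mask_id] = -1.0              # only fill still-masked slots
        idx = conf.topk(per_step).indices      # commit the most confident tokens
        x[idx] = pred[idx]
    return x

Each forward pass commits a whole batch of tokens at once instead of one, which is where the speedup over autoregressive decoding comes from.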
>>
https://civitai.red/models/2553102/editanything?modelVersionId=2869279
edit lora for LTX 2.3, pretty funny what you can do. "replace the man in the red shirt with a green orc from lord of the rings."
https://files.catbox.moe/i3agwk.mp4
>>
File: We'll see about that.gif (1.4 MB)
>>108670859
>https://github.com/inclusionAI/LLaDA2.0-Uni
>It enables precise modifications while perfectly preserving original details.
>>
>>108671011
kek, "replace the man in the red shirt with mickey mouse from Disney."
https://files.catbox.moe/9av1e7.mp4
>>
File: _AnimaPreview3_00309_.jpg (464.2 KB)
>>108670968
I might but I don't think it's good enough
>>
File: 1766663339136520.png (1.5 MB)
Is there a rule of thumb for how many artists you can mix with anima? I've had good results with 2 (using weights of course) but I've had trouble keeping the style consistent with more.
>>
File: 1759704504932123.mp4 (232.2 KB)
>>108670859
it's interesting to look at, the way it processes text with the diffusion process
>>
>>108671011
>>108671176
>>108671187
For old porn with ugly but competent actresses alone, I can already see the use case: replacing their faces and bodies with unreal hotties.
>>
>>108671224
no, it's random, and it's kind of an art, like cooking: the stronger the artist's style, the less it can marry with anything else strong. some artists are similar enough that they can go together, each bringing something specific (like one good at hosiery, one good at poses, etc)
>>
>>108671314
Prostitution no, not until we have hacked all senses at least.
Porn, the industry itself is already kind of being killed by OF and "amateur" before that.
OF, yeah but it's imploding by itself, and tbdesu it's a very recent category, pretty girls doing lewd stuff on cam is a very recent thing. I see it as fleeting.
>>
File: ComparisonLatest.jpg (2.5 MB)
A fair-skinned young Caucasian woman with long, sleek copper-red hair stands centrally on a weathered stone walkway, posing directly for the camera. She wears a whimsical pastel lavender mini-dress featuring a tiered skirt, ruffled bodice with lace trim, and sheer long sleeves, accessorized with a metallic gold crossbody bag. Her legs are clad in intricate white patterned lace tights, ending in chunky two-tone black and white platform oxford shoes. She is situated in a formal garden setting, flanked by stone balustrades topped with large white classical urns containing manicured green bushes. Immediately behind her stands a white architectural frame structure bearing the text "1GIRL GARDENS" in bold serif capital letters. The background reveals terraced flower beds, classical white statues, and a green hillside dotted with buildings. The lighting is soft, flat, and diffused from an overcast sky, creating shadow-free illumination that enhances the soft pastel colors of her dress and the even tones of her complexion. Style: whimsical street fashion photography. Mood: sweet, composed, and serene.
>>
File: _AnimaPreview3_00320_.jpg (422.8 KB)
>>
File: thank you tdrussell miku 2.png (865.6 KB)
After failing a few times before, I was finally able to train my lora on anima thanks to the official configuration tdrussell shared. (I also bumped the training dataset from 80 to 130 images in between.)
It's not perfect, but it actually feels usable now without coping extensively. Some gens still undershoot, and there's maybe still a little overlearning of irrelevant noise in the others, but it came out better than previous attempts where I either fried it to oblivion or undershot massively.
In the interest of perhaps helping an interested party, here is the command I last used:
python anima_train_network.py \
  --tokenizer_cache_dir /home/user/myloras/tokcache/ \
  --metadata_trigger_phrase "@tag. " \
  --cache_info --resolution 1024 --cache_latents --enable_bucket \
  --min_bucket_reso 256 --max_bucket_reso 2048 --bucket_reso_steps 16 \
  --resize_interpolation lanczos \
  --pretrained_model_name_or_path /home/user/models/anima-preview2.safetensors \
  --qwen3 /home/user/models/qwen_3_06b_base.safetensors \
  --vae /home/user/models/qwen_image_vae.safetensors \
  --output_dir /home/user/myloras/output/ \
  --save_precision bf16 --save_every_n_epochs 1 --save_state \
  --train_batch_size 2 --xformers --max_train_epochs 10 \
  --persistent_data_loader_workers --seed 999 --gradient_checkpointing \
  --mixed_precision bf16 --logging_dir /home/user/myloras/logs/ \
  --log_with tensorboard --optimizer_type AdamW --learning_rate 0.00003 \
  --optimizer_args weight_decay=0.01 betas=0.9,0.99 \
  --lr_scheduler cosine --lr_warmup_steps 0.1 --save_model_as safetensors \
  --network_dim 32 --network_alpha 16 --network_dropout 0.075 \
  --cache_text_encoder_outputs --cache_text_encoder_outputs_to_disk \
  --timestep_sampling sigmoid --sigmoid_scale 1.3 \
  --network_module networks.lora_anima \
  --dataset_config /home/user/myloras/animaconfig.toml \
  --max_grad_norm 1.0 --network_train_unet_only --split_attn
4 repeats so 5.2k steps in total.
>>
>>108671332
>not until we have hacked all senses at least.
we're truly not far off. we don't need smell; if anything, excluding smell is a bonus for most. we have the visual and physical stimulation already, it just needs to be all thrown together in a kit.
>Porn, the industry itself is already kind of being killed by OF and "amateur" before that.
even most "amateur" stuff was being run by essentially a pimp.
>OF, yeah but it's imploding by itself
Yeah, it's not really "imploding", but the fake AI girls are starting to get more subscribers than the real thing. OF is nearly going to become entirely fake AI girls, and the retards paying for it don't care; they're gooning retards who just want a weird parasocial relationship. AI sex bots are ultimately the future, though. The price of pussy and affection will soon be at an all-time low.
>>
File: _AnimaPreview3_00335_.jpg (364.4 KB)
>>
File: Ernie-Image-Turbo_00018_.png (1.9 MB)
>>108671341
>prompt
>>108671409
>Caucasian
>Ernie
>>
File: 1776600359777472.png (2.1 MB)
>>108671314
the problem with nbp is that it struggles with creating unique-looking faces and attractive bodies
also gpt image seems to handle prompts differently from any other model; slight changes can make or break a gen, so it's not fair to use the same prompt
>>
>>108671500
meant for
>>108671341
>>
>>108671502
ZIT has been aggressively RL'd in post-training, to the point where they destroyed all seed variance. Pretty women are a result of that (good luck getting big boobs though).
ZIB swings wildly in terms of how a human should look, because it's a rawer base model.
>>
File: _AnimaPreview3_00354_.jpg (484.9 KB)
>>108671341
>>
File: Comfy_00021.jpg (1.7 MB)
>>108671341
zit
>>
>>108671271
I have been curating my own realism dataset for the last few weeks. Aiming for the 1k mark.
It's roughly:
35% solo women
10% various backgrounds
20% various sex acts
The remainder is miscellaneous shit (transports, actions, objects, plants, etc.)
I have yet to prune and caption.
I will post it if I proceed to train and it goes well.
Open to suggestions in terms of dataset curation for a realism lora task.
>>
File: Comfy_00022.jpg (1.7 MB)
>>108671674
4U
>>
File: 1761204260818296.png (1.4 MB)
>>108671653
>>108671697
Anima + that realism turbo lora
https://civitai.red/models/1862761/nicegirls-ultrareal?modelVersionId=2882216
>>
File: Comfy_00023.jpg (1.9 MB)
>>108671709
does anima do 'natural language' prompts? try 'deep depth of field'
>>
why is my anima lora so small? 66mb? I have a huge dataset, so why is this so small? if I keep training, is it going to jump up in size? epochs 1-8 have barely any bytes added. i'm using that anima standalone trainer, but I'm not sure if I should just use russ's diffusion-pipe or what? help a nigga out pahLEASE.
>>
File: _AnimaPreview3_00341_.jpg (295.4 KB)
>>108671659
Start pruning, cropping and captioning while you collect images. You'll end up with a much higher quality dataset.
>>108671687
I will have to retrain, but it's not far off
>>
>>108671769
what rank did you use? and what training res?
>>108671659
make sure to resize/crop them to good "normal resolution" sizes (divisible by 32 preferably, but 16 is ok)
get more people in groups/pairs
obviously anything larger than 3-4MP would be best; i aim for 5-10MP if i can find sources that high, and just let the trainer downscale them (or run a script)
get more general stuff too (animals/pets, cars, locations, etc)
even if a realism lora doesn't work out, that would be a nice dataset for reg images
>>
>>108671797
and of course use an LLM to caption as much as possible
>>108671801
ah i misread then. i'm going for the opposite, no bokeh/blur, unlike >>108671775
>>
File: ss_04-23-2026_006.png (32.3 KB)
>>108671797
>what rank did you use? and what training res?
16 rank but like I said my dataset is huge. every image has a max pixel resolution on x or y of 1024. when I trained on pony it just werked so not sure what I'm doing wrong.
>>
>>108671812
>>108671828
Stop thinking in terms of what the correct photography terms are; these models are generally not trained with that stuff. Bokeh in the positive prompt = blurry background; put it in the negative if you want a sharp, in-focus background. simple as.
>>
>>108671837
>16 rank
well there you go
huge dataset means increase your rank to at least 64 (128 if you have over, say, 300 images)
you can always reduce it back down later, but it won't train that well at rank 16
and if you're doing a large dataset + high res (1024+) then yeah, you gotta push to at the very least 64; i'd go 256 if you're able to
>>108671848
you do it your way then, i've tested this extensively and it does matter with models using LLM (or at least t5) as text encoder
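For intuition on why the file size tracks rank (and never grows with more epochs), a rough back-of-envelope sketch, with made-up module counts and widths rather than Anima's real architecture:

rank = 16
n_modules = 300          # hypothetical number of targeted attention/MLP matrices
d_in = d_out = 2048      # hypothetical layer width
bytes_per_param = 2      # bf16
params = n_modules * rank * (d_in + d_out)      # two low-rank factors per matrix
print(params * bytes_per_param / 2**20, "MiB")  # ~37.5 MiB at rank 16; doubles at rank 32

Training longer only changes the values of those weights, not how many there are, so epochs 1-8 landing at the same size is expected.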
>>
File: 1747837639991977.png (2.4 MB)
>>
>>108671774
I have cropped the watermarks etc. while collecting them. Going through more than a thousand images at one go would drive me crazy.
I will use some LLM to caption. I prefer to caption when the dataset is complete.
>>108671797
I know there is possibly some quality to squeeze by preparing manually, but I will just let the trainer's lanczos do the job. I will use bucket step of 16 since dataset is large; shouldn't be too disruptive to images.
>get more general stuff too (animals/pets, cars, locations, etc)
Yeah around one third of the dataset is "general stuff".
To some degree I need to focus on the primary purpose of the lora though (1girl and coom.)
>>
>>108671775
>>108671887
>:3
OWO WHATS THIS?
https://www.youtube.com/watch?v=7mBqm8uO4Cg
>>
>>108671890
>I have cropped the watermarks etc. while collecting them. Going through more than a thousand images at one go would drive me crazy.
I went to the extreme on my 1800+ data set. I literally photoshopped out watermarks and signatures by hand lmao.
>>
>>108671837
Irrespective of dataset size I would go above 16 rank for cramming multiple characters on a single lora.
>>108671880
256 might be too much even for 1835 images. These characters aren't complex enough to warrant it.
128 should work better.
>>
>>108671880
np
>>108671890
>I will use bucket step of 16 since dataset is large
the reason to crop/resize is that if your reso doesn't fit a bucket it'll get ignored, and if the buckets don't have enough images to get filled, they dup (or drop, depending on the trainer). each trainer does it differently too, so you gotta adapt, or at least keep the resolutions at the expected "normal" sizes. and lately most images found online are random res, since people crop/screenshot without thinking
but your dataset's big enough that you might not notice that too much
if you're just training nsfw women+sex, then keep the rest out or use it as a reg dataset. otherwise your lora's just gonna learn a bunch of crap and not work too well for the purpose
>>108671911
there's ways to avoid doing that nowadays but yah been there done that
>>108671914
>Irrespective of dataset size I would go above 16 rank for cramming multiple characters on a single lora.
this too
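For anyone wondering what the bucketing actually does, a minimal sketch of sd-scripts-style aspect-ratio bucketing (simplified: real trainers also cap total pixel area per bucket and handle cropping):

def make_buckets(min_side=256, max_side=2048, step=16, max_area=1024 * 1024):
    buckets = set()
    w = min_side
    while w <= max_side:
        h = min(max_side, (max_area // w) // step * step)
        if h >= min_side:
            buckets.add((w, h))
            buckets.add((h, w))
        w += step
    return sorted(buckets)

def assign_bucket(img_w, img_h, buckets):
    # nearest aspect ratio wins; the image is then resized/cropped to that reso
    ar = img_w / img_h
    return min(buckets, key=lambda b: abs(b[0] / b[1] - ar))

buckets = make_buckets()
# 4000x2000 and 2000x1000 share the 2:1 ratio, so they land in the same bucket
print(assign_bucket(4000, 2000, buckets) == assign_bucket(2000, 1000, buckets))  # True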
>>
>>108671918
>>108671914
Nicesu, thank you helpful anons.
>>
File: kekestone will probably love it AWOOO.jpg (516.4 KB)
Babe wake up, another pixel space image model got released
https://pixeldit.github.io/
https://github.com/NVlabs/PixelDiT
https://huggingface.co/nvidia/PixelDiT-1300M-1024px
>>
File: 1759732755864422.png (954.9 KB)
>>108671964
VAEs BTFOOO
>>
>>108671918
Aren't buckets based on aspect ratios? 4000x2000 and 2000x1000 belong to the same bucket after resizing.
At least that's how sd-scripts does it, I believe.
I am not certain, but I don't think it drops images.
>if you're just training nsfw women+sex, then keep the rest out or use it as a reg dataset. otherwise your lora's just gonna learn a bunch of crap and not work too well for the purpose
No, I also want it to be a decent general-purpose realism lora. It's just that 1girl coom is the primary purpose of anima.
The reason I am going out of my way to hand-curate a thousand images is to cram both into a single lora.
>>
File: lmaooo.png (414.8 KB)
>>108671964
>Text encoder: Gemma-2-2B-IT
Nvidia bros are still living in 1983
>>
>>108671984
yes, and i think only sd-scripts (or the kohya gui, to be specific) will crop resolutions that don't fit a close bucket, but all the rest don't; they just skip images with resolutions not fitting a bucket aspect ratio, and/or use fewer images in that bucket (aitoolkit), and/or fill it with dupes (diff-trainer), unless they fixed that shit in the past couple of months
>to cram both into a single lora.
do two loras: one with all the images, one with just the sex/girl stuff
the full set will lean heavily toward coom material, so it may not work well as general realism; for that you'd need way more general stuff to cover many more concepts. with a heavy bias toward sex/girl stuff you're just gonna get more of that
>>108671998
this would be like the third "pixel space" model that i've seen reported this year, not counting furraidiance
>>
File: file.png (268.3 KB)
>>108671964
that's the most important part: they show that removing the VAE helped them reach new heights. that's promising as fuck
>>108672010
really? I thought he was using the pixnerd method, this one is something new
>>
Fuck, I managed to get a scraped NovelAI API key, and honestly, there's no comparison. They won. I don't want this key to stop working. NAI is perfect, it's incomparable. The colors, the style... the style. That copyrighted style cannot be matched by any anime model out there. No LoRA can even come close. And the speed.. generating kino images in 5 seconds, my God. And to think I used to underestimate it before I got this key...
>>
>>108672030
>he goes around claiming to have created the first vaeless model
if he really said that then it's retarded as fuck; all he does is copy papers, the same papers that created those models before him
>>
Never again. Never again Anima. Never again SDXL. NAI won, NAI won by a landslide. Local anime simply isn't worth it anymore. It's beyond saving, there's no fix, no cure. There aren't enough skilled people working on it, and there probably never will be. NAI won, and it pains me deeply to say it, it genuinely hurts, but now that I'm using it again, I can clearly see it's superior in every single aspect.
>>
File: wen.png (727.4 KB)
>>108671964
wen comfyui?
>>
I am completely defeated, defeated by beauty, by aesthetics, by pure quality by GOOD TASTE. It's been so long since I've seen intrinsic quality in my local generations. I had forgotten what beauty even looked like, and NAI made me rediscover it.
>>108672054
No. Anima has no beauty, no quality. It has stilted intelligence alone, it lacks life, it lacks beauty. That feeling of looking at an image and instantly falling in love with it... I hadn't experienced that in a long time, and NAI brought it back.
>>
File: 2026-04-23-03-00-25_00001_.png (2.5 MB)
>>
>>108671964
it really makes no sense to me that nvidia uses weird-ass licenses like that
it's not like their money comes directly from training ai models, apart from shit like dlss which is tied to their gpus anyway
>>
File: kantokus_21.jpg (601.2 KB)
>>108672032
>>108672044
I'll take the bait.
NAI's 4.5 model is worse than anima (although inpaint, vibe transfer and PT are really good). As an example, try getting any kind of reasonably complex backgrounds out of NAI.
>>
>>108672096
No anon, no. I'm looking at my local gens from the past year since I started using local, and I can tell you with confidence that Anima is junk food. Anima is slop. I don't want to touch any local anime model ever again, and it's not personal with Anima specifically, but with the entire local ecosystem as a whole.
>>
>>108672129
get your ai genned text posts the fuck outta here, faggot. I hate you api niggers, we're not switching to your token system. my shit will always work on my computer I don't care if it's several magnitudes worse than your shit ass cloud. fuck you cock sucker nigger faggot.
>>
No, no, no. Everything is wrong with local. Nothing makes sense. Everything is half baked, everything is low quality, everything is left to the free will of the community, and the community produces something truly awful.
>>108672121
this is not copyrighted my friend, tdrussell fears copyright
>>
>>108672129
Then go away: >>108653190
Oh right, you pissed yourself having to share the same room with Nano Banana Pro and GPT-Image-2.
>>
File: 1756755676500602.png (495 KB)
>>108671964
I can see lodestone switching to that method; it's way more accurate than pixnerd
>>
File: pixelDit0001.jpg (212.3 KB)
>>108671341
>>108671964
PixelDiT
>>
>>108672140
I'm using a translator dude, and I want you to know that I completely agree with you. I HATE NAI. I HATE that something so good is out of my reach, that I can't own it, that I can't have it in a physical form.
I HATE NAI.
I HATE NAI FOR BEING SO PERFECT
>>
>>108672116
Nvidia has like 4 licenses they release stuff under, and they seemingly pick one at random. Off the top of my head: Apache 2, the Nvidia open-weights commercial license, the Nvidia research license, and now this one. They really ought to keep things simple and just make everything Apache 2.
>>
File: Make it look like a photo, okay? A realistic photo! No CGI look! A spontaneous photograph capturing .jpg (212.6 KB)
>>108672180
i'm trying to copy the sample prompts which come out okish but not this one
>>
>>108672161
>>108672187
can you also try that model >>108670859
>>
>>108672121
The character is stiff and doesn’t say anything. Its only intelligence shows in building the background and making the character ride a scooter, but the image itself doesn’t make you fall in love with it. It doesn’t say anything, it’s an image without life, it has only 2026 level intelligence.
>>
>>108672161
Dios mio...
>>108672187
Okay, now that's just genuinely scary.
>>
>>108672203
Yes, for enterprise shit where the money is.
I am very skeptical the small subset of AMD users who might bother with these weird research experiment models are considered at all when writing the license.
>>
File: PixelDIT00002.jpg (150.1 KB)
8-bit scroller on PixelDiT
>>108672205
just the commands on the github (it downloads the model, so don't bother with the HF files). i just changed the "prompt.txt" file to whatever, using the same neg and other CLI options shown in the example command
it takes maybe 5-8gb? the model is like 5gb
>>108672199
>60.3 GB
nope lol
maybe later if no one else does
>>
>>108672205
>>108672224
>python inference.py --config configs/PixelDiT_1024px_pixel_diffusion_stage3.yaml --model_path pixeldit_t2i_v1.pth --txt_file prompts.txt --custom_height 1024 --custom_width 1024 --cfg_scale 2.75 --seed 2025 --negative_prompt "low quality, worst quality, over-saturated, blurry, deformed, watermark" --work_dir "."
i used the comfyui venv, only needed a couple packages from the requirements.txt
didnt even unload comfy lol
>>108672228
2026-04-24 06:15:33 - [PixDiT] - INFO - Inference with torch.bfloat16, guidance_type: classifier-free, flow_shift: 4.0
loading text encoder from Efficient-Large-Model/gemma-2-2b-it
Loading checkpoint shards: 100%|| 2/2 [00:00<00:00, 57.57it/s]
2026-04-24 06:15:44 - [PixDiT] - INFO - PixDiTTrainer:PixDiTTrainer, Model Parameters: 1,311,388,547
2026-04-24 06:15:44 - [PixDiT] - INFO - Generating sample from ckpt: pixeldit_t2i_v1.pth
2026-04-24 06:15:46 - [PixDiT] - WARNING - Missing keys: []
2026-04-24 06:15:46 - [PixDiT] - WARNING - Unexpected keys: []
2026-04-24 06:15:46 - [PixDiT] - INFO - Saving images at ./vis
2026-04-24 06:15:46 - [PixDiT] - INFO - Eval first 1/1 samples
2026-04-24 06:15:46 - [PixDiT] - INFO - Sampler flow_dpm-solver
2026-04-24 06:15:46 - [PixDiT] - INFO - Inference with torch.bfloat16, guidance_type: classifier-free, flow_shift: 4.0
100%|| 49/49 [00:03<00:00, 12.51it/s]
>>
>>108672224
>the model is like 5gb
Oh, so they released it in fp32. No one has bothered with that for a while.
Coupled with >>108671990, it makes me think they trained this model a while back but are only releasing it now.
>>
File: 1760908533743.png (2.6 MB)
>>108672238
zeta isn't pixel. Only radiance is.
>>
File: PixelDIT0003.jpg (196.2 KB)
it's not very good at complex prompts unless you use the exact sample prompts provided by them lel
>>
>>108672273
>>108672284
zeta = zit+chroma
kaleidoscope = klein4b + radiance
radiance = pixel space
there may be another test of his i missed
>>108672300
huh i missed that
>>
File: awooooooo.jpg (3.3 MB)
>>108672300
>No he decapitated Z-Image
based Robestone
>>
actually the noisiness is much less bad when you change the saving format from jpg to png in inference.py
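i.e. the kind of one-line change meant here, sketched (their inference.py's actual save call may look different):

# before: lossy JPEG stacks compression noise on top of the model's output
# image.save("vis/sample.jpg")
# after: lossless PNG keeps the raw pixels
# image.save("vis/sample.png")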
>>
File: A blonde woman, busty, full body, bikini, looks straight into the camera, soft light, shot on Agfa V.png (1.4 MB)
>>108672364
yeah, but it's also one of their cherrypicked prompts and seeds
>>
>>108672411
the simple fact that it looks like your regular (VAE) diffusion model at the same size tells the whole story. we're talking about a pixel space model; those things are supposed to look like shit, look at kekestone's attempt. this shit is new and hard to master
>>
>>108672161
>>108671964
this will be the next model comfyorg and tdrussell choose btw
>>
File: 1746778314755135.png (2.1 MB)
>>108672161
>>108672387
seems like local is regressing all the way back to sd 1.4 era
this is just sad at this point
>>
>>108672468
dude, i've been through this so many times already
if you want to use the 1b excuse then they should have made the model bigger (2b anima already looks good)
there is so much unusable dogshit coming out that will always be unusable dogshit, and i don't have the patience for copium and hopium for the 50th time
>>
>>108672485
>if you want to use the 1b excuse then they should have made the model bigger (2b anima already looks good)
remind me what anima used as a base model: fucking cosmos 2b. do you remember how bad that model was? there's a whole universe between a base model and a highly finetuned model, come on dude
>>