Thread #108604726
Discussion and Development of Local Image and Video Models

Previous: >>108597963

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news

04/14/2026

>ERNIE-Image: Text-to-image generation model built on a single-stream Diffusion Transformer
https://huggingface.co/baidu/ERNIE-Image

>Danbooru Dataset Filter: High-Speed Metadata Explorer for AI Training
https://github.com/ThetaCursed/Danbooru-Dataset-Filter

>ChatGPT will praise the mood and 'bedroom/DIY texture' of fart sounds pulled from YouTube
https://www.pcgamer.com/software/ai/chatgpt-will-praise-the-mood-and-bedroom-diy-texture-of-fart-sounds-pulled-from-youtube

>RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details
https://limuloo.github.io/RefineAnything

>Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation
https://github.com/leeruibin/hybrid-forcing

>Energy-oriented Diffusion Bridge for Image Restoration with Foundational Diffusion Models
https://jinnh.github.io/E-Bridge

>FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data
https://github.com/yuandaxia2001/FashionMV

>Degradation-Aware and Structure-Preserving Diffusion for Real-World Image Super-Resolution
https://github.com/jiyang0315/DASP-SR.git

04/13/2026

>LTX 2.3 Distilled v1.1
https://huggingface.co/Lightricks/LTX-2.3/blob/main/ltx-2.3-22b-distilled-1.1.safetensors

>UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations
https://huggingface.co/tencent/Unicom-Unified-Multimodal-Modeling-via-Compressed-Continuous-Semantic-Representations

>CatalogStitch: Dimension-Aware and Occlusion-Preserving Object Compositing for Catalog Image Generation
https://catalogstitch.github.io

>Realizing Immersive Volumetric Video: A Multimodal Framework for 6-DoF VR Engagement
https://github.com/Metaverse-AI-Lab-THU/ImViD

>Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise
https://github.com/gezbww/Vis_Prompt

>MixFlow: Mixed Source Distributions Improve Rectified Flows
https://github.com/NazirNayal8/MixFlow
>>
>mfw Research news

04/14/2026

>EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model
https://editcrafter.github.io

>VGA-Bench: A Unified Benchmark and Multi-Model Framework for Video Aesthetics and Generation Quality Evaluation
https://arxiv.org/abs/2604.10127

>FineEdit: Fine-Grained Image Edit with Bounding Box Guidance
https://arxiv.org/abs/2604.10954

>AIM-Bench: Benchmarking and Improving Affective Image Manipulation via Fine-Grained Hierarchical Control
https://arxiv.org/abs/2604.10454

>Continuous Adversarial Flow Models
https://arxiv.org/abs/2604.11521

>OmniScript: Towards Audio-Visual Script Generation for Long-Form Cinematic Video
https://arcomniscript.github.io

>Immune2V: Image Immunization Against Dual-Stream Image-to-Video Generation
https://arxiv.org/abs/2604.10837

>Differentiable Vector Quantization for Rate-Distortion Optimization of Generative Image Compression
https://arxiv.org/abs/2604.10546

>Rethinking the Diffusion Model from a Langevin Perspective
https://arxiv.org/abs/2604.10465

>Do Thought Streams Matter? Evaluating Reasoning in Gemini Vision-Language Models for Video Scene Understanding
https://arxiv.org/abs/2604.11177

>SVD-Prune: Training-Free Token Pruning For Efficient Vision-Language Models
https://arxiv.org/abs/2604.11530

>Revisiting Compositionality in Dual-Encoder Vision-Language Models: The Role of Inference
https://arxiv.org/abs/2604.11496

>LDEPrompt: Layer-importance guided Dual Expandable Prompt Pool for Pre-trained Model-based Class-Incremental Learning
https://arxiv.org/abs/2604.11091

>Agentic Video Generation: From Text to Executable Event Graphs via Tool-Constrained LLM Planning
https://arxiv.org/abs/2604.10383

>Omnimodal Dataset Distillation via High-order Proxy Alignment
https://arxiv.org/abs/2604.10666

>What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models
https://arxiv.org/abs/2601.06165
>>
ok ernie turbo is fucking garbage at prompt following
>>
>>108604751

forgot prompt
>A photorealistic candid photo of a woman with long, flowing hair that transitions from icy white at the roots to vibrant cyan-blue at the tips, cascading over her shoulders and partially obscuring her face as she looks downward. She wears a form-fitting, sleeveless top with a high neckline, primarily white with bold geometric yellow trim and a large, faceted blue diamond-shaped emblem centered on the chest. The garment has a structured, armored appearance with gold-brown segmented panels along the waist and hips, suggesting a fantasy or sci-fi outfit. Her right hand rests on a smooth, light-colored surface in the foreground, fingers slightly curled. The background is an out-of-focus twilight landscape under a deep indigo sky, with a soft gradient of magenta and purple along the horizon. A faint, glowing horizontal line runs across the lower portion of the frame, possibly a railing or edge of a platform. The lighting is directional, casting soft shadows and highlights on her hair and clothing, emphasizing texture and form with natural depth and contrast. No text, speech bubbles, or tears are visible.
>>
>>108604754
wait nvm im gay, fucked up a setting
>>
can someone litterbox or gofile some nsfw gens of ernie image? the huggingface demo is too censored.
>>
>>108604759
But can it do anime loli porn?
>>
File: Ernie.png (2.1 MB)
>>108604759
>no edit
that's a shame, imagine doing edits with such a monster of a model, the prompt following is on another level, can't believe it's using a simple 3b text encoder to get that shit, and fucking ministral of all things
>>
>>108604786
ZAMN!
>>
>>108604759
https://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image_turbo.json
https://huggingface.co/Comfy-Org/ERNIE-Image
>AttributeError: 'Ministral3_3B' object has no attribute 'generate'
thanks Comfy
>>
>>108604759
Can it do nude?
>>
>>108604759
Can it do shrek?
>>
>>108604759
bruh, turbo has garbage anatomy, downloading the base model
>>
>>108604759
buy an ad
>>108604810
have you pulled?
>>
>>108604842
>implying the monk didn't cultivate enough to master the four immeasurables and grow two extra arms
lol?
>>
The gen times for non-turbo on my 3060 are a bit slow, two and a half minutes for 20 steps; it probably needs more steps, but that's not unusually slow for a model of this size.
Let's see how it holds up under further testing.
>>
>>108604817
>Can it do nude?
https://litter.catbox.moe/9z9qwbnxpflyqt27.jpg
>>
>>108604751
>>108604763
What did you fuck up so I can avoid it
>>
>>108604861
I see you tested the base model, I hope it's the good one; I don't really like my tests on turbo so far
>>108604843
yes I'm on the latest version, seems like comfy hasn't implemented the prompt rewriting yet
https://github.com/Comfy-Org/ComfyUI/pull/13395
>Needs template before it works properly.
>>
https://huggingface.co/lightx2v/Wan2.2-Distill-Models/blob/main/wan2.2_i2v_A14b_high_noise_lightx2v_4step_720p_260412.safetensors

Why is the high and low noise close to 60gb?
>>
>>108604759
What VAE does it use?
>>
>>108604817
>>108604862
https://litter.catbox.moe/tz2g5anklf3bmmmt.jpg
as expected, garbage genitals lol
>>108604879
the best one, flux 2's vae
>>
>>108604817
>>108604772
It hasn't been trained on boobs, it generates mediocre breasts. Though from my very limited testing it doesn't seem to be deliberately poisoned like Flux models are.
>>108604871
I just had a feeling that the distill will be problematic and went for the base immediately.
>>
>>108604888
Is this turbo or base
>>
>>108604889
>I just had a feeling that the distill will be problematic and went for the base immediately.
good, it was about time we got a fully finetuned model that isn't distilled; no need for some NAG cope, we can directly use CFG, and we'll be able to train and make loras on it
>>108604893
turbo
>>
>>108604872
FP32 precision.
4 bytes for every weight:
14b x 4 = 56
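Same arithmetic as a quick script (assumes ~14B params per expert and ignores safetensors metadata overhead):

```python
# Rough checkpoint size from parameter count and dtype width.
# 4 bytes/weight for FP32, 2 for FP16/BF16, 1 for FP8.
def checkpoint_gb(params_billions: float, bytes_per_weight: int) -> float:
    """Approximate file size in GB (1 GB = 1e9 bytes), weights only."""
    return params_billions * bytes_per_weight

print(checkpoint_gb(14, 4))  # FP32: 56.0 GB, matching the ~60 GB files
print(checkpoint_gb(14, 2))  # FP16/BF16: 28.0 GB
```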
>>
>>108604906
Huh, I haven't seen the fp32 version before.
>>
>>108604842
>downloading the base model
I really don't like the anatomy, like this is base at 50 steps, come on
>>
>migu
:)
>>
>>108604940
smells more and more like a nothingburger, the realism quality is Klein tier, but ernie can't even do edits to compensate, sad
>>
>>108604940
I am wondering if Comfy fucked something up, or did they do Chroma-tier cherry picking for the images?
>>108604922
FP32 is usually only used for training because the benefits to inference are almost non-existent.
>>
>>108604974
50 steps turned out better.
It also seems a bit wild when it comes to adding shit to the image. First time I have seen an AI add a knife to a "1girl, standing" prompt unsolicited.
>>
>>108604959
>>
>>108604983
Oh I think the image is so different because Control after generate is bugged with the retarded subgraph Cumfy has shipped with the template, so it ran a whole new seed.
The point about the knife stands though, same prompt.
>>
>>108604991
>>
File: o_00245_.png (390.7 KB)
>>
>>108605000
Ernie knows only one anime style: "Nano Banana Pro"

:]
>>
>>108605023
kek, I think I've seen enough
>>
>>108605045
maybe turbo at 16 steps is the best it can get
>>
File: file.png (1.5 MB)
ernie base with the default settings and default prompt in comfyui gave me a guy with 3 legs.. not a great start
>>
>>108605060
Z-image turbo be like:
https://youtu.be/WO23WBji_Z0?t=10
>>
One of the better gens I got.
Still has this Kleiny look to it.
>>
File: o_00247_.png (1.7 MB)
>>
>>108605080
something is wrong with the proportions of their bodies, looks like they're midgets, Flux Kontext style lool
>>
>>108605064
>3 feet
>>108605080
>3 hands
lol I think I won't be downloading this
>>
it's all right, the jews will save us
https://xcancel.com/ltx_model/status/2044108750592643279#m
>>
This model has been trained on 3 billion images of Nano Banana Pro kek.
>>
>>
>>108605126
>This model has been trained on 3 billion images of Nano Banana Pro kek.
Z-Image supremacy, yeaaah! We had Qwen Edit and then the Tongyi model/s, but all the other Chinese t2i are equally sloppy: GLM, this, whatever.
>>
I am kinda liking things about it despite its faults.
But they probably either overcooked this thing or it needed a little bit of post-training aesthetic alignment to temper the schizo anatomy.
>>
File: o_00252_.png (770.7 KB)
>>
>>108604974
>I am wondering if Comfy fucked something up
I think the model is just not that good, in my tests it's inferior to Z-image turbo almost everywhere
It can be a great base model to train on though, but yeah, 8b is big, people prefer something smaller like 2b so they can do Anima-type models or some shit
>>
>>108605183
>8b is big
>>
>>108605183
yup, same experience, back to zit for me
>>
>>108605183
>it's inferior to Z-image turbo almost everywhere
the niggas thought that training a model only on Nano Banana Pro's images would do the trick, all we got is that Synth-ID watermark pattern everywhere lmao, once again, synthetic data BTFO
>>
>>108605115
oops, forgot to attach their paper
https://arxiv.org/abs/2604.11788
>>
>>108605183
I think there are issues with finetuning klein and ZIB for some reason.
If it responds to training well, this looks salvageable. Decent text encoder + best vae + good size balance between quality and being able to run on most hardware + OK quality bar anatomy issues + mid instruction following that can possibly be ironed out.
I hope someone besides Kekstone takes a crack at it.
>>108605208
Can't we improve realism with finetuning/loras? I know training on slop sucks but banana pro is a really high quality baseline.
>>
File: weird.png (97.8 KB)
>>108604759
>https://ernieimageprompt.com/
either something is wrong with ComfyUI, or those baidu fucks are straight up lying to us; I'm not getting anything even close to the images on that site
>>
>>108605236
Chinks lying? How can it be...
>>
I love to complain about the jpeg artifacts on Z-image turbo, but with Ernie we've arrived at a whole other level, jesus this is ugly af
>>
>>108605262
I don't think those are jpg artifacts, probably the watermark patterns of NBP >>108605126
>>
>>108605262
Is this Turbo? I am not really getting these on the Base.
>>
turbo seems more slopped overall, and if there's one thing I can say base does better than Z-image turbo, it's that it seems to know more stuff, but knowing more stuff is useless if the anatomy is ass and the realism isn't even close either
>>
>comparing z turbo to ernie base
Why not compare base to base tho
>>
>>108605278
I think you are right anon, base doesn't seem to have that much noise
>>
>>108605321
as a ""base"" model it looks like it's destroying Z-image base, let's hope that we can train it well then, both ZIB and Klein had their issues
>>
I don't see anything Ernie is the best at; Chroma has the best kino, Z-image has the best realism and anatomy, this shit is just slop after slop
>>
>>108605317
it's been compared here >>108605080
>>
>>
File: kek.jpg (830.4 KB)
>>
File: now what?.png (113.9 KB)
the ledditors are loving it though
https://www.reddit.com/r/StableDiffusion/comments/1slg4wh/we_may_have_a_new_sota_opensource_model/
>>
>piggies love slop
STOP THE PRESSES A FROGFAG IS SPEAKING !!!
>>
Can't the chinks do anything other than make cheap copies of murica's products?
>>
>>
>>
>>108605408
>>
>Tezuka Rin \(katawa shoujo\) sitting on a bench
is that how you're supposed to prompt on Anima? I can't manage to get her
>>
>>108605115
distilled seedance 2.0 (ltx 4) and khazar milkers honeypot spy gf was promised to me 6 gorillion years ago but unironically.
>>
>>108605262
>>108605278
>>108605276
i never had the artifacts problem with zit, just dont use the suggested retard samplers and instead use:
euler (/euler_a) + simple (/normal)
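If you save workflows in ComfyUI's API format, you can force those settings across a whole workflow with a small patch script. A sketch only: it assumes the stock KSampler node's `sampler_name`/`scheduler` input fields; custom sampler nodes use different names.

```python
import json

def force_sampler(path, sampler="euler", scheduler="simple"):
    """Set every stock KSampler in an API-format workflow JSON to the given sampler/scheduler."""
    with open(path) as f:
        wf = json.load(f)  # API format: {node_id: {"class_type": ..., "inputs": {...}}}
    for node in wf.values():
        if node.get("class_type") == "KSampler":
            node["inputs"]["sampler_name"] = sampler
            node["inputs"]["scheduler"] = scheduler
    with open(path, "w") as f:
        json.dump(wf, f, indent=2)
```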
>>
>>108605468
Yes for tag based prompts but I don't think there is full consensus on how to prompt characters when prompting with natural language. Try Tezuka Rin from Katawa Shoujo.
If all options are exhausted try it on preview 2.
>>
>>
>>
File: blaze it.png (1.5 MB)
>>108605468
>Tezuka Rin from Katawa Shoujo, a girl with short messy red hair and green eyes and no arms, sitting on a wooden bench, wearing her school uniform, calm distant expression, soft afternoon light, On the left knee there's a plush of Hatsune Miku, on the right there's a plush of Kazane Teto
skill issue
>>
https://xcancel.com/DylanTFWang/status/2043952886166761519
>Open-source tomorrow
damn, if it's not too big to run locally maybe Tencent finally cooked
>>
big jump in real time interactable video gen

Waypoint-1.5: apache2, first-person-shooter focused, 1.2b, 720p, 512 frames of context, 56fps on a 5090; needs at least a 30xx

online demo https://www.overworld.stream/
https://github.com/Overworldai/world_engine
>>
>>108605539
Anons what's the actual use case for this world model thing?
Every single world model I see looks like "cool tech demo you play for five minutes and then never touch again".
>>
>>108605539
forgot that link too
https://3d-models.hunyuan.tencent.com/world/
>>
>>
>>108605552
newfag. luddite. brown, even.

the point is to enjoy the cool new tech and tinker with it while thinking about how you can maybe use it and change it yourself now while also thinking about how cool it will be in a year from now on.

for example, chaining multiple generated rooms you can traverse infinitely is a software problem and thus solvable relatively easily, while letting you get much more out of the tech.
>>
>>108605550
>512 frames of context 56fps on 5090
So? less than 10 seconds? lol
>>108605552
desu I'd enjoy lurking on a world made out of a cool drawing image, like this shit
>>
A very sloppy double exposure sloppa.
>>
>>108605586
https://huggingface.co/Kijai/WanVideo_comfy/tree/main/LoRAs/Wan22_Lightx2v
kijai made the loras out of the new lightning version of Wan 2.2
>>
There are some who call me...Tim
>>
>>
audio in ltx 2.3 1.1 seems nicer. we wuz hogwarts:

https://litter.catbox.moe/hnjzczuml64krkjr.mp4
>>
>>108605592
not bad; Wan 2.2 may be an ancient model, but it's still the best thing we have :')
>>
>>108605651
that's cool, I was tired of the ultra metallic sound of ltx, if those jews keep improving on that shit it might end up being a genuinely good model, still a long way to go to seedance 2.0 though lol >>>/wsg/6128285
>>
>>108605654
>first frame + last frame
kek, I forgot how much vram wan 2.2 asks for, I think I might return to LTX just for that
>>
>>108605627
lul, did you combine monty python screenshot with the cat meme?
>>
>>108605686
What do you mean by that, isn't LTX heavier on resources?
>>
>>108605701
it uses a less heavy VAE so the kv cache usage is less punitive, good luck going for 720p on wan 2.2
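For intuition on why 720p is so punishing, a rough token-count sketch; the compression and patch factors below are typical for recent video DiTs, not exact Wan 2.2 numbers:

```python
# Transformer sequence length after VAE compression + patchify.
# Assumed: 4x temporal / 8x spatial VAE, 1x2x2 patchify (illustrative values).
def dit_tokens(frames, height, width, t_down=4, s_down=8, patch=(1, 2, 2)):
    t = 1 + (frames - 1) // t_down            # compressed temporal length
    h, w = height // s_down, width // s_down  # compressed spatial dims
    pt, ph, pw = patch
    return (t // pt) * (h // ph) * (w // pw)

print(dit_tokens(81, 480, 832))   # 480p, ~5s clip: 32760 tokens
print(dit_tokens(81, 720, 1280))  # 720p, ~5s clip: 75600 tokens
# self-attention cost scales roughly quadratically with this count
```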
>>
>>108605723
this. i can make 720p resolution gens on ltx. literally impossible on wan-hunyuan
>>
>>108604726
does any of this shit run simply and reasonably well on AMD cards yet?

I have tried multiple times over the last couple of years to get a functional pipeline up and running on my 6800xt 16gb and it has never once worked

I'm no genius but I'm also not retarded
>>
>>108605689
yeah
>>
MLEM MLEM MLEM HECKIM CHNGUS
>>
>>108605813
if linux rocm + forge neo work fine
if windows i pray for you
>>
>>
File: BULLSHIT.png (632.6 KB)
https://youtu.be/XUxKm40X__g?t=907
benchmarks was a mistake...
>>
using ltx 2.3 ver 1.1 (new one):

the man says "I'm LITERALLY Ryan Gosling in the movie Drive", and the camera zooms out through the windshield as he speeds down the road in new york city at night.

https://litter.catbox.moe/ekn0ujlh88fd37ox.mp4
>>
https://xcancel.com/flowersslop/status/2043591433731408126
those retards at Ernie should've trained their model on GPT-image 2's output instead lool
>>
>>108606080
when it zooms out it looks like some video game LOD trick shit where the character gets lower and lower polygons as he moves away from the camera kek
>>
yo is anima good yet?
>>
>>108606080
this turned out better

the man says "I'm LITERALLY Ryan Gosling in the movie Drive", and the camera zooms out very far through the windshield as he drives the car off a ramp on a road in new york city at night.

https://litter.catbox.moe/z04qhvm91v3etfmw.mp4
>>
>>108606080
Omg he's litter.catbox moe
>>
>>108606114
not bad at all actually, if you want you can also share your videos here, it allows sound >>>/wsg/6126746
>>
>>108606114
>>108606080
are you using the distilled model, or the base model with the distilled lora applied on top of it?
>>
babe, wake up, a second image model has hit the tower
https://huggingface.co/NucleusAI/Nucleus-Image
>>
>>108606190
>We release the full model weights, training code, and dataset, making Nucleus-Image the first fully open-source MoE diffusion model at this quality tier.
kek, if they release the dataset it means they trained this shit with only copyright-free garbage, DOA
>>
>>108606190
>>
>>108605855
rip
>>
>>108606219
this is so deceptive, it's not a 2b model, you still need to load the whole model (17b) to get this shit running
>>
>>108606219
we're supposed to take them seriously when they don't even put anima on the board?
>>
>>108606219
>no Z-image
>no Flux 2
kek
>>
File: 42.png (7.2 KB)
>>108606219
geg
>>
>>108606190
why does every company insist on training on utter trash datasets
>>
>>108606219
>lumina
>janus
>hidream
>sana
Holy throwback
>>
File: 242605513.png (228.6 KB)
>>108606275
even pixart BIGMA is in their report, a shitton of models I've never heard about or only heard on release and never again
>>
>>
>>108606265
sar please the benchmarks
>>
>>108606265
saas doesnt
>>
>>108606317
that's why openai is training their new model on youtube shorts and let's play videos.
>>
>>108606333
impressive, maybe local should try that next instead of training on dall-e outputs
>>
>>108606356
they are open models, you can train them on whatever you want.
now feel free to have a meltdown about "loRa cope" and "shitmixes".
>>
Ernie Base, 20 steps
>Touhou Project characters in a screenshot of Diablo 1. Screenshot set in a gothic, candlelit cathedral dungeon — stone floors, blood-stained altars, flickering torches casting long shadows. Reimu Hakurei appears as a weathered warrior, clad in rusted plate armor with subtle Shinto motifs, wielding a glowing sword and heavy iron shield. Marisa Kirisame is a gritty sorceress, her blackened robe frayed at the edges, holding a staff crackling with low-res magical sparks. Patchouli Knowledge floats slightly above the ground like a corrupted cleric, surrounded by ancient grimoires emitting a ghostly glow. All characters match the sprite-based, isometric art of Diablo 1

>Visual fidelity must match Diablo 1’s aesthetic: muted earth tones, dark reds and greens, harsh shadows, dithering effects, and low ambient lighting. The entire composition should be a screenshot from a 1996 pre-rendered isometric dungeon crawler. Include UI elements.
Trying again with 50
>>
>>108606372
50 steps, I guess it's better?
>>
>>108605045
You saved me so much time downloading that garbage, I love you.
>>
is that new ltx2.3 version worth downloading? apparently it has much better sound or something
>>
>>108605813
On newer GPUs it should work, but 6800XT is not officially supported so far. I think the quickest way to try is to update your AMD GPU driver (to either 26.2.2 or 26.3.1), then download the latest ComfyUI portable AMD release from their Github, and see if it just werks:
https://github.com/Comfy-Org/ComfyUI/releases
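If the portable build doesn't work, a manual Linux route looks roughly like this. A sketch only: it assumes system ROCm is already installed, the PyTorch wheel index version should be checked against pytorch.org, and the gfx override is a common community workaround for RDNA2 cards like the 6800 XT rather than anything official:

```shell
git clone https://github.com/comfyanonymous/ComfyUI
cd ComfyUI
python3 -m venv venv && . venv/bin/activate
# ROCm build of PyTorch (verify the current index URL on pytorch.org)
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/rocm6.2
pip install -r requirements.txt
# 6800 XT (gfx1030) is often missing from the official support list;
# spoofing the gfx version usually gets it going
HSA_OVERRIDE_GFX_VERSION=10.3.0 python main.py
```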
>>
does okay with schizoprompts, but honestly it's just not good at fine details, this is the base model at 50 steps and it should be way better for how long it takes to gen
>>
>>108606379
50 does keep the head directions and proportions more consistent-looking to me.
>>
>>
what diffusion model is best for modifying an image based on text input
>>
>>108606407
the sound is great, but the model is still slop. especially t2v. they clearly hired cheap indian devs. no kino at all for now
>>
>>108606481
>modifying an image based on text input
if you mean something like "change this sword to a baseball bat and make her a black woman." qwen image edit or flux 2 klein.
>>
why doesn't lodestone hire the greatest model trainer alive, Sarah Peterson?
>>
is there anything like 'Kohya Deep Shrink' for anima? i mainly just want a smaller initial latent for composition and then upscale it halfway without having to set up a double pass
>>
cozy
>>
>>108606472
too old
>>
>>108606370
>>
>>108606190
why are those people wasting money on making the most slopped shit ever, the fuck do they expect? no one is gonna bat an eye at such a piece of shit
>>
>>108606618
Based
>>
>>108606263
But it still werks and is the best image model that ever existed
>>
File: lmaoo.png (975.9 KB)
>>108606190
absolute slop
>>
>>108606190
>we have zit at home
>>
>>108606219
Laxhar should train Noob2 on Qwen image. Yes, I know nobody is going to be able to run it, finetune it, and shitmerge it, but:

1. There are no good finetunings or shitmergers.
2. Most of them don't know what they're doing, or they call "improving the dataset" contaminating it with their slop.
3. It's better that this behemoth of a model only gets finetuned and updated by him and his team.
4. LLM bros have been renting GPUs to run their Noromaids since early times.

It's the best option regarding quality. At the end of the day, I want an anime image model that's excellent quality. It doesn't bother me to use a free trial from some GPU rental startup to be able to run it. Better to have good models that I can't run than to have millions of snake oil models that make me waste my time.
>>
>>108607186
on a 20b model? really? why not training ernie instead, the quality is similar, it has a better vae and it's a 8b model
>>
Finally, Tongyi has released what we've been waiting for!
>>
>>
>>108606647
I think Kohya Deep Shrink should work for anima but you need a different block number and you need to figure out what that is.
>>
https://xcancel.com/peter9863/status/2044269457086779877#m
babe wake up, Flow Matching is not the best diffusion architecture anymore
>>
File: file.png (3.7 MB)
DED
>>
>>108607260
https://xcancel.com/bdsqlsz/status/2044308129043886119#m
it's obvious we're still far from having found the perfect way to train those image/video models, at some point it'll be so elaborate we'll get a 6b model as good as Seedance 2.0, we're still in the era of computers as big as a house and as powerful as a modern calculator lol
>>
>>
>>108607290
it's impressive how well it's able to reproduce the original image, tencent is shit at making models, but when it comes to cool new training methods they are definitely cooking
https://hy-soar.github.io/
>>
>>108607260
Sounds like another garbage p-hacked meme paper that will be forgotten desu.
They apparently trained Z-Image on this thing, but while the (most probably cherry picked) prompt adherence often looks better, the images look dogshit aesthetically and fried.
>>
>>108607352
glad there's someone here who knows what they're talking about, what do you think of that method too? >>108607290 >>108607345
>>
>>108607363
The examples are better, and it includes more concrete benchmarks like OCR (although these too can easily be benchmemed).
If I must criticize, there is relatively limited data about comparisons between SOAR and RL, despite "Better results than RL at the roughly same cost of SFT" being a central part of the paper's premise.
But overall looks more credible than the other paper.
Also, I have no idea what I am talking about.
>>
https://www.reddit.com/r/StableDiffusion/comments/1slz1rq/last_week_in_generative_image_video/
the absolute state of localkeking, while seedance 2.0 is making hollywood sweat, we're still trying to figure out how to make a local model count to 3
>>
>>108607345
>RL be like: "I must make everything realistic!"
is that why models are so slopped and biased towards "realism" nowdays?
>>
>>
>>108607433
>implying seedance 2.0 doesn't fall apart with multiple characters in a scene
>>
>>
https://xcancel.com/ErnieforDevs/status/2044290766349185257#m
Oh great, another Klein tier edit model
>>
>>108607521
>another Klein tier edit model
yeah, but this one will have the apache 2.0 licence, so it's a win in my book lol
>>
How well can Ernie do anime? I'm not interested in genning 3DPD.
>>
>>108607530
>How well can Ernie do anime?
>>108605023
>>108605126
>>
File: file.png (58.1 KB)
just AI things
>>
>>
File: banished.png (3.5 MB)
>>
So Comfy doesn't have Nucleus support right now, right?
I would try the inference code but I need offloading as I am a VRAMlet
>>
>>108607521
it's always the same. The model can generate sloppy-looking, generic stock-image trash and needs a stack full of loras for anything else. You might get lucky if they don't deliberately make the model untrainable.

I want a text to vid/image model trained on the entire EvilAngel catalog (including early Rocco Siffredi titles)
>>
>>108607606
that's probably why Alibaba will never release Z-image edit, it was just too good and unslopped for the gweilos
>>
>>
>>108607541
what sampler + scheduler for turbo?
>>
>>108607663
the same as the one on the official comfyui's template
>>
>>108607199
>the quality is similar
Heh
> it has a better vae
I still use sdxl, VAE never was a problem for me but a spook
>>
>>108607694
>the same as the one on the official comfyui's template
no such thing exists
>>
>>108607707
>I still use sdxl, VAE never was a problem for me but a spook
>>
>>108607713
>no such thing exists
https://github.com/Comfy-Org/workflow_templates/blob/main/templates/image_ernie_image_turbo.json
>>
GGUF version seems completely broken
>>
>>108607521
speaking of edit models, why Comfy didn't implement that one?
https://github.com/jd-opensource/JoyAI-Image/tree/main/joyai_image_comfyui
>>
what model should i use to put a woman on a cross with nails n shit
>>
https://xcancel.com/bdsqlsz/status/2044317768414310633#m
>Illstrious SFT based on Z-image-turbo with S3-DiT.
Are we back?
>>
>>108607834
piece of fucking garbage
>>
>>108607834
Animasissies... our answer?
>>
>>108607855
anima is good but I fucking hate the backgrounds, why are they so fucking empty??
>>
>>
>>108607868
Describe the things you want to see in the background and it will put them there. You could also try describing the background as cluttered, messy, detailed, etc. (haven't tried this one yet.). You may be using artists that tend to draw undetailed backgrounds, this has a huge effect.
>>
>>108607855
here’s our answer: “BWAHAHAHAHA”
>>
you fools
>>
>>108607868
Just write, retard
>>
>>108607884
just draw, retard
>>
>>108607834
looks like ass, and ever since he started putting ai gens in his dataset i have 0 belief in any future model
>>
>>108607817
Good old ZIT or Klein maybe if it hasn't been safety trained against gore.
>>108607834
>Fine-tuned on Z-image-turbo
This is kinda scary. I am skeptical that they managed to pull it off without frying or undershooting on a distilled model.
Also every gen they use to showcase its textual capabilities has very short text, which makes me further worried that it's so fried it can't gen longer than a word now.
>>
>>108607888
>he started putting ai gens in his dataset
wait really? it's fucking doa then...
>>
Fruit punch with vodka and unpeeled bananas
>>
>>108607888
Wtf why???
>>
File: file.png (2.5 MB)
>>108607834
v3.5 has a lot of sovl, what happened?
>>
File: file.png (2.9 MB)
>>108607912
sovl vs sovless
>>
>>
>>108607834
Holy fuck, Illustrious?!? We just need to go back in time and have everything done again. Wait, noob NTR mix.
>>
>>108607834
We are back! No memory problems like some piece of shit model...
>>
>>108607834
I wish they'd finetune an edit model instead, imagine doing shit like "take this character from this input image, put him there, apply this @artist style to it", the fucking dream
>>
>>108607834
Can't wait for ZYume
>>
>>108607834
Neat, never liked Anima's overall aesthetics
>>
>>
>>108607834
NEAT, fuck tdrussell, anything is better than a 2B model
>>
>>108607916
>Nonsensical second tail on the left.
>Nonsensical background object (lamp) on the right
>The hand and the cup are broken: massive thumb, distorted fingers and handle
>Melted "cleavage" and clothes texture
>And this is probably a good gen that got picked
Yep, they cooked this thing.
It's so fucking over. We will still be finetrooning SDXL clip in 2032 at this rate.
>>
>>108607834
Give me the WAIZ
>>
>>108607834
Why Z-image turbo though? I thought this one was impossible to finetune, we have Z-image base now
>>
>>108607947
>We will still be finetrooning SDXL clip in 2032 at this rate.
anima went for a retarded base model but it's still better than SDXL so I guess we're moving in the right direction... really slowly though...
>>
>>108607949
You can finetune anything you want nonnie, the jews at comfy and nvidia just don't want you to finetune turbo models like a free man
>>
>>108607834
>However, as prompts became longer and more descriptive—and as users increasingly required multi-character interactions and structured scene composition—the limitations of the existing architecture became more apparent.
holy LLM slop, come on guys you can't write that shit by yourselves?
>>
>>
>>108607834
we don't even know if they're gonna open source it lol
>>
>>108607976
Yes, because it's an unfinetunable failed bake
>>
>>108607834
SDXL ones are less sloppy
>>
>>108608011
to be fair I don't think they finished the training, give them time (yes I'm coping, how do you know?)
>>
>>108607834
Neat, Prefect illustrious Z One obsession Z WaillustriousZ
>>
>>108608019
From the start, Z turbo is already sloppy and generates cold, boring results; the model was designed that way.
>>
>>108607834
wake me up when someone will manage to bring back the kino of midjourney
>>
>>108607959
They don't know english bro.
>>
>>108608039
Alibaba designed the training to make the model solid but predictable and boring, but the Illustrious guys are supposed to change that with their own training, I hope they'll succeed
>>
>>108608047
Is there a statement that they are working toward that objective, or is it another local delusion, erotomania, or schizophrenic thought #93949?
>>
>>108608061
the schizophrenic thought is actually believing that the Illustrious guys make boring models, have you even tested one of them?
>>
>>108607894
>Good old ZIT
could you please give me a quick rundown on how to use it?
>>
>>108607834
>Are we back?
Don't get your hopes up, I've tested both models they trained. The other is the non-turbo version.
>>
>>108608078
>I've tested both models they trained. Other is non turbo version.
showcase images or it never happened
>>
>>108608074
Just the default Comfy template, but I add an extra step and sometimes use the ModelSamplingAuraFlow node with shift around 6-7.
Boomerprompt what you want to see.
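For anyone wondering what the shift number actually does: a ModelSamplingAuraFlow-style node remaps the flow-matching sigma schedule toward the high-noise end. A pure-Python sketch of the commonly used time-SNR shift formula (check Comfy's model_sampling source for the exact variant it implements):

```python
def shift_sigma(sigma: float, shift: float = 6.0) -> float:
    """Remap a flow-matching sigma toward the high-noise end.

    Assumed formula: sigma' = shift * sigma / (1 + (shift - 1) * sigma).
    shift = 1.0 leaves the schedule unchanged; larger values spend more
    of the step budget on the noisy early steps.
    """
    return shift * sigma / (1 + (shift - 1) * sigma)

# Endpoints are preserved for any shift value.
assert shift_sigma(0.0) == 0.0
assert shift_sigma(1.0) == 1.0
# A mid-schedule sigma gets pushed up: 0.5 -> ~0.857 at shift 6.
print(round(shift_sigma(0.5, 6.0), 3))
```

So shift 6-7 mostly reshapes where the sampler spends its steps; it doesn't add steps by itself.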
>>
>>
Ace Step 2.0 when
>>
>>108608040
Is there any good theory on how Midjourney created this "Midjourney style"?
>>
>>108607834
WE BACK
>>
>>108608171
If there was, there would already be replications.
>>
>>108607834
>wirr u pick burned cfg 8 or wiped detail distill slop
>>
>>
>>
>>
>>108608171
>>108608304
>>108608368
I genned 20 images at 512p with the model. 32 steps, cfg 4, euler simple.
Some images had lesser issues like weird composition for backgrounds and problems with minor details like blurry eyes but overall I didn't get any body horror like extra limbs you get at 1024p.
Makes me think the high-res training is what's fucked (perhaps they intentionally ran too few high-res steps to save money, Chinese culture shenanigans), which makes it more likely the body horror can be ironed out during finetuning.
That is, again, IF it responds well to training, which is sadly a big if nowadays.
>>
>>108607521
>we are getting literally showered with new stuff.
but we are getting showered with a bunch of nothingburgers, it's literally a golden shower, except it isn't molten gold, it's piss
>>
ComfyUI Cloud for SeedDance2. Yes or no?
That would be... 30 15s videos for $100.
>>
What happened to Z-Image team asking for Noob dataset?
Yeah, nothing came out of that either.
>>
>>108608469
i think it is impossible to source pre-ai slop image datasets anymore unless you're a gigacorp that's been hoarding data for decades, the days of LAION are over, everything new is trained on synthetic slop
>>
>>108608530
just train on images that have been uploaded on the internet before 2022
>>
>>108608542
and how do you verify it's been uploaded before 2022 when mass scraping billions of images? that kind of metadata simply isn't available reliably
>>
>>108608551
only take images that have the metadata, there are a lot of mainstream sites that show the upload date
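The filtering step itself is trivial once a site exposes an upload date; the hard part is the scraping. A sketch of the cutoff logic (the `uploaded_at` field name is hypothetical; real sites differ, and records without a parseable date are dropped rather than guessed at):

```python
from datetime import datetime, timezone

AI_SLOP_ERA = datetime(2022, 1, 1, tzinfo=timezone.utc)

def pre_ai_only(records):
    """Keep only records with a parseable upload date before 2022."""
    kept = []
    for rec in records:
        raw = rec.get("uploaded_at")
        if raw is None:
            continue  # no metadata -> can't verify, drop it
        try:
            ts = datetime.fromisoformat(raw)
        except ValueError:
            continue  # unparseable date -> drop it
        if ts.tzinfo is None:
            ts = ts.replace(tzinfo=timezone.utc)
        if ts < AI_SLOP_ERA:
            kept.append(rec)
    return kept

records = [
    {"url": "a.png", "uploaded_at": "2019-06-01T12:00:00+00:00"},
    {"url": "b.png", "uploaded_at": "2023-01-15T00:00:00+00:00"},
    {"url": "c.png"},  # missing date -> dropped
]
print([r["url"] for r in pre_ai_only(records)])  # -> ['a.png']
```

The drop-if-unverifiable policy is the conservative choice: it shrinks the dataset but keeps contamination out.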
>>
>>108608542
I think it's more about cost.
Mass generating synth slop is relatively cheap.
You need to scrape the entire internet (in an era where many websites actively fight bots), extensively prune and filter the dataset, and then generate reliable enough captions. It's bandwidth- and time-intensive.
Of course it's still the way to go if you're aiming to make a SOTA API model where you hope to turn a profit, but if you're making slop for freeloading local peasants, who are only helping you get your name out there, why bother?
>>
>>108608530
>i think it is impossible to source pre-ai slop image datasets anymore
its trivial to detect and filter ai images now, its basically solved
also cameras still exist so you can just create your own dataset if you wanted to
>>
>>108608601
>its trivial to detect and filter ai images now, its basically solved
A bold claim. Do you have anything to back that up?
>also cameras still exist so you can just create your own dataset if you wanted to
Just travel around the world and take millions of photos. Easy.
>>
>>108608530
lol bullshit, those cocksuckers do it on purpose
>>
its........ kino
>>
prompt: swastika (flux could do it, and it was made by germans kek)
>>
>>108607834
WE
ARE
BACK
Until the model flops like every local model, but in the meantime
WE
ARE
BACK


ANI WON
TDRUSSELL LOST
6B IS BETTER THAN 2B
>>
>>108608688
>bigger than Z-image turbo
>sloppier than Z-image turbo
>Worse anatomy than Z-image turbo
what were they thinking? this model has no place anywhere
>>
>>108608688
>the sky
rofl
>>
>>108608688
>2024
>a woman lying on grass

>2026
>a man lying on sand

2 years later and we still have those issues, sad
>>
>>108608709
Is this what banana (pro) gens when you prompt it swastika?
Someone should test it kek.
>>
>>108608749
nah, google is more based than that
https://arena.ai/
>>
>>108607834
sounds like it might be good, where can I download it?
>>
>>108607834
>shill handpicked the best of the lot
How about posting the rest lol. I'd take 8 fingers per hand over this pure slop.
>>
>>108608798
>passable
>totally deepfried
>underbaked
Just bake your own loras, faggots, and use them with whatever model you want. All these AI "researchers" can't even make a passable bench image lmao.
>>
https://civitai.com/models/2544636/wai-anima?modelVersionId=2859702
Anima won. Every single Civit slopper will switch over, now that it has been blessed by the #1 Civit creator.
>>
BABE BABE, WAKE UP WAI ANIMA HAS RELEASED!
https://civitai.com/models/2544636/wai-anima

>>108608824
shill better
>>
File: slop.png (6.6 KB PNG)
>>108608861
>>
Thoughts on Ernie Image? I think it's ok but I'm not sure it offers that much overall in terms of the actual quality / speed ratio
Seems a little too reliant on the extra prompt-enhancer model as well, which adds even more overhead
>>
>>108608876
pure synthslop
>>
>>108608861
You’re late, /hgg/ won the shill race >>>/h/8860915
>>
kek, bye
https://files.catbox.moe/uxtaqp.jpg
>>
>>108608876
It's weird they used a vision-capable text encoder like Ministral but didn't leverage it for any kind of edit capability
>>
>>108608918
they will >>108607521
>>
>>108608876
The best gauge isn't here but on CivitAI. When Z-Image, Anima, and Klein were released, there were instantly a ton of loras and finetunes. It seems Ernie's lab paid someone to shill here, like you mentioning Ernie again.
>>
>>108608634
>Do you have anything to back that up?
There are a ton of ai detectors online + any classifier model would do.
>>
>>108609050
I was wondering whether you were trolling or just regular retarded, thanks for the response.
>>
>>108609123
Learn more about the tech before sperging out in /ldg/ thanks
>>
>>108608824
>>108608861
ill wait for the noob tune........
>>
>>108607834
hasn't illustrious gone closed source? why would I give a shit, the last version they released was 2, wasn't it
>>
>>108607868
it's insane to say this when anima has the best backgrounds of all current anime models, NAI 4.5 included
>>
>>108608788
the undotted straight lined one gets filtered though
>>
>>108609123
until we can emulate the physics of reality, every ai image is theoretically detectable
you don't understand how image generation works if you are confused by this
>>
>>108609134
Start by taking your own advice midwit faggot.
>>108609437
There is a difference between "theoretically detectable" and "its practical and cost efficient to implement accurate and reliable detection for wide variety of different image generation models each with their own idiosyncratic quirks, that are getting increasingly subtle with each generation, and keep doing this as new models get released every week" but sure go on.
>>
File: int8.jpg (850.3 KB JPG)
>>108607786
The base works pretty well with int8. I haven't tested turbo though.
15322MB -> 8238MB, and 2.38s/it -> 1.4s/it on my 3090.
https://github.com/BobJohnson24/ComfyUI-INT8-Fast
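For reference, the memory win comes from storing weights as int8 plus a scale instead of fp16. A toy pure-Python sketch of per-tensor symmetric int8 quantization (this is the general idea, not the linked node's actual code, which operates on torch tensors):

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8: one fp scale + a list of int8 values."""
    scale = max(abs(w) for w in weights) / 127.0 or 1.0  # guard all-zero case
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate fp weights from int8 values and the scale."""
    return [v * scale for v in q]

w = [0.5, -1.27, 0.031, 1.27]
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# Max reconstruction error is bounded by half a quantization step.
err = max(abs(a - b) for a, b in zip(w, w_hat))
assert err <= s / 2 + 1e-12
print(q)  # the largest-magnitude weight maps to +/-127
```

Halving bytes-per-weight roughly matches the ~15GB -> ~8GB figure above; the speedup depends on the GPU having fast int8 kernels.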
>>
>>108609537
>int8
cope quant, we're all ada and blackwell chads here
>>
>>108609502
you know you are in an ai general and you can just train a classifier, right? the vast majority of slop gens is going to be SD1.5/SDXL/Flux/ChatGPT, and even a minimal effort to reduce those is going to be better than 99% of models
but the people that train those ultra-slopped models train on synthetic garbage on purpose to avoid copyright issues and morons having a melty about deepfakes/csam
on danbooru for example you are gonna get AT MOST like 10k untagged ai images out of several million
>>
>>108609502
it doesnt need to memorize or know every model, it just needs to detect synthetic patterns that arent seen in real images
>>
>>108609580
If you think a universal classifier is this easy, why not train it yourself? Why aren't there any such classifiers with non-meme accuracy out there, despite huge demand to filter out AI slop? Could it be more complex than reiterating that one word which stuck with you after you watched some oversimplified goyslop youtube video a year ago?
>but the people that train those ultra slopped models train on synthetic garbage on purpose to avoid copyright issues and morons having a melty about deepfakes/csam
This is true but:
a) Costs of processing real data are still a huge factor
b) It's irrelevant to the first point
>on danbooru for example you are gonna get AT MOST like 10k untagged ai images out of several million
This is true for making an anime tune of existing model but for training a model from scratch you need data from wide variety of contaminated sources and you run back to the curation/filtering problem.
>>108609601
It needs to know them because said synthetic patterns are different for every diffusion model out there.
>>
>>108609631
there is
https://thehive.ai/demos/ai-generated-content-detection
which is quite decent
why would you even train a generation model if you are too incompetent or broke to keep it from looking like synthetic dogshit?
>>
>>108604759
The Chinese always come out with really nice architectures, but they really can't into quality training data. Shame, the model had potential, but it's clearly slopped. It's very strange: thanks to the Flux 2 VAE, some photos look very realistic, while others don't look real at all. They likely used a mixture of slopped and real data, and it shows. Now I wait until BFL releases a model with similar capabilities.
>>
>>108609650
1) This thing seems to test each model individually (or at least under an umbrella), so it's not universal as claimed possible earlier, and it needs to be updated for each new model.
2) Even assuming it's accurate enough (won't spend hours testing), it's going to cost a metric ton of money to run many millions of images through it.
>>
>>108609706
the anima gens i tried out on it get classified as "other" ai just fine
convenient that you assert that a reliable detector cant be trained yet refuse to test it, clearly it is possible
it does not have to even be 100% reliable as reducing them would already would be a massive positive compared to everyone else that just trains on ai on purpose like i stated before
>>
Fresh when ready

>>108609718
>>108609718
>>108609718
