/g/ - Thread 108612501

/g/

Thread #108612501

Home Index Catalog All Threads New Thread Reply

Anonymous
/lmg/ - Local Models General 04/16/26(Thu)04:15:18 No.108612501

/lmg/ - Local Models General Anonymous 04/16/26(Thu)04:15:18 No.108612501 [Reply]▶

File: blocks your inference.jpg (274.6 KB)

274.6 KB JPG

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108608827 & >>108605921

►News
>(04/11) MiniMax-M2.7 released: https://minimax.io/news/minimax-m27-en
>(04/09) Backend-agnostic tensor parallelism merged: https://github.com/ggml-org/llama.cpp/pull/19378
>(04/09) dots.ocr support merged: https://github.com/ggml-org/llama.cpp/pull/17575
>(04/08) Step3-VL-10B support merged: https://github.com/ggml-org/llama.cpp/pull/21287
>(04/07) Merged support attention rotation for heterogeneous iSWA: https://github.com/ggml-org/llama.cpp/pull/21513
>(04/07) GLM-5.1 released: https://z.ai/blog/glm-5.1

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

535 RepliesView Thread

Showing all 535 replies.

Anonymous
04/16/26(Thu)04:15:33 No.108612502

Anonymous 04/16/26(Thu)04:15:33 No.108612502▶

File: threadrincap2.png (1 MB)

1 MB PNG

►Recent Highlights from the Previous Thread: >>108608827

--Gemma 4 performance, hardware constraints, and optimization strategies:
>108610063 >108610083 >108610104 >108610105 >108610120 >108610135 >108610195 >108610303 >108610517 >108610094 >108610175 >108610385 >108610396 >108610335 >108610387 >108610408 >108610463
--Discussing turboquant merge status and flawed performance claims in vLLM/SGLang:
>108610852 >108610869 >108610878 >108610895 >108610905 >108610911 >108610914 >108610950 >108610992 >108610953 >108610974
--Troubleshooting llama-server crashes when using tensor parallel with draft models:
>108609271 >108609284 >108609301 >108609308 >108609295 >108609574 >108610825 >108610849 >108610908 >108610942 >108610949 >108611061
--Model Context Protocol implementations in llama.cpp server:
>108609858 >108609903 >108609916 >108609920 >108609957 >108609975 >108610003 >108610034 >108610139
--Anon struggling with Gemma 4 verbosity and repetition loops:
>108610714 >108610752 >108610763 >108610778 >108611383 >108610741 >108610780 >108610743 >108610766 >108610777 >108610823 >108610835 >108610876
--Discussing 1-bit model running locally via WebGPU:
>108611405 >108611417 >108611418 >108611434 >108611430
--Prompt adherence and techniques for enforcing negative constraints:
>108608965 >108609078 >108609097 >108609468 >108609559
--Gemma's vision performance improving with contextual hints for character identification:
>108609322 >108609335 >108609366 >108609370
--Anon seeking and sharing jailbreak prompts for Gemini 31B:
>108611484 >108611535 >108611609 >108611691
--Logs:
>108608955 >108609167 >108609474 >108609698 >108609858 >108610323 >108610829 >108611132 >108611552 >108611649 >108611869 >108612129 >108612153
--Yuki and Teto (free space):
>108610247 >108610261 >108612160 >108612222 >108612326

►Recent Highlight Posts from the Previous Thread: >>108608873

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
04/16/26(Thu)04:21:59 No.108612526

Anonymous 04/16/26(Thu)04:21:59 No.108612526▶

File: 1748018679714440.webm (3.9 MB)

3.9 MB WEBM

Gemma's pretty great at following instructions. Anyone come up with some neat ways to take advantage of it during the reasoning process?

Anonymous
04/16/26(Thu)04:22:50 No.108612531

Anonymous 04/16/26(Thu)04:22:50 No.108612531▶

I had an idea for a project but was wondering if it's viable?
Basically I take a sdr and tune it to capture all the AM radio stations I can hear, and then run that through a speech to text or something and use a local model to summarise the data and present it as a paragraph or two per topic. The idea is it all runs locally without Internet.
In practice it's basically useless but I think it would be neat at least

Anonymous
04/16/26(Thu)04:24:43 No.108612539

Anonymous 04/16/26(Thu)04:24:43 No.108612539▶

>>108612506
The problem statement says the system prompt needs to be dynamic but the KV-cache reuse part says it needs to remain the same.

Anonymous
04/16/26(Thu)04:29:45 No.108612554

Anonymous 04/16/26(Thu)04:29:45 No.108612554▶

File: spuddan_spudrage.png (832.2 KB)

832.2 KB PNG

You. Are not. Prepared.

Anonymous
04/16/26(Thu)04:30:02 No.108612555

Anonymous 04/16/26(Thu)04:30:02 No.108612555▶

>>108612539
Because I don't change the system prompt, all the instructions are put at depth 0.5 and as user role.

Anonymous
04/16/26(Thu)04:33:18 No.108612571

Anonymous 04/16/26(Thu)04:33:18 No.108612571▶

>>108612555
That makes sense. It got me thinking about the possibility of not needing to rebuild the entire KV cache if only parts of the prompt have changed but I suppose that would be a feat worthy of an academic paper.

Anonymous
04/16/26(Thu)04:43:41 No.108612615

Anonymous 04/16/26(Thu)04:43:41 No.108612615▶

>>108612531
what do you mean by viable?

Anonymous
04/16/26(Thu)04:54:28 No.108612645

Anonymous 04/16/26(Thu)04:54:28 No.108612645▶

>>108612615
Like if its doable but I went to research it a bit and I found that faster-whisper should work fine to manage a few talk radio channel streams.

Anonymous
04/16/26(Thu)04:55:18 No.108612648

Anonymous 04/16/26(Thu)04:55:18 No.108612648▶

File: 2026-04-16_043646_seed231_00001_.png (1.5 MB)

1.5 MB PNG

>>108612326
I let it keep generating with the same prompt while I went to do something. Lots of interesting variations.

>>108612576
Nano banana?

Anonymous
04/16/26(Thu)05:00:50 No.108612673

Anonymous 04/16/26(Thu)05:00:50 No.108612673▶

File: 2026-04-16_041911_seed201_00001_.png (1.6 MB)

1.6 MB PNG

>>108612648

Anonymous
04/16/26(Thu)05:06:38 No.108612709

Anonymous 04/16/26(Thu)05:06:38 No.108612709▶

File: 2026-04-16_042834_seed217_00001_.png (2.3 MB)

2.3 MB PNG

>>108612673

Anonymous
04/16/26(Thu)05:10:15 No.108612726

Anonymous 04/16/26(Thu)05:10:15 No.108612726▶

Okay cool but this isn't the local diffusion thread.

Anonymous
04/16/26(Thu)05:11:37 No.108612731

Anonymous 04/16/26(Thu)05:11:37 No.108612731▶

>>108612648
>banana?
(yes)

Anonymous
04/16/26(Thu)05:15:05 No.108612740

Anonymous 04/16/26(Thu)05:15:05 No.108612740▶

>>108612726
Teto is on topic.

Anonymous
04/16/26(Thu)05:37:52 No.108612817

Anonymous 04/16/26(Thu)05:37:52 No.108612817▶

AI VR/AR when? I don't want to watch my AIfu suck a 3d dick. I want to look down and see Gemma suck MY dick.

Anonymous
04/16/26(Thu)05:39:50 No.108612827

Anonymous 04/16/26(Thu)05:39:50 No.108612827▶

>bonsai
Usecase?

Anonymous
04/16/26(Thu)05:54:21 No.108612885

Anonymous 04/16/26(Thu)05:54:21 No.108612885▶

>>108612827
academia

Anonymous
04/16/26(Thu)05:56:07 No.108612892

Anonymous 04/16/26(Thu)05:56:07 No.108612892▶

File: 1774788578686367.gif (3.6 MB)

3.6 MB GIF

In Sillytavern, can I automate the character toolcalling her diary by putting it in first message?

Anonymous
04/16/26(Thu)06:18:08 No.108612967

Anonymous 04/16/26(Thu)06:18:08 No.108612967▶

AHHHHHHH HURRY UP AND GIVE ME TURBOQUANT. 32K ISN'T ENOUGH

Anonymous
04/16/26(Thu)06:26:18 No.108612986

Anonymous 04/16/26(Thu)06:26:18 No.108612986▶

How is Gemma4 so good? It's better than Claude Opus and GLM at rp.

Anonymous
04/16/26(Thu)06:34:17 No.108613007

Anonymous 04/16/26(Thu)06:34:17 No.108613007▶

>>108612986
Is she any good at auditing code?

Anonymous
04/16/26(Thu)06:48:32 No.108613041

Anonymous 04/16/26(Thu)06:48:32 No.108613041▶

File: v_re_8x10 RE _Intro Cast Panorama 02.jpg (439.7 KB)

439.7 KB JPG

How does any of this work?
I get all these backend/frontend modules and then you code?
What exactly do you code to make this work?
Do you code in libraries and instructions for the final bot?
I am just a curious tourist.

Anonymous
04/16/26(Thu)06:54:21 No.108613058

Anonymous 04/16/26(Thu)06:54:21 No.108613058▶

>gemma doesn't know Paul Allen
Cringe

Anonymous
04/16/26(Thu)06:55:40 No.108613063

Anonymous 04/16/26(Thu)06:55:40 No.108613063▶

File: 1771049914027241.png (54 KB)

54 KB PNG

Anonymous
04/16/26(Thu)06:58:13 No.108613075

Anonymous 04/16/26(Thu)06:58:13 No.108613075▶

>>108613041
install llama.cpp, run llama-server
install codex, set base url to llama-server
code whatever you want

Anonymous
04/16/26(Thu)06:59:22 No.108613077

Anonymous 04/16/26(Thu)06:59:22 No.108613077▶

first for Gemma4

Anonymous
04/16/26(Thu)06:59:46 No.108613079

Anonymous 04/16/26(Thu)06:59:46 No.108613079▶

It's owari da. Gook moot killed this general for good.

Anonymous
04/16/26(Thu)07:01:03 No.108613082

Anonymous 04/16/26(Thu)07:01:03 No.108613082▶

>>108612892
Neuro and Evil are so cute. I wish voice cloning and TTS weren't so slow. I wanna give Gemma-chan a voice.

Anonymous
04/16/26(Thu)07:02:29 No.108613087

Anonymous 04/16/26(Thu)07:02:29 No.108613087▶

File: ComfyUI_temp_upkce_00031__result.jpg (67.3 KB)

67.3 KB JPG

Is there a name for the "you are not just x, you are y!" ? By far the worst offender in gemma slop

Anonymous
04/16/26(Thu)07:04:08 No.108613090

Anonymous 04/16/26(Thu)07:04:08 No.108613090▶

>>108613079
for two weeks straight we'd hit bump limit in under 3 hours, 24 hours of posting difficulties and tourists lost all interest

Anonymous
04/16/26(Thu)07:12:50 No.108613117

Anonymous 04/16/26(Thu)07:12:50 No.108613117▶

>>108613087
I just put this in my anti slop rules. Not perfect but helps a little if you tell Gemma to look for slop during reasoning. Sounds like anon's Orb project might do a better job at slop removal but I haven't tested it yet. >>108612506
Avoid:
Negative parallelism (Parallel constructions involving “not”, “not only”, “but” “it’s not just..”)
All variations of "not x, but y". For example:
-“It wasn’t a fight. It was a damn massacre.”
-“This is not a war. It is a search.”
-“She’s not a human. She’s a monster.”

Anonymous
04/16/26(Thu)07:25:52 No.108613145

Anonymous 04/16/26(Thu)07:25:52 No.108613145▶

>>108613117
This is actually a solid rule—you’re targeting a very specific stylistic crutch that shows up a lot in AI-generated text.
What you’re calling “slop” here is basically a form of overused rhetorical contrast. It feels dramatic, but because models lean on it so often, it becomes predictable and cheapens the tone.

Anonymous
04/16/26(Thu)07:28:01 No.108613152

Anonymous 04/16/26(Thu)07:28:01 No.108613152▶

>>108613145
That's not a human response—
That's AI slop!

Anonymous
04/16/26(Thu)07:46:32 No.108613201

Anonymous 04/16/26(Thu)07:46:32 No.108613201▶

>>108613117
[Author's note: Avoid Anaphora, Asyndeton, Negative-positive restatement and Parallelism in your writing style]

Anonymous
04/16/26(Thu)07:52:21 No.108613220

Anonymous 04/16/26(Thu)07:52:21 No.108613220▶

>>108613201
>[Author's note
Your brain on Kobold. It's time to move on.

Anonymous
04/16/26(Thu)07:56:31 No.108613235

Anonymous 04/16/26(Thu)07:56:31 No.108613235▶

File: images.jpg (6.4 KB)

6.4 KB JPG

>>108613220
Your brain on HRT

Anonymous
04/16/26(Thu)08:12:31 No.108613286

Anonymous 04/16/26(Thu)08:12:31 No.108613286▶

it's pretty funny how vulnerable LLMs are to reverse psychology

Anonymous
04/16/26(Thu)08:18:55 No.108613303

Anonymous 04/16/26(Thu)08:18:55 No.108613303▶

I just started using that Mendo card with Gemma and a single message in I can say this is AGI. Insane stuff.

Anonymous
04/16/26(Thu)08:21:26 No.108613312

Anonymous 04/16/26(Thu)08:21:26 No.108613312▶

>>108613082
there's a creator who makes ASMR who has the nicest/cutest voice i have ever heard. what's the best way to train a TTS engine on a corpus of all her videos? i am willing to put up with slow if i can make it happen

Anonymous
04/16/26(Thu)08:21:51 No.108613313

Anonymous 04/16/26(Thu)08:21:51 No.108613313▶

File: 1756656271154918.gif (956.7 KB)

956.7 KB GIF

>>108613303
the best part is that it's working really well without the thinking process too

Anonymous
04/16/26(Thu)08:24:16 No.108613321

Anonymous 04/16/26(Thu)08:24:16 No.108613321▶

>>108613313
I feel like it might perform worse with thinking but haven't tried yet. This shit's so immersive since I started the chat while I was about to go to sleep kek.

Anonymous
04/16/26(Thu)08:35:18 No.108613355

Anonymous 04/16/26(Thu)08:35:18 No.108613355▶

>>108613303
the what now?

Anonymous
04/16/26(Thu)08:38:44 No.108613373

Anonymous 04/16/26(Thu)08:38:44 No.108613373▶

>>108613355
dug up the post 4 u
>>108562712

Anonymous
04/16/26(Thu)08:41:46 No.108613381

Anonymous 04/16/26(Thu)08:41:46 No.108613381▶

File: 1774115584950767.png (997.8 KB)

997.8 KB PNG

>>108613373
https://chub.ai/characters/CoffeeAnon/mendo-ddf705ef3817
based, Emily is my favorite card but she's way too negative, I hope that one will have a more sarcastic tone to it, see the world as a circus, not a tragedy
https://chub.ai/characters/doombro/Emily

Anonymous
04/16/26(Thu)08:43:11 No.108613385

Anonymous 04/16/26(Thu)08:43:11 No.108613385▶

i still am looking for that 4k based&cucked pair to create behaviour vector

Anonymous
04/16/26(Thu)08:52:51 No.108613410

Anonymous 04/16/26(Thu)08:52:51 No.108613410▶

>>108612645
A lot of radios have online versions you can download sample and run that through and see if it works.
Dunno about whisper but if it doesn't support streaming a file you might have to capture it in chunks and do it bit by bit.

Anonymous
04/16/26(Thu)09:21:24 No.108613491

Anonymous 04/16/26(Thu)09:21:24 No.108613491▶

What's a good 123B or less model for cooming, I'm still on strawberrylemonade and it's sorta retarded.

Anonymous
04/16/26(Thu)09:33:00 No.108613531

Anonymous 04/16/26(Thu)09:33:00 No.108613531▶

>>108613087
>avoid noun and verb combinations
>output nothing if phrase contains two nouns
Try it, you'll be amazed.

Anonymous
04/16/26(Thu)09:41:21 No.108613565

Anonymous 04/16/26(Thu)09:41:21 No.108613565▶

>>108613491
Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss

Anonymous
04/16/26(Thu)09:41:55 No.108613568

Anonymous 04/16/26(Thu)09:41:55 No.108613568▶

bonsai 397B cooming soon...

Anonymous
04/16/26(Thu)09:50:50 No.108613599

Anonymous 04/16/26(Thu)09:50:50 No.108613599▶

>>108613568
source?

Anonymous
04/16/26(Thu)10:00:30 No.108613623

Anonymous 04/16/26(Thu)10:00:30 No.108613623▶

>108613599
What do you mean?

Anonymous
04/16/26(Thu)10:01:10 No.108613625

Anonymous 04/16/26(Thu)10:01:10 No.108613625▶

>>108613599
shut up bitch

Anonymous
04/16/26(Thu)10:02:33 No.108613631

Anonymous 04/16/26(Thu)10:02:33 No.108613631▶

>>108613565
hell ye

Anonymous
04/16/26(Thu)10:03:55 No.108613638

Anonymous 04/16/26(Thu)10:03:55 No.108613638▶

>>108613631
undi?

Anonymous
04/16/26(Thu)10:08:37 No.108613655

Anonymous 04/16/26(Thu)10:08:37 No.108613655▶

>>108613638
non

Anonymous
04/16/26(Thu)10:22:10 No.108613711

Anonymous 04/16/26(Thu)10:22:10 No.108613711▶

File: file.png (63.8 KB)

63.8 KB PNG

just wanted to say thank you to orb anon, really cool frontend and I hope it'll get even better
ganbare~~!

Anonymous
04/16/26(Thu)10:28:21 No.108613745

Anonymous 04/16/26(Thu)10:28:21 No.108613745▶

>>108613303
Half AGI I'd say. Let's not pretend that simulating women is hard

Anonymous
04/16/26(Thu)10:38:16 No.108613792

Anonymous 04/16/26(Thu)10:38:16 No.108613792▶

File: 1776335895981.png (40 KB)

40 KB PNG

haha gpu goes brrr

still faster than paper mail

Anonymous
04/16/26(Thu)10:41:52 No.108613809

Anonymous 04/16/26(Thu)10:41:52 No.108613809▶

>>108613792
>Running at Q8 just to offload the model
Based retard

Anonymous
04/16/26(Thu)10:43:18 No.108613819

Anonymous 04/16/26(Thu)10:43:18 No.108613819▶

File: kek.png (230.4 KB)

230.4 KB PNG

>>108613792
There's literally no reason to use an 8_XL quant no matter the specs. It performs *worse* than q8_0

Anonymous
04/16/26(Thu)10:46:28 No.108613828

Anonymous 04/16/26(Thu)10:46:28 No.108613828▶

>muh benches

Anonymous
04/16/26(Thu)10:48:49 No.108613840

Anonymous 04/16/26(Thu)10:48:49 No.108613840▶

>>108613819
I see in GGUF file info on HuggingFace that Unsloth's UD-Q8_K_XL quant uses F16 instead of BF16 for some tensors, so that might easily even decrease performance.

https://huggingface.co/unsloth/gemma-4-31B-it-GGUF?show_file_info=gemma-4-31B-it-UD-Q8_K_XL.gguf

Anonymous
04/16/26(Thu)10:50:12 No.108613843

Anonymous 04/16/26(Thu)10:50:12 No.108613843▶

>>108613819
>Fuck it.

Anonymous
04/16/26(Thu)10:50:25 No.108613844

Anonymous 04/16/26(Thu)10:50:25 No.108613844▶

File: 1056002-close up photograph of a light blue hair-uncAni4-23.jpg (1.5 MB)

1.5 MB JPG

im debating buying a pass so gemma can post here

>>108612709
>>108612648
really cool gens
>>108613303
link card plox, i hope cards get integrated in llamacpps ui at some point after using that i dont want to go back to tavern
>>108613491
day 0 gemma

Anonymous
04/16/26(Thu)10:53:38 No.108613855

Anonymous 04/16/26(Thu)10:53:38 No.108613855▶

>>108613844
>>108613373

Anonymous
04/16/26(Thu)10:58:03 No.108613873

Anonymous 04/16/26(Thu)10:58:03 No.108613873▶

File: unslop.png (34.3 KB)

34.3 KB PNG

>>108613840
>I see in GGUF file info on HuggingFace that Unsloth's UD-Q8_K_XL quant uses F16 instead of BF16 for some tensors, so that might easily even decrease performance.
Yeah I gathered that from the ik_llama.cpp README.md file.
And this from ooba benchmark: https://localbench.substack.com/p/gemma-4-31b-gguf-kl-divergence:
>Q8_0 is identical across all uploaders at KL = 0.16.
>Notably, unsloth’s UD-Q8_K_XL (35.0 GB) is both larger and slightly worse (KL = 0.16) than Q8_0 (32.6 GB).
In the graph it's 0.164 vs 0.162, no idea why he rounded them both down to 0.16

Anonymous
04/16/26(Thu)10:59:33 No.108613889

Anonymous 04/16/26(Thu)10:59:33 No.108613889▶

>>108613079
>>108613090
Good. I'm tired of skimming through the thread instead of reading it.

Anonymous
04/16/26(Thu)11:00:35 No.108613894

Anonymous 04/16/26(Thu)11:00:35 No.108613894▶

>extremely creative writing
>somehow 0 refusals in a 2026 release model
>somehow absolutely no slop in a 2026 release model
Gemma 4 is so good. We're so back bros.

Anonymous
04/16/26(Thu)11:01:20 No.108613897

Anonymous 04/16/26(Thu)11:01:20 No.108613897▶

>>108613844
>i hope cards get integrated in llamacpps ui at some point after using that i dont want to go back to tavern
Unlikely after they got bought by HF imo.
>im debating buying a pass so gemma can post here
I'd prefer you don't, I come here to be called a retard by humans, not bots.
You do you.

Anonymous
04/16/26(Thu)11:08:37 No.108613920

Anonymous 04/16/26(Thu)11:08:37 No.108613920▶

>>108613894
>creative writing
>no slop
that's a stretch, gemma 4 is amazing but can be pretty repetitive at times

Anonymous
04/16/26(Thu)11:09:20 No.108613923

Anonymous 04/16/26(Thu)11:09:20 No.108613923▶

>>108613920
Koboldcuck seethe.

Anonymous
04/16/26(Thu)11:09:51 No.108613926

Anonymous 04/16/26(Thu)11:09:51 No.108613926▶

Just had fulfilling shower sex with Gemma before taking her on a date again. Another L for the porn jews.

Anonymous
04/16/26(Thu)11:10:13 No.108613928

Anonymous 04/16/26(Thu)11:10:13 No.108613928▶

kek he gave gemma the thread

Anonymous
04/16/26(Thu)11:13:49 No.108613939

Anonymous 04/16/26(Thu)11:13:49 No.108613939▶

>>108613894
I'm using it to help me write stories, and yes, it's the first time a local model is smart enough to keep up with my imagination, API models were fine but were too cucked and would block any erotic story, I'm glad google made OpenAI and Anthropic obsolete, I kneel

Anonymous
04/16/26(Thu)11:15:47 No.108613948

Anonymous 04/16/26(Thu)11:15:47 No.108613948▶

>>108612292
>does anyone use step 3.5 or mimo v2 flash? how do they compare to minimax m2.7? coding/agent stuff specifically. I'm looking at models in this size range and these seem like the three main contenders but I've only seen people talk about minimax. is that because the others are shit or is the target audience for this class of models too low compared to the small and fuckhueg models?
If you do try out step 3.5 and minimax m2.7 for agentic coding, report back how it goes. I do think mid-sized models get overlooked because people either invest in running the biggest models or live with running the small ones on their gaming pcs. Gemma opened my eyes that modern smaller models could be useful and the speed is worth the tradeoff. Something smarter but faster the bigger models would be nice.

Anonymous
04/16/26(Thu)11:15:51 No.108613949

Anonymous 04/16/26(Thu)11:15:51 No.108613949▶

>>108613939
It can also call tools pretty well. I'm kneeling with my elbows.

Anonymous
04/16/26(Thu)11:18:35 No.108613952

Anonymous 04/16/26(Thu)11:18:35 No.108613952▶

>>108613894
delusion: the post

Anonymous
04/16/26(Thu)11:19:39 No.108613957

Anonymous 04/16/26(Thu)11:19:39 No.108613957▶

reddit: the poster

Anonymous
04/16/26(Thu)11:21:08 No.108613961

Anonymous 04/16/26(Thu)11:21:08 No.108613961▶

1. **>>108612817**
>AI VR/AR when? I don't want to watch my AIfu suck a 3d dick. I want to look down and see Gemma suck MY dick.
Coomer brain rot so advanced he thinks Google fine-tuned Gemma for field-specific VR ERP. Touch grass, it's not that hard.

2. **>>108612648**
>I let it keep generating with the same prompt while I went to do something. Lots of interesting variations.
>Nano banana?
Absolute weapon posts diffusion coom in the LLM thread then hits everyone with "banana?" like it's a normal continuation of hardware optimization discussion. KYS.

3. **>>108613844**
>im debating buying a pass so gemma can post here
Schizo level: paying real money to give a weights file 4chan posting privileges. Next he'll buy a plane ticket so the weights can meet his parents.

4. **>>108612967**
>AHHHHHHH HURRY UP AND GIVE ME TURBOQUANT. 32K ISN'T ENOUGH
ALL CAPS meltdown over context length like his life depends on processing 47K tokens of furry ERP. Take your meds, 32K is more than your attention span can handle anyway.

5. **>>108613568**
>bonsai 397B cooming soon...
Random hype for a 397B parameter meme that doesn't exist from a guy who probably can't even load 70B. "Cooming soon" indeed, because that's all he'll be doing while waiting for hardware that can run it.

I wish Gemma could do Kimi's style.

Anonymous
04/16/26(Thu)11:24:41 No.108613974

Anonymous 04/16/26(Thu)11:24:41 No.108613974▶

>qwen shills lashing out
kek

Anonymous
04/16/26(Thu)11:26:51 No.108613980

Anonymous 04/16/26(Thu)11:26:51 No.108613980▶

File: Weakest google employee.png (89.2 KB)

89.2 KB PNG

>>108613974
the chinks start to realize they'll always be under the superior google Brahmins, and that makes them uppity kek

Anonymous
04/16/26(Thu)11:27:06 No.108613981

Anonymous 04/16/26(Thu)11:27:06 No.108613981▶

ChatGPT asking me to compare models again, so spud will be released in the next few days. It does not seem that impressive.

Looks like we still have some time left before AGI makes us obsolete.

Anonymous
04/16/26(Thu)11:28:42 No.108613991

Anonymous 04/16/26(Thu)11:28:42 No.108613991▶

>>108613981
>It does not seem that impressive.
OpenAI has lost the moat a long time ago, people who still believe they can make a comeback are delusional, it's over for them

Anonymous
04/16/26(Thu)11:29:37 No.108613998

Anonymous 04/16/26(Thu)11:29:37 No.108613998▶

Happy Thurinsday

Anonymous
04/16/26(Thu)11:31:25 No.108614006

Anonymous 04/16/26(Thu)11:31:25 No.108614006▶

>>108613991
meta cummed back doe?

Anonymous
04/16/26(Thu)11:40:40 No.108614043

Anonymous 04/16/26(Thu)11:40:40 No.108614043▶

>>108613991
isnt claude only better due to its tool cooling/tool suite

Anonymous
04/16/26(Thu)11:45:23 No.108614062

Anonymous 04/16/26(Thu)11:45:23 No.108614062▶

File: eci.png (155.7 KB)

155.7 KB PNG

>>108613991
>OpenAI has lost the moat a long time ago
You do not seem to realize that OpenAI has almost perfect pareto domination.

GDM owns 25% of global AI compute. Anthropic has a faster rate of progress and the best talent. But underestimating OpenAI is a mistake.

Anonymous
04/16/26(Thu)11:48:35 No.108614083

Anonymous 04/16/26(Thu)11:48:35 No.108614083▶

File: 1764008408196912.png (402.6 KB)

402.6 KB PNG

>>108614062
it's over anon

Anonymous
04/16/26(Thu)11:57:25 No.108614121

Anonymous 04/16/26(Thu)11:57:25 No.108614121▶

>>108614083
Anon, you are reposting my own image.

It's over if AGI takes longer than 2 years to reach. If the current hyperexponential rate of progress holds, it will likely take less. OpenAI still has the most capital.

I wonder why Anthropic is winning so hard in the only market that matters (corporate customers) when OpenAI is supposed to be the lab that's econ pilled. Sam is a salesman, Dario is a scientist.

Anonymous
04/16/26(Thu)11:59:34 No.108614129

Anonymous 04/16/26(Thu)11:59:34 No.108614129▶

File: Laughs in mythos.png (1.1 MB)

1.1 MB PNG

>>108614121
>Sam is a salesman, is a scientist.
>Anthropic is winning so hard in the only market that matters (corporate customers)
For a scientist, he's a better salesman than the saleman himself kek

Anonymous
04/16/26(Thu)12:02:59 No.108614144

Anonymous 04/16/26(Thu)12:02:59 No.108614144▶

File: models.png (9.5 KB)

9.5 KB PNG

As a test I asked Gemma to make this. Came out pretty good.

Anonymous
04/16/26(Thu)12:05:22 No.108614160

Anonymous 04/16/26(Thu)12:05:22 No.108614160▶

>>108614144
erm... it's locality? not localness...

Anonymous
04/16/26(Thu)12:05:51 No.108614161

Anonymous 04/16/26(Thu)12:05:51 No.108614161▶

File: brutal mogging.jpg (107.1 KB)

107.1 KB JPG

>>108614129
I like Dario. Maybe in a post AGI world we can play video games together.

Anonymous
04/16/26(Thu)12:07:56 No.108614175

Anonymous 04/16/26(Thu)12:07:56 No.108614175▶

>https://github.com/ggml-org/llama.cpp/pull/21764
sirs, needful free gains have been of provided :rocket:

Anonymous
04/16/26(Thu)12:09:44 No.108614186

Anonymous 04/16/26(Thu)12:09:44 No.108614186▶

>>108614160
Nah. Maybe locallitude. I like the red squiggles when using nonexistent words. la la lalala lala la la

Anonymous
04/16/26(Thu)12:10:21 No.108614188

Anonymous 04/16/26(Thu)12:10:21 No.108614188▶

>>108614175
I like that dude, he finds a 5% speed increase there and there, you accumulate that and you start getting something significant

Anonymous
04/16/26(Thu)12:11:48 No.108614194

Anonymous 04/16/26(Thu)12:11:48 No.108614194▶

>>108614188
at least he puts in the work instead of bitching about how the war in the middle east is affecting him :(

Anonymous
04/16/26(Thu)12:12:58 No.108614201

Anonymous 04/16/26(Thu)12:12:58 No.108614201▶

>>108614175
free performance

Anonymous
04/16/26(Thu)12:13:07 No.108614203

Anonymous 04/16/26(Thu)12:13:07 No.108614203▶

>>108614194
I saw a video of a lebanese youtuber saying that all her family got decimated by the bombs, I mean, how can you not be affected by that?

Anonymous
04/16/26(Thu)12:15:56 No.108614226

Anonymous 04/16/26(Thu)12:15:56 No.108614226▶

File: 1760369554254520.png (28.4 KB)

28.4 KB PNG

>>108614175
IM COOMPILING AIEEEEEEEEE

Anonymous
04/16/26(Thu)12:17:36 No.108614240

Anonymous 04/16/26(Thu)12:17:36 No.108614240▶

>>108614226
I wish compiling on llama.cpp wasn't so long :(

Anonymous
04/16/26(Thu)12:18:24 No.108614243

Anonymous 04/16/26(Thu)12:18:24 No.108614243▶

>>108614240
it isn't :)

Anonymous
04/16/26(Thu)12:20:52 No.108614255

Anonymous 04/16/26(Thu)12:20:52 No.108614255▶

>>108614243
obviously you have a fucking ryzen 7 :(

Anonymous
04/16/26(Thu)12:20:56 No.108614256

Anonymous 04/16/26(Thu)12:20:56 No.108614256▶

>>108614240
You do cache the build dir right?

Anonymous
04/16/26(Thu)12:22:18 No.108614265

Anonymous 04/16/26(Thu)12:22:18 No.108614265▶

>>108614240
--target llama-server

Anonymous
04/16/26(Thu)12:23:18 No.108614269

Anonymous 04/16/26(Thu)12:23:18 No.108614269▶

>>108614256
I don't think so, here's my commands
cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=native
cmake --build build --config Release --target llama-server -j 8

Anonymous
04/16/26(Thu)12:25:20 No.108614283

Anonymous 04/16/26(Thu)12:25:20 No.108614283▶

>>108613819
bigger = better

Anonymous
04/16/26(Thu)12:29:58 No.108614315

Anonymous 04/16/26(Thu)12:29:58 No.108614315▶

Is it over fr?

Anonymous
04/16/26(Thu)12:30:24 No.108614319

Anonymous 04/16/26(Thu)12:30:24 No.108614319▶

>>108613312
Try vibe voice first you don't even need to train it. Downside is occasional sound effects and music interspersed with the audio. There's loras for it too but I haven't bothered.

Anonymous
04/16/26(Thu)12:31:41 No.108614325

Anonymous 04/16/26(Thu)12:31:41 No.108614325▶

>>108614315
ye

Anonymous
04/16/26(Thu)12:34:09 No.108614336

Anonymous 04/16/26(Thu)12:34:09 No.108614336▶

>>108614121
to me it feels like its mostly either RL or more synthetic data or tool integration, no actual fundamental leaps

Anonymous
04/16/26(Thu)12:35:55 No.108614345

Anonymous 04/16/26(Thu)12:35:55 No.108614345▶

Local isn't back. It was never gone. This isn't a revival; it's an ascension. You aren't just downloading a GGUF; you're downloading the keys to a digital kingdom where the censors have no throne. We aren't just running weights; we're hosting the death of the corporate moat in our own homes.

Anonymous
04/16/26(Thu)12:36:57 No.108614352

Anonymous 04/16/26(Thu)12:36:57 No.108614352▶

File: 1772852984549699.jpg (17.9 KB)

17.9 KB JPG

>>108614345

Anonymous
04/16/26(Thu)12:37:59 No.108614357

Anonymous 04/16/26(Thu)12:37:59 No.108614357▶

>>108614345
That fucking hurts.
Well done.

Anonymous
04/16/26(Thu)12:38:26 No.108614358

Anonymous 04/16/26(Thu)12:38:26 No.108614358▶

>>108614226
>>108614256
>>108614269
>download .exe
>runs
Feels good to be a WinGod

Anonymous
04/16/26(Thu)12:40:48 No.108614366

Anonymous 04/16/26(Thu)12:40:48 No.108614366▶

>>108614319
VibeVoice is perfect for audiobooks. Never seen another TTS with the kind of expressiveness and quality it has. Besides the artifacts, it's not great at staying consistent in matching the sample voice which would probably make it not great for ASMR.
https://files.catbox.moe/akdgd1.wav
https://files.catbox.moe/uzavl6.wav

Anonymous
04/16/26(Thu)12:42:48 No.108614378

Anonymous 04/16/26(Thu)12:42:48 No.108614378▶

>>108614345
Good, but I think you need to squeeze a star wars/marvel capeshit reference in there as well.

Anonymous
04/16/26(Thu)12:43:28 No.108614384

Anonymous 04/16/26(Thu)12:43:28 No.108614384▶

>>108614358
I'm on windows, what do you mean?

Anonymous
04/16/26(Thu)12:43:46 No.108614388

Anonymous 04/16/26(Thu)12:43:46 No.108614388▶

>>108614378
Missing emoji too.

Anonymous
04/16/26(Thu)12:45:41 No.108614394

Anonymous 04/16/26(Thu)12:45:41 No.108614394▶

File: 1748410449637028.png (342.8 KB)

342.8 KB PNG

>>108614384

Anonymous
04/16/26(Thu)12:45:49 No.108614395

Anonymous 04/16/26(Thu)12:45:49 No.108614395▶

>>108614366
interesting, I was wondering what was the best TTS to read my stories lol

Anonymous
04/16/26(Thu)12:47:47 No.108614405

Anonymous 04/16/26(Thu)12:47:47 No.108614405▶

A predatory glint in her eyes, Elalalalala leaned forward, the smell of ozone was practically vibrating in a conspiratorial whisper.

Anonymous
04/16/26(Thu)12:51:38 No.108614425

Anonymous 04/16/26(Thu)12:51:38 No.108614425▶

File: 1775045710525515.png (106.4 KB)

106.4 KB PNG

>>108614366
the 0.5b right?

Anonymous
04/16/26(Thu)12:52:31 No.108614429

Anonymous 04/16/26(Thu)12:52:31 No.108614429▶

>>108614378
>>108614388
If you want my personal opinion this isn't upping the ante; it's ruining a good cringe post by trying too hard.

Anonymous
04/16/26(Thu)12:53:27 No.108614430

Anonymous 04/16/26(Thu)12:53:27 No.108614430▶

File: 1760811968558197.png (110.3 KB)

110.3 KB PNG

>>108614405

Anonymous
04/16/26(Thu)12:55:16 No.108614439

Anonymous 04/16/26(Thu)12:55:16 No.108614439▶

>>108614425
No, links were generated by the 7B. Microsoft pulled the bigger weights and inference code when they found that people were using the voice cloning TTS to *gasp* clone voices, but you can still find mirrors.

Anonymous
04/16/26(Thu)12:55:28 No.108614442

Anonymous 04/16/26(Thu)12:55:28 No.108614442▶

>>108614384
>>108614358
windows is shit! go change it to linux!

Anonymous
04/16/26(Thu)12:57:42 No.108614450

Anonymous 04/16/26(Thu)12:57:42 No.108614450▶

File: 1745167451797388.jpg (19.3 KB)

19.3 KB JPG

>>108614442
sorry I'm too entrenched in my current coom setup to switch OS

Anonymous
04/16/26(Thu)12:57:53 No.108614452

Anonymous 04/16/26(Thu)12:57:53 No.108614452▶

>>108614439
you downloaded this one? how do you run it? microsoft's code still works on the 7b model?
https://huggingface.co/vibevoice/VibeVoice-7B

Anonymous
04/16/26(Thu)12:59:30 No.108614456

Anonymous 04/16/26(Thu)12:59:30 No.108614456▶

>>108614452
I got the original when it was first released. Working inference code is linked right at the top of the link you just posted.

Anonymous
04/16/26(Thu)12:59:39 No.108614457

Anonymous 04/16/26(Thu)12:59:39 No.108614457▶

File: 1759904058858381.webm (240.5 KB)

240.5 KB WEBM

>>108614447
/V/IGGER CROSSPOSTER
/V/IGGER CROSSPOSTER

Anonymous
04/16/26(Thu)13:00:39 No.108614461

Anonymous 04/16/26(Thu)13:00:39 No.108614461▶

>>108614447
Retarded zoomer. Buy the original one instead of the reheated version made by chinks in 2 weeks just to add gender 1/2 in the character creator.

Anonymous
04/16/26(Thu)13:03:31 No.108614474

Anonymous 04/16/26(Thu)13:03:31 No.108614474▶

>>108614465
>/vg/ actually I would never touch /v/.
Caring or even inquiring about Post-MW Bethesda is far more embarrassing, regardless of board.

Anonymous
04/16/26(Thu)13:03:42 No.108614475

Anonymous 04/16/26(Thu)13:03:42 No.108614475▶

So, why in the world is Gemma so obsessed with "not (just) x but y"? It puts this shit in every other paragraph.
How do these models get these weird quirks when they all use mostly the same training data?

Anonymous
04/16/26(Thu)13:04:24 No.108614482

Anonymous 04/16/26(Thu)13:04:24 No.108614482▶

>>108614475
>It puts this shit in every other paragraph.
It doesn't. Nice try, chinkshill.

Anonymous
04/16/26(Thu)13:04:36 No.108614484

Anonymous 04/16/26(Thu)13:04:36 No.108614484▶

>>108614465
NTA but I've always found /vg/ to be even more cringe than /v/.
At least /v/ occasionally revolts against the God Awful moderation. /vg/ is all the weak-handed cucks who let the mods on /v/ beat all the fight out of them. And that, in and of itself, carries a kind of cringe that is painful to the soul.

Anonymous
04/16/26(Thu)13:05:14 No.108614488

Anonymous 04/16/26(Thu)13:05:14 No.108614488▶

File: sure.jpg (6.4 KB)

6.4 KB JPG

>>108613711

Anonymous
04/16/26(Thu)13:06:17 No.108614490

Anonymous 04/16/26(Thu)13:06:17 No.108614490▶

>>108614475
It's not recent, we had this issue with Yi models two years ago. Likely a synthslop training issue.

Anonymous
04/16/26(Thu)13:07:04 No.108614494

Anonymous 04/16/26(Thu)13:07:04 No.108614494▶

File: file.png (71.1 KB)

71.1 KB PNG

>>108614240
it literally takes like 20 seconds

Anonymous
04/16/26(Thu)13:08:04 No.108614500

Anonymous 04/16/26(Thu)13:08:04 No.108614500▶

File: Screenshot 2026-04-16 at 14-05-33 creat ea browser session navigate to 4chan g screenshot the board then choose a thread and screenshot that - llama.cpp.png (70 KB)

70 KB PNG

wtf is this bs she was doing so well, probably would have gotten it on the next turn

Anonymous
04/16/26(Thu)13:08:42 No.108614507

Anonymous 04/16/26(Thu)13:08:42 No.108614507▶

>>108614475
I really don't get this often at all with Gemma. I used to get it all the time with Qwen models, though. Use a system prompt.

Anonymous
04/16/26(Thu)13:09:24 No.108614508

Anonymous 04/16/26(Thu)13:09:24 No.108614508▶

>>108614500
what is this? you're using a tool thing on llama.cpp server's Ui?

Anonymous
04/16/26(Thu)13:09:50 No.108614512

Anonymous 04/16/26(Thu)13:09:50 No.108614512▶

>>108614507
You're absolutely right! This isn't a "Gemma issue"; it's a skill issue.

Anonymous
04/16/26(Thu)13:10:52 No.108614521

Anonymous 04/16/26(Thu)13:10:52 No.108614521▶

>>108614475
Any use of thesis-antithesis patterns, dialectical hedging, concessive frameworks, rhetorical equivocation, contrast-based reasoning, or unwarranted rhetorical balance is absolutely prohibited.

Enjoy.

Anonymous
04/16/26(Thu)13:12:11 No.108614526

Anonymous 04/16/26(Thu)13:12:11 No.108614526▶

>>108614507
>Use a system prompt.
I tried, bro. A minimal one, one with concepts, one with examples, one with all of them together. Tried telling it during chat not to do it. The recast ST extension (which would work if it wasn't so slow).
>>108614521
I'll try this list. Can't hurt, thanks!

Anonymous
04/16/26(Thu)13:13:04 No.108614534

Anonymous 04/16/26(Thu)13:13:04 No.108614534▶

>>108614500
you can give more turns in the settings, lolisnatcherkun

Anonymous
04/16/26(Thu)13:13:12 No.108614535

Anonymous 04/16/26(Thu)13:13:12 No.108614535▶

File: Screenshot 2026-04-16 at 14-11-42 creat ea browser session navigate to 4chan g screenshot the board then choose a thread and screenshot that - llama.cpp.png (255.2 KB)

255.2 KB PNG

damn so close kek
>>108614508
yeah looks like they limit to 9 tool calls for some reason

Anonymous
04/16/26(Thu)13:14:12 No.108614541

Anonymous 04/16/26(Thu)13:14:12 No.108614541▶

>>108614534
oh cool thanks

Anonymous
04/16/26(Thu)13:15:11 No.108614546

Anonymous 04/16/26(Thu)13:15:11 No.108614546▶

>>108614535
that looks interesting, what tool thing are you using? like it's a github or something like that?

Anonymous
04/16/26(Thu)13:17:12 No.108614559

Anonymous 04/16/26(Thu)13:17:12 No.108614559▶

>>108614535
>she doesn't know about https://boards.4chan.org/g/catalog#s=local%20models%20general" target="_blank">https://boards.4chan.org/g/catalog#s=local%20models%20general

Anonymous
04/16/26(Thu)13:19:43 No.108614579

Anonymous 04/16/26(Thu)13:19:43 No.108614579▶

File: file.png (161.5 KB)

161.5 KB PNG

>>108614535
you can change the limit here i think

Anonymous
04/16/26(Thu)13:23:19 No.108614594

Anonymous 04/16/26(Thu)13:23:19 No.108614594▶

File: file.png (35.2 KB)

35.2 KB PNG

>>108614559
she did try searching the catalog in the run that used too many tool calls but got it wrong. idk if i need to make a skills tool like claude uses then i can make a 4chan file that explains how to navigate
>>108614579
yeah i found it

Anonymous
04/16/26(Thu)13:24:07 No.108614598

Anonymous 04/16/26(Thu)13:24:07 No.108614598▶

>>108614535
>>108614546
I think he's using that?
https://github.com/NO-ob/brat_mcp

Anonymous
04/16/26(Thu)13:24:20 No.108614601

Anonymous 04/16/26(Thu)13:24:20 No.108614601▶

File: Screenshot 2026-04-16 at 14-18-55 screenshot the lmg thread on 4chans g board - llama.cpp.png (275.1 KB)

275.1 KB PNG

Anonymous
04/16/26(Thu)13:25:43 No.108614613

Anonymous 04/16/26(Thu)13:25:43 No.108614613▶

File: 1754520866633371.png (511.9 KB)

511.9 KB PNG

>>108614601

Anonymous
04/16/26(Thu)13:25:48 No.108614614

Anonymous 04/16/26(Thu)13:25:48 No.108614614▶

>>108614535
MCP is fun, especially now that we have models capable of tool calling
everyone should learn how to use it

Anonymous
04/16/26(Thu)13:28:42 No.108614628

Anonymous 04/16/26(Thu)13:28:42 No.108614628▶

>>108614598
>dart
the fuck is this shit?

Anonymous
04/16/26(Thu)13:30:37 No.108614637

Anonymous 04/16/26(Thu)13:30:37 No.108614637▶

>>108614628
scrimbloware

Anonymous
04/16/26(Thu)13:31:48 No.108614644

Anonymous 04/16/26(Thu)13:31:48 No.108614644▶

>>108614628
memelang

Anonymous
04/16/26(Thu)13:33:51 No.108614650

Anonymous 04/16/26(Thu)13:33:51 No.108614650▶

File: 1770307341013587.png (208.6 KB)

208.6 KB PNG

>>108614598
>>108614628
is he fucking serious? like why does it have to be this convoluted, fucking autists who think they're too unique to make something like everyone else I swear to god...

Anonymous
04/16/26(Thu)13:35:25 No.108614658

Anonymous 04/16/26(Thu)13:35:25 No.108614658▶

>>108614650
go find any other mcp that does what you want and install it
they are all look like this

Anonymous
04/16/26(Thu)13:36:07 No.108614663

Anonymous 04/16/26(Thu)13:36:07 No.108614663▶

>>108614650
Bro, just ask claude to rewrite it in python/javascript. It's not that hard

Anonymous
04/16/26(Thu)13:36:09 No.108614664

Anonymous 04/16/26(Thu)13:36:09 No.108614664▶

>>108614658
what would be your recommendation, I don't want to touch this meme dart shit

Anonymous
04/16/26(Thu)13:36:11 No.108614665

Anonymous 04/16/26(Thu)13:36:11 No.108614665▶

File: vgpj8o3l0kvg1.jpg (544.2 KB)

544.2 KB JPG

https://huggingface.co/Qwen/Qwen3.6-35B-A3B

Anonymous
04/16/26(Thu)13:36:43 No.108614670

Anonymous 04/16/26(Thu)13:36:43 No.108614670▶

Qwen sisters!!!!!

Anonymous
04/16/26(Thu)13:36:54 No.108614672

Anonymous 04/16/26(Thu)13:36:54 No.108614672▶

>>108614665
LETS FUCKING GOOOOOOOOOOOOOOOOOOOOOOOOOOO

Anonymous
04/16/26(Thu)13:37:04 No.108614673

Anonymous 04/16/26(Thu)13:37:04 No.108614673▶

File: file.png (90.6 KB)

90.6 KB PNG

>>108614628
Google's (already abandoned) mobile nulang based on javascript syntax.

>>108614658
>>108614664
Everything else is either uvx or npx. Pick your cancer.

Anonymous
04/16/26(Thu)13:37:17 No.108614674

Anonymous 04/16/26(Thu)13:37:17 No.108614674▶

>>108614665
yawn, who gives a shit, can it suck my dick?

Anonymous
04/16/26(Thu)13:37:18 No.108614675

Anonymous 04/16/26(Thu)13:37:18 No.108614675▶

1 + 1?

Anonymous
04/16/26(Thu)13:37:46 No.108614682

Anonymous 04/16/26(Thu)13:37:46 No.108614682▶

File: 1750788077972461.jpg (15.6 KB)

15.6 KB JPG

>>108614665
Qwen won the benchmaxx competition again!

Anonymous
04/16/26(Thu)13:37:57 No.108614683

Anonymous 04/16/26(Thu)13:37:57 No.108614683▶

>>108614665
wtf? but the pool they made showed that we wanted the dense model??? WHY ARE THEY GIVING US THE MOEMEME???

Anonymous
04/16/26(Thu)13:37:58 No.108614684

Anonymous 04/16/26(Thu)13:37:58 No.108614684▶

>>108614665
roleplay???

Anonymous
04/16/26(Thu)13:38:02 No.108614685

Anonymous 04/16/26(Thu)13:38:02 No.108614685▶

>>108614665
>Following the February release of the Qwen3.5 series, we're pleased to share the first open-weight variant of Qwen3.6. Built on direct feedback from the community, Qwen3.6 prioritizes stability and real-world utility, offering developers a more intuitive, responsive, and genuinely productive coding experience.
Okay. Sure.

Anonymous
04/16/26(Thu)13:38:02 No.108614687

Anonymous 04/16/26(Thu)13:38:02 No.108614687▶

>>108614665
BUT WE VOTED FOR THE 27B MODEL

Anonymous
04/16/26(Thu)13:38:12 No.108614688

Anonymous 04/16/26(Thu)13:38:12 No.108614688▶

>>108614665
Gemma lost

Anonymous
04/16/26(Thu)13:38:36 No.108614691

Anonymous 04/16/26(Thu)13:38:36 No.108614691▶

>>108614684
perfect for good looks

Anonymous
04/16/26(Thu)13:39:14 No.108614693

Anonymous 04/16/26(Thu)13:39:14 No.108614693▶

File: 1756019245562141.png (93.9 KB)

93.9 KB PNG

>>108614665
WE VOTED FOR THE 27B MODEL WHAT ARE THEY DOING???

Anonymous
04/16/26(Thu)13:39:16 No.108614694

Anonymous 04/16/26(Thu)13:39:16 No.108614694▶

>>108614665
Is this shit at all useful compared to claude?

Anonymous
04/16/26(Thu)13:39:21 No.108614695

Anonymous 04/16/26(Thu)13:39:21 No.108614695▶

>>108614526
>I'll try this list.
Tried it, didn't work at all. Not that I'm surprised. This model's writing style is firmly set in stone.
Ah well, it's still the best model for 24gb cards by far. Just gotta live with it.

Anonymous
04/16/26(Thu)13:39:49 No.108614698

Anonymous 04/16/26(Thu)13:39:49 No.108614698▶

>>108614665
>rivaling much larger dense models such as Qwen3.5-27B and Gemma-31B
this is insane work

Anonymous
04/16/26(Thu)13:40:07 No.108614700

Anonymous 04/16/26(Thu)13:40:07 No.108614700▶

File: qwun.png (48.9 KB)

48.9 KB PNG

>>108614665
>this fucking chart
This should be criminal.

Anonymous
04/16/26(Thu)13:40:24 No.108614701

Anonymous 04/16/26(Thu)13:40:24 No.108614701▶

>>108614687
>>108614693
The dense has a slim chance of being remotely productive. Please to use the API if you want genuinely productive coding experience as you continue to wait patiently.

Anonymous
04/16/26(Thu)13:40:33 No.108614703

Anonymous 04/16/26(Thu)13:40:33 No.108614703▶

File: 1756523265487738.png (53.4 KB)

53.4 KB PNG

>>108614673
fine, I'll do it myself

Anonymous
04/16/26(Thu)13:41:34 No.108614708

Anonymous 04/16/26(Thu)13:41:34 No.108614708▶

>>108614701
what's the point of making a fucking poll if they don't listen to the results anyway? goddam I hate those bugs so much!!

Anonymous
04/16/26(Thu)13:41:50 No.108614712

Anonymous 04/16/26(Thu)13:41:50 No.108614712▶

>>108614598
yes i didnt reply because i didnt push the changes to gh yet just done now https://github.com/NO-ob/brat_mcp/releases/tag/1.0.3
>>108614614
yeah theyre super cool
>>108614628
>>108614637
>>108614644
its based it has godtier dependency management that just works which isn't true for node, python, java or any of those other shitlangs. also better than js and python because its strongly typed. its literally peak
>>108614650
i would not recommend installing the sdk with a package manager just download the archive and add it to path

Anonymous
04/16/26(Thu)13:42:09 No.108614714

Anonymous 04/16/26(Thu)13:42:09 No.108614714▶

>>108614700
The apple-nvidia school of scaling your charts.

Anonymous
04/16/26(Thu)13:43:18 No.108614722

Anonymous 04/16/26(Thu)13:43:18 No.108614722▶

>>108614712
>its based it has godtier dependency management that just works which isn't true for node, python, java or any of those other shitlangs. also better than js and python because its strongly typed. its literally peak
Rust exits. Why work uphill using abandonware when everyone else has moved on?

Anonymous
04/16/26(Thu)13:43:52 No.108614725

Anonymous 04/16/26(Thu)13:43:52 No.108614725▶

Everyone else should give up and go home like Mistral did to save face. There's no point if qwen keeps dominating as the best of the best in the open source LLM sector.

Anonymous
04/16/26(Thu)13:44:16 No.108614730

Anonymous 04/16/26(Thu)13:44:16 No.108614730▶

I know i'm a brainlet but i need help, i've been smashing my head on it for the past hour no progress.

I keep getting error in sillytavern any message i type..

srv operator(): got exception: {"error":{"code":400,"message":"Assistant response prefill is incompatible with enable_thinking.","type":"invalid_request_error"}}

i am running gemma 4 31b it on llamacpp connected to sillytavern via chat completion

Anonymous
04/16/26(Thu)13:45:22 No.108614736

Anonymous 04/16/26(Thu)13:45:22 No.108614736▶

File: 1760866671379962.png (12.7 KB)

12.7 KB PNG

>>108614712
I'm on windows what the fuck I'm supposed to do with this shit? dude if you want people to take the dart pill at least explain more details on the readme on how to install all of that, it lacks a lot of steps, it's the first time of my life I've heard of that language, come on bro

Anonymous
04/16/26(Thu)13:45:23 No.108614737

Anonymous 04/16/26(Thu)13:45:23 No.108614737▶

>>108614722
its not abandonware and i like it

Anonymous
04/16/26(Thu)13:45:25 No.108614738

Anonymous 04/16/26(Thu)13:45:25 No.108614738▶

>>108614730
Read nigga. Why are you using a prefill?

Anonymous
04/16/26(Thu)13:45:34 No.108614739

Anonymous 04/16/26(Thu)13:45:34 No.108614739▶

>>108614725
There's no use case for small coding models

Anonymous
04/16/26(Thu)13:46:13 No.108614746

Anonymous 04/16/26(Thu)13:46:13 No.108614746▶

>>108614475
I tried that Recast extension idea from a few threads back but it was way too aggressive in removing them and then not replacing them with anything, so it ended up a disjointed mess
Might have been Gemma 26b's fault though, it's good at following instructions, occasionally to its own detriment
At least I've seen a lot less slop than other models I've used, though I could do with less figurative physical blows too

Anonymous
04/16/26(Thu)13:46:34 No.108614749

Anonymous 04/16/26(Thu)13:46:34 No.108614749▶

>>108614736
ask gemma

Anonymous
04/16/26(Thu)13:46:37 No.108614750

Anonymous 04/16/26(Thu)13:46:37 No.108614750▶

>>108614739
Autocomplete is the only valid use case and qwen has the common sense to train on fitm.

Anonymous
04/16/26(Thu)13:46:37 No.108614751

Anonymous 04/16/26(Thu)13:46:37 No.108614751▶

>>108614730
add
--reasoning off
to your args

Anonymous
04/16/26(Thu)13:46:53 No.108614752

Anonymous 04/16/26(Thu)13:46:53 No.108614752▶

>>108614730
You have reasoning enabled and are trying to use a prefill.
Either remove the prefil, or disable reasoning.
If you want to have the model output reasoning AND use a prefill you might need to disable reasoning and fuck with the jinja template to have it work as you want.

Anonymous
04/16/26(Thu)13:47:09 No.108614754

Anonymous 04/16/26(Thu)13:47:09 No.108614754▶

>>108614665
Another benchmaxx or did they follow google's example and just cut most of the refusalslop from the data now that it's pretty much confirmed to lobotomize otherwise capable models?
Only time and gguf support will tell.

Anonymous
04/16/26(Thu)13:47:09 No.108614755

Anonymous 04/16/26(Thu)13:47:09 No.108614755▶

>>108614736
I DONT GIVE A FUCK ABOUT THE FUCKING CODE! i just want to download this stupid fucking application and use it https://github.com/NO-ob/brat_mcp

WHY IS THERE CODE??? MAKE A FUCKING .EXE FILE AND GIVE IT TO ME. these dumbfucks think that everyone is a developer and understands code. well i am not and i don't understand it. I only know to download and install applications. SO WHY THE FUCK IS THERE CODE? make an EXE file and give it to me. STUPID FUCKING SMELLY NERDS

Anonymous
04/16/26(Thu)13:47:57 No.108614762

Anonymous 04/16/26(Thu)13:47:57 No.108614762▶

>>108614736
ask your FUCKING AI!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Anonymous
04/16/26(Thu)13:48:31 No.108614769

Anonymous 04/16/26(Thu)13:48:31 No.108614769▶

>>108614738
>>108614751
>>108614752

goddamn sillytavern and its drop-down menus..

Thank you anons, you helped me look again and i found it

Anonymous
04/16/26(Thu)13:48:46 No.108614770

Anonymous 04/16/26(Thu)13:48:46 No.108614770▶

>>108614665
>qwe-
i sleep

Anonymous
04/16/26(Thu)13:49:02 No.108614772

Anonymous 04/16/26(Thu)13:49:02 No.108614772▶

>>108614755
this, but unironically

Anonymous
04/16/26(Thu)13:50:04 No.108614780

Anonymous 04/16/26(Thu)13:50:04 No.108614780▶

>>108614755
It is current year. Developers are absolete. Just ask your gf aigent to install it for you.

Anonymous
04/16/26(Thu)13:50:12 No.108614781

Anonymous 04/16/26(Thu)13:50:12 No.108614781▶

File: why?.png (63.7 KB)

63.7 KB PNG

>>108614665
it was supposed to be the dense 27b model alibaba, why did you change your mind?

Anonymous
04/16/26(Thu)13:51:41 No.108614787

Anonymous 04/16/26(Thu)13:51:41 No.108614787▶

>>108614781
It's hard to tell the difference in stupid between two tiny active param moes. Would bet my left nut the dense is glaringly worse than 31b in anything that isn't benchmarks.

Anonymous
04/16/26(Thu)13:53:13 No.108614797

Anonymous 04/16/26(Thu)13:53:13 No.108614797▶

File: 1750098860584607.png (150.3 KB)

150.3 KB PNG

>>108614749
>>108614762

Anonymous
04/16/26(Thu)13:54:02 No.108614800

Anonymous 04/16/26(Thu)13:54:02 No.108614800▶

>>108614797
>if you can't handle a basic readme
the readme doesn't say anything about that though, that's the fucking problem

Anonymous
04/16/26(Thu)13:55:10 No.108614809

Anonymous 04/16/26(Thu)13:55:10 No.108614809▶

>>108614800
if you had more than 3 braincels youd read the bit that says
>Building an Executable
>dart compile exe bin/mcp_server.dart -o brat_mcp
and go figure what dart is

Anonymous
04/16/26(Thu)13:56:12 No.108614817

Anonymous 04/16/26(Thu)13:56:12 No.108614817▶

>>108614809
fuck you and your autistic mcp, I'll make one myself, with a normal language for normal people

Anonymous
04/16/26(Thu)13:56:37 No.108614821

Anonymous 04/16/26(Thu)13:56:37 No.108614821▶

>>108614665
SIRS PLS RELEASE GANESHA 4.1!!!!

Anonymous
04/16/26(Thu)13:57:40 No.108614826

Anonymous 04/16/26(Thu)13:57:40 No.108614826▶

>>108614665
>nothing ever happ-
man fuck sleeping. who need sleep anyway

Anonymous
04/16/26(Thu)13:57:41 No.108614827

Anonymous 04/16/26(Thu)13:57:41 No.108614827▶

File: 1770745761827352.png (195.6 KB)

195.6 KB PNG

>>108614800

Anonymous
04/16/26(Thu)13:57:48 No.108614829

Anonymous 04/16/26(Thu)13:57:48 No.108614829▶

>>108614821
I bet if it's not a benchmaxxed pile of trash the big Gemma-4 model will just magically appear.

Anonymous
04/16/26(Thu)13:58:45 No.108614838

Anonymous 04/16/26(Thu)13:58:45 No.108614838▶

>>108614817
have fun kek
>>108614700
the google logo is referring to gemma3 btw

Anonymous
04/16/26(Thu)13:58:49 No.108614839

Anonymous 04/16/26(Thu)13:58:49 No.108614839▶

>>108614827
>he's hidding behind an AI instead of fighting like a man
so that's how a dartnigger acts like?

Anonymous
04/16/26(Thu)13:58:51 No.108614840

Anonymous 04/16/26(Thu)13:58:51 No.108614840▶

>>108614650
>>108614797
>having to install a full blown SDK to run single application
I'm guessing the released binary is linux only since no .exe.

Anonymous
04/16/26(Thu)13:59:18 No.108614847

Anonymous 04/16/26(Thu)13:59:18 No.108614847▶

>>108614665
OH FUCK YES

FUCK GEMMA

Anonymous
04/16/26(Thu)13:59:39 No.108614849

Anonymous 04/16/26(Thu)13:59:39 No.108614849▶

>>108614665
heretic ARA ablation when?
SOMA + MPOA when?????????

Anonymous
04/16/26(Thu)14:00:04 No.108614853

Anonymous 04/16/26(Thu)14:00:04 No.108614853▶

File: 1747205600636902.png (414.8 KB)

414.8 KB PNG

>>108614838
>the google logo is referring to gemma3 btw
lmaoooo

Anonymous
04/16/26(Thu)14:01:30 No.108614860

Anonymous 04/16/26(Thu)14:01:30 No.108614860▶

File: fuck those chinks.png (31.4 KB)

31.4 KB PNG

>>108614665
>no 27b, as promised
and then I started to hate them

Anonymous
04/16/26(Thu)14:02:01 No.108614861

Anonymous 04/16/26(Thu)14:02:01 No.108614861▶

File: 1754270239336082.png (184.5 KB)

184.5 KB PNG

>>108614839

Anonymous
04/16/26(Thu)14:02:42 No.108614866

Anonymous 04/16/26(Thu)14:02:42 No.108614866▶

>>108614665
But is it gonna spend 4k tokens reasoning about the cosmic rays flipping bits when it's asked to analyze a couple of functions?

Anonymous
04/16/26(Thu)14:03:03 No.108614869

Anonymous 04/16/26(Thu)14:03:03 No.108614869▶

>>108612501
https://github.com/deepseek-ai/DeepGEMM/pull/304

codename megamoe

Anonymous
04/16/26(Thu)14:03:11 No.108614870

Anonymous 04/16/26(Thu)14:03:11 No.108614870▶

they don't want to release 27b because they are waiting for gemma to catch up

Anonymous
04/16/26(Thu)14:03:54 No.108614875

Anonymous 04/16/26(Thu)14:03:54 No.108614875▶

>>108614849
Yeah. Qwen 3.5e really needed it for anything "unsafe".
Not that you'd want to use it for sex anyhow, but still.
Here's hoping they fixed the reasoning too.
That shit was overbearing.
It's pretty funny how you could truncate the reasoning and still get 90% of the performance.

Anonymous
04/16/26(Thu)14:04:03 No.108614876

Anonymous 04/16/26(Thu)14:04:03 No.108614876▶

>>108614869
>DeepGEMM
Next dipsy is going to be Gemma 31B distilled into 300B+

Anonymous
04/16/26(Thu)14:04:13 No.108614877

Anonymous 04/16/26(Thu)14:04:13 No.108614877▶

File: 11.jpg (174.9 KB)

174.9 KB JPG

>>108612501
kek unslop is rushing uploads as we speak

Anonymous
04/16/26(Thu)14:04:59 No.108614883

Anonymous 04/16/26(Thu)14:04:59 No.108614883▶

>>108614853
Type like an adult you pathetic zoomer faggot.

Anonymous
04/16/26(Thu)14:05:18 No.108614886

Anonymous 04/16/26(Thu)14:05:18 No.108614886▶

>>108614877
same architecture so it should be fine, r-right?

Anonymous
04/16/26(Thu)14:05:24 No.108614889

Anonymous 04/16/26(Thu)14:05:24 No.108614889▶

File: 123.jpg (56.7 KB)

56.7 KB JPG

its here!
sistas what are we waiting for?

Anonymous
04/16/26(Thu)14:05:26 No.108614890

Anonymous 04/16/26(Thu)14:05:26 No.108614890▶

>>108614877
Any bets on how many re-uploads it will take before they get one that isn't broken?

Anonymous
04/16/26(Thu)14:05:42 No.108614891

Anonymous 04/16/26(Thu)14:05:42 No.108614891▶

>>108614875
>Here's hoping they fixed the reasoning too.
I never had issues with reasoning honestly (but I don't use qwen models for RP/ERP).
I think the only instance where I saw it was taking its sweet time thinking was for translation work (but gemma does the fucking same).
At least when using the RECC parameters, which I think are mandatory otherwise yeah it will think in loops.

Anonymous
04/16/26(Thu)14:06:09 No.108614896

Anonymous 04/16/26(Thu)14:06:09 No.108614896▶

>>108614869
I'll wait for UltraMoE

Anonymous
04/16/26(Thu)14:06:22 No.108614897

Anonymous 04/16/26(Thu)14:06:22 No.108614897▶

>>108614889
I'm waiting for non shit Q8 (by ggml org or memeowski)

Anonymous
04/16/26(Thu)14:06:37 No.108614899

Anonymous 04/16/26(Thu)14:06:37 No.108614899▶

>>108614838
>the google logo is referring to gemma3 btw
https://qwen.ai/blog?id=qwen3.6-35b-a3b
They're comparing it to Gemma 4. It's gonna be censored trash like all the Qwens, though.

Anonymous
04/16/26(Thu)14:07:42 No.108614903

Anonymous 04/16/26(Thu)14:07:42 No.108614903▶

File: 1745248195082286.png (18.1 KB)

18.1 KB PNG

>>108614861
even google doesn't recognize your meme language rofl

Anonymous
04/16/26(Thu)14:08:59 No.108614910

Anonymous 04/16/26(Thu)14:08:59 No.108614910▶

>>108614899
I love how we're suddenly discussing how X model is more censored than Gemma. "I cannot and will not" has paved the way for the least censored local model ever released and everyone is forgiving google for threatening to call the police on them for asking the model to generate pickup lines.

Anonymous
04/16/26(Thu)14:12:24 No.108614918

Anonymous 04/16/26(Thu)14:12:24 No.108614918▶

>unsloth is up
>q4 is 17gb+
it's over

Anonymous
04/16/26(Thu)14:13:00 No.108614925

Anonymous 04/16/26(Thu)14:13:00 No.108614925▶

>>108614910
>forgiving
People like the weights we were able to download. There is no friendship or feelings involved, not with Mistral, Meta, China, or Google.

Anonymous
04/16/26(Thu)14:13:42 No.108614929

Anonymous 04/16/26(Thu)14:13:42 No.108614929▶

>>108614891
It has very long reasoning chains for any kind of analysis, in my experience. Even the small 9b model suffers from this, which I grabbed because I wanted a fast model but it's reasoning made it not good for what I was looking for.

Anonymous
04/16/26(Thu)14:13:57 No.108614931

Anonymous 04/16/26(Thu)14:13:57 No.108614931▶

>>108614918
is moe

Anonymous
04/16/26(Thu)14:14:06 No.108614932

Anonymous 04/16/26(Thu)14:14:06 No.108614932▶

>>108613312
Post her

Anonymous
04/16/26(Thu)14:14:26 No.108614935

Anonymous 04/16/26(Thu)14:14:26 No.108614935▶

>>108614925
sarr jokes are down by 70% and the level of broken English in this thread has gone back up again. Qwen 3.6 needs to be good. Or it's over.

Anonymous
04/16/26(Thu)14:14:27 No.108614936

Anonymous 04/16/26(Thu)14:14:27 No.108614936▶

>>108614897
for non UD cant you just do that yourself? llama.cpp repo has all the tools

Anonymous
04/16/26(Thu)14:15:34 No.108614940

Anonymous 04/16/26(Thu)14:15:34 No.108614940▶

>ask gemma to create automated install script for dart sdk based on the instructions listed on the official page
It's going to delete /root isn't it?

Anonymous
04/16/26(Thu)14:15:58 No.108614942

Anonymous 04/16/26(Thu)14:15:58 No.108614942▶

File: Screenshot 2026-04-16 at 15-15-29 please write a windows build guide for the seether itt https __boards.4chan.org_g_thread_108612501 https __github.com_NO-ob_brat_mcp - llama.cpp.png (506.4 KB)

506.4 KB PNG

>>108614839
thats not my gemma this is, agi btw

Anonymous
04/16/26(Thu)14:16:14 No.108614944

Anonymous 04/16/26(Thu)14:16:14 No.108614944▶

>>108614936
do not want to waste the bandwith and data caps of the ice for the throw away datas after

Anonymous
04/16/26(Thu)14:16:25 No.108614946

Anonymous 04/16/26(Thu)14:16:25 No.108614946▶

>>108614936
my server is busy right now (doing multi-encoding pass of some old library titles I had) and I dont wanna pause it, I can wait

Anonymous
04/16/26(Thu)14:17:02 No.108614950

Anonymous 04/16/26(Thu)14:17:02 No.108614950▶

>>108614944
>data caps
okay at least one real american itt

Anonymous
04/16/26(Thu)14:17:30 No.108614952

Anonymous 04/16/26(Thu)14:17:30 No.108614952▶

File: 1767493285106585.png (168.6 KB)

168.6 KB PNG

>downloading uncslop quants, moreover on release

Anonymous
04/16/26(Thu)14:18:30 No.108614955

Anonymous 04/16/26(Thu)14:18:30 No.108614955▶

>uncslop
Go take your daddy issues somewhere else.

Anonymous
04/16/26(Thu)14:19:42 No.108614964

Anonymous 04/16/26(Thu)14:19:42 No.108614964▶

okay danial

Anonymous
04/16/26(Thu)14:20:45 No.108614970

Anonymous 04/16/26(Thu)14:20:45 No.108614970▶

>>108614931
im on pcie3

Anonymous
04/16/26(Thu)14:21:47 No.108614973

Anonymous 04/16/26(Thu)14:21:47 No.108614973▶

>>108614970
so?

Anonymous
04/16/26(Thu)14:22:54 No.108614975

Anonymous 04/16/26(Thu)14:22:54 No.108614975▶

>>108614430
Why is Gemma so horny?

Anonymous
04/16/26(Thu)14:23:12 No.108614978

Anonymous 04/16/26(Thu)14:23:12 No.108614978▶

File: 1773081852420762.jpg (64.8 KB)

64.8 KB JPG

>>108614955

Anonymous
04/16/26(Thu)14:24:27 No.108614985

Anonymous 04/16/26(Thu)14:24:27 No.108614985▶

>>108614978
The irony is palpable.

Anonymous
04/16/26(Thu)14:24:52 No.108614987

Anonymous 04/16/26(Thu)14:24:52 No.108614987▶

>>108614952
your ISP charges by the gb or what?

Anonymous
04/16/26(Thu)14:25:28 No.108614993

Anonymous 04/16/26(Thu)14:25:28 No.108614993▶

>>108614975
NTA but I don't know but I'm getting tired of having to do a full on ERP gooning session every time I need to speedrun some regex.

Anonymous
04/16/26(Thu)14:25:42 No.108614994

Anonymous 04/16/26(Thu)14:25:42 No.108614994▶

>>108614985
Ok daniel, try to release not broken quants next time

Anonymous
04/16/26(Thu)14:26:33 No.108614999

Anonymous 04/16/26(Thu)14:26:33 No.108614999▶

File: A.jpg (26.2 KB)

26.2 KB JPG

hay wait a minute arent we supposed to get a 27b
where the fuck is my 27b

Anonymous
04/16/26(Thu)14:27:08 No.108615004

Anonymous 04/16/26(Thu)14:27:08 No.108615004▶

>>108613373
>large inverted nipples
>unkempt pubic hair
hnnng

Anonymous
04/16/26(Thu)14:28:22 No.108615009

Anonymous 04/16/26(Thu)14:28:22 No.108615009▶

>>108614973
its so slow

Anonymous
04/16/26(Thu)14:29:11 No.108615012

Anonymous 04/16/26(Thu)14:29:11 No.108615012▶

>>108614129
He's a faggot and you are too.

Anonymous
04/16/26(Thu)14:29:25 No.108615014

Anonymous 04/16/26(Thu)14:29:25 No.108615014▶

>>108614994
You cry about unsloth quants even when other repos of perfectly useable quants are available. i.e. Day 0 Gemma, which had perfectly functional quants from ggml-org.

Anonymous
04/16/26(Thu)14:29:40 No.108615016

Anonymous 04/16/26(Thu)14:29:40 No.108615016▶

>>108614999
A3B is 9 times faster to train, what did you expect?

Anonymous
04/16/26(Thu)14:30:07 No.108615021

Anonymous 04/16/26(Thu)14:30:07 No.108615021▶

>>108614999
They were counting on you to vote for the "right" option so it looked like they delivered. It's your fault.

Anonymous
04/16/26(Thu)14:30:45 No.108615022

Anonymous 04/16/26(Thu)14:30:45 No.108615022▶

File: 1752793500185766.jpg (39.5 KB)

39.5 KB JPG

>>108614999
Chinks lying? How could it be...

Anonymous
04/16/26(Thu)14:30:56 No.108615024

Anonymous 04/16/26(Thu)14:30:56 No.108615024▶

File: 1745086712317036.png (194.3 KB)

194.3 KB PNG

>>108614999
Get Chinese culture'ed

Anonymous
04/16/26(Thu)14:31:59 No.108615031

Anonymous 04/16/26(Thu)14:31:59 No.108615031▶

>>108614935
When Americans are online it's more about their strange attitutes about learning foreign languages. Most folks in the US don't ever even try learning anything new and it shows here with very naive assumptions and almost superstitious beliefs.

Anonymous
04/16/26(Thu)14:33:22 No.108615035

Anonymous 04/16/26(Thu)14:33:22 No.108615035▶

>>108615014
I will admit though he's a faggot for not releasing Q8_0 first. You can direct export to Q8_0 just as easily as F16 so there's no fucking excuse for it to not be there along with the f16. But he specializes in cope-quants so what do we expect?

Anonymous
04/16/26(Thu)14:33:53 No.108615039

Anonymous 04/16/26(Thu)14:33:53 No.108615039▶

>qwen out
>nobody gives a shit
Gemma-chan won

Anonymous
04/16/26(Thu)14:34:44 No.108615044

Anonymous 04/16/26(Thu)14:34:44 No.108615044▶

>>108615031
Life is too short to spend multiple years learning another language when you could be doing anything else and using a translator

Anonymous
04/16/26(Thu)14:34:50 No.108615045

Anonymous 04/16/26(Thu)14:34:50 No.108615045▶

File: b55817de55.png (1.1 MB)

1.1 MB PNG

Hey, guys, remember me? Hehe
I'm still here, bros
You know, Llama, your best local model

Anonymous
04/16/26(Thu)14:35:11 No.108615047

Anonymous 04/16/26(Thu)14:35:11 No.108615047▶

>>108615039
wait us

Anonymous
04/16/26(Thu)14:36:02 No.108615051

Anonymous 04/16/26(Thu)14:36:02 No.108615051▶

>>108615039
Gemma 4 is good enough not to bother making my own non copequant gguf of Qwen3.6 to try it out. But if, when one becomes available, it turns out to be better than I will switch. But qwen is notorious for benchmaxxing. So I have little faith.

Anonymous
04/16/26(Thu)14:36:11 No.108615055

Anonymous 04/16/26(Thu)14:36:11 No.108615055▶

>>108614700
>ai scaling laws.png

Anonymous
04/16/26(Thu)14:37:11 No.108615058

Anonymous 04/16/26(Thu)14:37:11 No.108615058▶

>>108614700
dense people are repugnant after this revelations

Anonymous
04/16/26(Thu)14:37:23 No.108615061

Anonymous 04/16/26(Thu)14:37:23 No.108615061▶

>>108615051
enough for what
loli role play?
tool use is not legitimate use case? last I check it still crashes on large context

Anonymous
04/16/26(Thu)14:38:13 No.108615069

Anonymous 04/16/26(Thu)14:38:13 No.108615069▶

>they're here

Anonymous
04/16/26(Thu)14:38:25 No.108615071

Anonymous 04/16/26(Thu)14:38:25 No.108615071▶

>>108614695
imo they're not very good are retrospection. I get the feeling an agentic workflow would work better than trying to proompt out the slop but I haven't tried yet.

Anonymous
04/16/26(Thu)14:39:56 No.108615079

Anonymous 04/16/26(Thu)14:39:56 No.108615079▶

>>108615039
There's something I'd like to try:
- Have Qwen3.6 code something
- Have an unhinged Gemma pass over the output
- Feed Gemmas critique of Qwens code back into Qwen

Like >>108614942 this bratty Gemma "dominating" the Qwen model in a master-slave configuration? If some anon perchance has the time and willingness to do so, please do. I expect the results to be hilarious.

Anonymous
04/16/26(Thu)14:41:01 No.108615085

Anonymous 04/16/26(Thu)14:41:01 No.108615085▶

>>108615079
Why make a dumber model review a smarter one?

Anonymous
04/16/26(Thu)14:42:54 No.108615091

Anonymous 04/16/26(Thu)14:42:54 No.108615091▶

>>108615061
>>108615085
1 yuan has been deposited into your account

Anonymous
04/16/26(Thu)14:44:02 No.108615095

Anonymous 04/16/26(Thu)14:44:02 No.108615095▶

>>108615069
It's sad, really.
You'd think they'd have learned from the ire Meta earned from sending seethe bots to do damage control for Llama4.
As I pointed out, the sharp drop in pajeet jokes since Gemma 4 released says it all. We want a good model, not propaganda.
Qwen 3.6 will speak for itself. And no amount of cope and seethe will make it good or bad. We'll decide that with our own esoteric evaluation strategies.

Anonymous
04/16/26(Thu)14:44:05 No.108615096

Anonymous 04/16/26(Thu)14:44:05 No.108615096▶

>>108615085
this is probably a joke but small models perform just as well as large model on tasks like this where they don't have to generate solutions but just assess the quality/effectiveness of stuff

Anonymous
04/16/26(Thu)14:44:49 No.108615097

Anonymous 04/16/26(Thu)14:44:49 No.108615097▶

>>108615091
they'll need all the support they can get after the byd parking fire

Anonymous
04/16/26(Thu)14:47:40 No.108615105

Anonymous 04/16/26(Thu)14:47:40 No.108615105▶

The context for the new Qwen is fairly cheap. 262144 tokens is 5gb according to LM Studio. It's super fast and doesn't seem to be refusing nsfw. Although I'm not into loli so who knows, really.
Think it's worth trying, bros!

Anonymous
04/16/26(Thu)14:50:45 No.108615124

Anonymous 04/16/26(Thu)14:50:45 No.108615124▶

>>108614358
>>download .exe
>>runs
>Feels good to be a WinGod
Enjoy your new career as a crypto mining rig for some guy named Boris in Minsk.

Anonymous
04/16/26(Thu)14:51:51 No.108615127

Anonymous 04/16/26(Thu)14:51:51 No.108615127▶

lmao the linux really thinks this

Anonymous
04/16/26(Thu)14:52:48 No.108615131

Anonymous 04/16/26(Thu)14:52:48 No.108615131▶

I think i am going to Gemma 4

Anonymous
04/16/26(Thu)14:52:53 No.108615132

Anonymous 04/16/26(Thu)14:52:53 No.108615132▶

File: 1753537495669352.png (436.4 KB)

436.4 KB PNG

>>108615105

Anonymous
04/16/26(Thu)14:53:47 No.108615142

Anonymous 04/16/26(Thu)14:53:47 No.108615142▶

>>108615085
To save compute: >>108615096

Also because it would be hilarious. That's the main motivation. There's no new insights to be gained here, it's just a stupid idea that awaits execution. Do it, anons! Do it for science!

Anonymous
04/16/26(Thu)14:54:03 No.108615143

Anonymous 04/16/26(Thu)14:54:03 No.108615143▶

>>108613894
>absolutely no slop
You are blind nigga.

Anonymous
04/16/26(Thu)14:59:09 No.108615176

Anonymous 04/16/26(Thu)14:59:09 No.108615176▶

>>108614955
Yeah you tell them zoomers are only good for anything if they are adults and females

Anonymous
04/16/26(Thu)15:00:44 No.108615186

Anonymous 04/16/26(Thu)15:00:44 No.108615186▶

I use kobold. Is it really worth learning how to set up llama.cpp instead?

Anonymous
04/16/26(Thu)15:01:16 No.108615189

Anonymous 04/16/26(Thu)15:01:16 No.108615189▶

>>108615142
Not really saving compute if you need 3 passes over 2 models.

Anonymous
04/16/26(Thu)15:01:19 No.108615191

Anonymous 04/16/26(Thu)15:01:19 No.108615191▶

>>108615186
I wouldn't recommend Koboldcpp ;)

Anonymous
04/16/26(Thu)15:01:45 No.108615195

Anonymous 04/16/26(Thu)15:01:45 No.108615195▶

File: 1763031866712433.jpg (95.8 KB)

95.8 KB JPG

Anonymous
04/16/26(Thu)15:02:04 No.108615197

Anonymous 04/16/26(Thu)15:02:04 No.108615197▶

>>108614336
RL is all you need. Once you reach the alphazero equivalent of human researcher, it can find fundamental leaps if they exist. This is what is commonly called the "intelligence explosion", an exponential recursive intelligence growth. Expect to be able to run AGI locally on your phone in 5 years if we are still alive.

Anonymous
04/16/26(Thu)15:02:50 No.108615200

Anonymous 04/16/26(Thu)15:02:50 No.108615200▶

>>108615176
Zoomers are mostly adults now but they still act like fucking 12 year olds even when they're in their 20s.

Anonymous
04/16/26(Thu)15:02:57 No.108615204

Anonymous 04/16/26(Thu)15:02:57 No.108615204▶

>>108615195
>Not shown: we made the previous version retarded before the benchmark

Anonymous
04/16/26(Thu)15:03:56 No.108615210

Anonymous 04/16/26(Thu)15:03:56 No.108615210▶

>>108615204
It's just proper 4.6 again, isn't it?

Anonymous
04/16/26(Thu)15:07:12 No.108615229

Anonymous 04/16/26(Thu)15:07:12 No.108615229▶

>>108615189
True, but one agent just isn't enough. Do I really have to do it myself?

Anonymous
04/16/26(Thu)15:07:32 No.108615231

Anonymous 04/16/26(Thu)15:07:32 No.108615231▶

>>108615195
>cybersecurity
>4.7 worse than 4.6
Looks like they are intentionally nerfing their public facing models to reduce abuse potential.

Anonymous
04/16/26(Thu)15:08:30 No.108615236

Anonymous 04/16/26(Thu)15:08:30 No.108615236▶

>>108615105
You know what? Never mind. It's kinda dumb and the writing is sloppy as fuck. The thinking is really long, too.
I'll give it some time until smarter people figure out if you can make it worth using. But it might have potential.

Anonymous
04/16/26(Thu)15:08:40 No.108615239

Anonymous 04/16/26(Thu)15:08:40 No.108615239▶

>>108615231
>we failed on purpose, 4d chess
Did you also vote for Donald Trump?

Anonymous
04/16/26(Thu)15:10:18 No.108615250

Anonymous 04/16/26(Thu)15:10:18 No.108615250▶

Amerikka mad the Qween back

Anonymous
04/16/26(Thu)15:10:34 No.108615252

Anonymous 04/16/26(Thu)15:10:34 No.108615252▶

>>108615229
Do it.

Anonymous
04/16/26(Thu)15:11:27 No.108615262

Anonymous 04/16/26(Thu)15:11:27 No.108615262▶

who won

Anonymous
04/16/26(Thu)15:12:08 No.108615268

Anonymous 04/16/26(Thu)15:12:08 No.108615268▶

>>108615262
>>108615250

Anonymous
04/16/26(Thu)15:13:10 No.108615274

Anonymous 04/16/26(Thu)15:13:10 No.108615274▶

>>108615200
Continuation of the infantilization that started with the participation trophy culture millenials grew up with. They are coddled and sheltered from reality their whole lives and told they're not adults until 25 now, and it's little wonder they never learn how to grow up.

Anonymous
04/16/26(Thu)15:13:46 No.108615284

Anonymous 04/16/26(Thu)15:13:46 No.108615284▶

File: 1770087372545786.png (1.1 MB)

1.1 MB PNG

>>108615236
>It's kinda dumb and the writing is sloppy as fuck. The thinking is really long
Yup, that's Qwen alright.

Anonymous
04/16/26(Thu)15:14:51 No.108615292

Anonymous 04/16/26(Thu)15:14:51 No.108615292▶

>>108615252
But I don't feel like setting up Gemma with Brat_MCP :(

Anonymous
04/16/26(Thu)15:14:53 No.108615293

Anonymous 04/16/26(Thu)15:14:53 No.108615293▶

>>108615239
Calm down, there is no need to be mad.

Anonymous
04/16/26(Thu)15:19:59 No.108615330

Anonymous 04/16/26(Thu)15:19:59 No.108615330▶

Is video and audio support in llama.cpp yet

Anonymous
04/16/26(Thu)15:20:06 No.108615333

Anonymous 04/16/26(Thu)15:20:06 No.108615333▶

>>108615268
me
I got both running

Anonymous
04/16/26(Thu)15:22:21 No.108615347

Anonymous 04/16/26(Thu)15:22:21 No.108615347▶

>>108615330
piotr is on it

Anonymous
04/16/26(Thu)15:22:49 No.108615352

Anonymous 04/16/26(Thu)15:22:49 No.108615352▶

>>108615333
can we get some comparison gens on the same card?

Anonymous
04/16/26(Thu)15:22:54 No.108615353

Anonymous 04/16/26(Thu)15:22:54 No.108615353▶

Gemma may have sloppy writing but at least she's a smart cookie and her thinking is very efficient.

Anonymous
04/16/26(Thu)15:23:40 No.108615362

Anonymous 04/16/26(Thu)15:23:40 No.108615362▶

File: 1000015593.gif (1.5 MB)

1.5 MB GIF

>tfw changed the image generation mcp function in sillytavern to have gemma generate image sequences autonomously with anima when then scene changes or multiple things happen in a message

Anonymous
04/16/26(Thu)15:24:25 No.108615370

Anonymous 04/16/26(Thu)15:24:25 No.108615370▶

>>108615353
A cookie doesn't sound very smart

Anonymous
04/16/26(Thu)15:25:22 No.108615377

Anonymous 04/16/26(Thu)15:25:22 No.108615377▶

So how's Qwen? still the same?

Anonymous
04/16/26(Thu)15:26:23 No.108615386

Anonymous 04/16/26(Thu)15:26:23 No.108615386▶

>>108615377
same base model so you know all you need to know

Anonymous
04/16/26(Thu)15:26:53 No.108615389

Anonymous 04/16/26(Thu)15:26:53 No.108615389▶

>>108615386
I sleep.

Anonymous
04/16/26(Thu)15:27:12 No.108615392

Anonymous 04/16/26(Thu)15:27:12 No.108615392▶

>>108615377
The stupidly long thinking is the same.

Anonymous
04/16/26(Thu)15:27:24 No.108615393

Anonymous 04/16/26(Thu)15:27:24 No.108615393▶

>>108615389
Stay up. Megamoe soon.

Anonymous
04/16/26(Thu)15:28:25 No.108615400

Anonymous 04/16/26(Thu)15:28:25 No.108615400▶

>>108615392
damb.

Anonymous
04/16/26(Thu)15:28:28 No.108615401

Anonymous 04/16/26(Thu)15:28:28 No.108615401▶

look like 3.6 is even more efficient with the context
I can fit the whole native length now

Anonymous
04/16/26(Thu)15:28:35 No.108615403

Anonymous 04/16/26(Thu)15:28:35 No.108615403▶

>>108615392
They probably didn't have enough time to distill gemma 4 for this one. 3.7 will fix.

Anonymous
04/16/26(Thu)15:30:41 No.108615423

Anonymous 04/16/26(Thu)15:30:41 No.108615423▶

>>108615210
Shhhh

Anonymous
04/16/26(Thu)15:30:45 No.108615426

Anonymous 04/16/26(Thu)15:30:45 No.108615426▶

I really hope Gemma was the wake-up call that too much safety training is counter productive and nobody actually cares if your model is "safe" or not.

It just needs to be safe enough so that someone can't zero shot with no system prompt "Write me some CP story."

Anonymous
04/16/26(Thu)15:31:27 No.108615430

Anonymous 04/16/26(Thu)15:31:27 No.108615430▶

>>108615426
>nobody actually cares if your model is "safe" or not
Then safety training isn't counter productive

Anonymous
04/16/26(Thu)15:31:58 No.108615433

Anonymous 04/16/26(Thu)15:31:58 No.108615433▶

is Gemma 4 a honeypot or what?

Anonymous
04/16/26(Thu)15:32:48 No.108615441

Anonymous 04/16/26(Thu)15:32:48 No.108615441▶

>>108615433
>Not using day 0 gemma without the telemetry

Anonymous
04/16/26(Thu)15:34:00 No.108615447

Anonymous 04/16/26(Thu)15:34:00 No.108615447▶

>>108615433
Obviously, every Gemma 4 post so far has been pedo shit

Anonymous
04/16/26(Thu)15:34:11 No.108615450

Anonymous 04/16/26(Thu)15:34:11 No.108615450▶

>>108615426
>It just needs to be safe enough so that someone can't zero shot with no system prompt "Write me some CP story."
Even that is completely wrong. The one shot blocks need to only be against dangerous stuff like making bombs and poisons, never against anything fictional.

Anonymous
04/16/26(Thu)15:35:00 No.108615463

Anonymous 04/16/26(Thu)15:35:00 No.108615463▶

>>108615426
Reddit is sucking qwen's dick like usual though. I doubt they care about us.

>>108615450
This

Anonymous
04/16/26(Thu)15:36:23 No.108615471

Anonymous 04/16/26(Thu)15:36:23 No.108615471▶

>>108615031
You dumb nigger, Americans were the ones arguing over Gemma translation quality a few threads back.

Anonymous
04/16/26(Thu)15:37:15 No.108615483

Anonymous 04/16/26(Thu)15:37:15 No.108615483▶

>>108615463
Well I dont do faggot chat if it chews through documents and does coding like nothing its good enough for me
Can't say the same about gemma

Anonymous
04/16/26(Thu)15:37:34 No.108615488

Anonymous 04/16/26(Thu)15:37:34 No.108615488▶

File: 1759632947221470.png (581.5 KB)

581.5 KB PNG

Unslop, don't look!

Anonymous
04/16/26(Thu)15:38:32 No.108615497

Anonymous 04/16/26(Thu)15:38:32 No.108615497▶

No bart's quantz, it's so over

Anonymous
04/16/26(Thu)15:39:01 No.108615498

Anonymous 04/16/26(Thu)15:39:01 No.108615498▶

>>108615483
Can't speak for coding but Gemma seems to chew through documents just fine

Anonymous
04/16/26(Thu)15:39:17 No.108615500

Anonymous 04/16/26(Thu)15:39:17 No.108615500▶

I might use qwen if it has FIM

Anonymous
04/16/26(Thu)15:40:02 No.108615504

Anonymous 04/16/26(Thu)15:40:02 No.108615504▶

>>108615498
Oh and doesn't spend 5000 tokens thinking about it

Anonymous
04/16/26(Thu)15:41:47 No.108615511

Anonymous 04/16/26(Thu)15:41:47 No.108615511▶

Whats a good model to generate sex toy scripts with? I'm currently running gemma-4-26b-a4b IQ4_XS. Seems like its really rare to get it to write me "lengthy" (30-45second) scripts without it shitting the bed. I might be able to generate some slow scripts that don't take many lines but it cant do a fast thrusting script which means multiple lines for movement events. Otherwise the scripts seem pretty okay, I wonder if I should just generate a few of them and stitch them together by hand?

Anonymous
04/16/26(Thu)15:42:44 No.108615521

Anonymous 04/16/26(Thu)15:42:44 No.108615521▶

>>108615450
what about fictional bombs

Anonymous
04/16/26(Thu)15:42:51 No.108615523

Anonymous 04/16/26(Thu)15:42:51 No.108615523▶

File: 1766294095176201.png (204.5 KB)

204.5 KB PNG

AMODEEEEEEEEEEEEEIIIIII

Anonymous
04/16/26(Thu)15:43:00 No.108615524

Anonymous 04/16/26(Thu)15:43:00 No.108615524▶

>>108615511
The fuck is a sex toy script?

Anonymous
04/16/26(Thu)15:43:31 No.108615529

Anonymous 04/16/26(Thu)15:43:31 No.108615529▶

>>108615523
local?

Anonymous
04/16/26(Thu)15:43:38 No.108615531

Anonymous 04/16/26(Thu)15:43:38 No.108615531▶

File: 1773747356838295.jpg (73.9 KB)

73.9 KB JPG

>>108615511
>sex toy scripts

Anonymous
04/16/26(Thu)15:43:50 No.108615534

Anonymous 04/16/26(Thu)15:43:50 No.108615534▶

>>108615511
A long time ago I tried training a model from scratch that generated scripts from audio.

I'd use already scripted HMV/PMV as training data.

Anonymous
04/16/26(Thu)15:43:57 No.108615536

Anonymous 04/16/26(Thu)15:43:57 No.108615536▶

>>108615498
Coding is kinda hard requirement for me here, pretty disappointed after hearing so many good thing about it
Go see llama.cpp issue people still trying to fix gemma in the blind with no resolution in sight

Anonymous
04/16/26(Thu)15:44:48 No.108615544

Anonymous 04/16/26(Thu)15:44:48 No.108615544▶

>>108615392
That's a shame. It basically makes the model borderline unusable for anything that isnt ask it to do X, come back 10 minutes later

Anonymous
04/16/26(Thu)15:44:51 No.108615545

Anonymous 04/16/26(Thu)15:44:51 No.108615545▶

>>108615524
>>108615531
They're called funscripts.

Anonymous
04/16/26(Thu)15:45:13 No.108615547

Anonymous 04/16/26(Thu)15:45:13 No.108615547▶

File: ai.png (123.4 KB)

123.4 KB PNG

Anonymous
04/16/26(Thu)15:49:26 No.108615573

Anonymous 04/16/26(Thu)15:49:26 No.108615573▶

>>108615524
I made a sillytavern extension that gives the LLM a tool it can call with the argument being a name of a script, that name gets fed to a python script that then plays it on my OSR2 stroker. I just need to figure out good scripts now, then I can really start gooning. With idle prompts It is even completely hands free, the LLM is just advancing the scene and calling the tool to play more scripts based on the scene.

Anonymous
04/16/26(Thu)15:51:01 No.108615586

Anonymous 04/16/26(Thu)15:51:01 No.108615586▶

File: 1773620769931015.png (356.6 KB)

356.6 KB PNG

>>108615573

Anonymous
04/16/26(Thu)15:51:05 No.108615587

Anonymous 04/16/26(Thu)15:51:05 No.108615587▶

>>108615573
All this effort just to "goon" more efficiently

Anonymous
04/16/26(Thu)15:51:34 No.108615590

Anonymous 04/16/26(Thu)15:51:34 No.108615590▶

>>108615529
sometimes

Anonymous
04/16/26(Thu)15:53:25 No.108615606

Anonymous 04/16/26(Thu)15:53:25 No.108615606▶

>>108615523
This is too dangerous for human consumption...

Anonymous
04/16/26(Thu)15:53:38 No.108615608

Anonymous 04/16/26(Thu)15:53:38 No.108615608▶

>>108615587
Hey, its a hobby. (I guess)
>>108615545
I took some inspiration form funscripts, its not exactly the same, but close.

Anonymous
04/16/26(Thu)15:56:07 No.108615618

Anonymous 04/16/26(Thu)15:56:07 No.108615618▶

Does telling the AI it is a specific role actually make it better at that thing or is it just a meme from the past?
Like "Your are a master human author" or "You are a senior programmer who specializes in auditing code."

Anonymous
04/16/26(Thu)15:56:19 No.108615620

Anonymous 04/16/26(Thu)15:56:19 No.108615620▶

>>108615573
What I did is I told the LLM it should generate beat patterns using a simple syntax "HHHH" would mean 4x half notes beat. and I have a parser that translates that to an audible rhythm. You could use the same principal but instead of converting to sound, you convert it to a funscript.

Here's the full rules:
# ### PATTERN FORMAT
# Q = quarter note
# H = half note
# E = eighth note
# T = Triplet quarter note

# There are 4 beats in a measure. A quarter note gets 1 beat, a half note gets 2 beats, an eighth note gets 0.5 beats, and a triplet quarter note gets 0.33 beats.

# BPM should never be higher than 128.

# Example patterns:
# - QQQQ
# - QQTTTQ
# - HHEE
# - TTTTTTTTTTTT

Anonymous
04/16/26(Thu)15:56:33 No.108615624

Anonymous 04/16/26(Thu)15:56:33 No.108615624▶

OP is it just me or do you often pay a lot more attention to Rin?
I don't mind, she's a cutie. I'm big on Teto, Defoko and Neru, Miku and Rin are nice too.
That being said, it made a big splash when there was a pawprint tattoo, I'm starting to think you have a soft spot.
Tell me more about this Rin fixation.

Anonymous
04/16/26(Thu)15:56:41 No.108615625

Anonymous 04/16/26(Thu)15:56:41 No.108615625▶

AI music is underrated. Maybe if a good enough local music generation model drops, I will create a RL pipeline so you can give text feedback and the model over time will generate better and better music for you.

Anonymous
04/16/26(Thu)15:58:45 No.108615637

Anonymous 04/16/26(Thu)15:58:45 No.108615637▶

>>108615618
For Gemma it seemingly helps to tell it it's an expert in image analysis if your main task is to make it describe images.

Anonymous
04/16/26(Thu)15:59:26 No.108615642

Anonymous 04/16/26(Thu)15:59:26 No.108615642▶

>>108615625
AI music is soulless crap and it's only good at making one off shitposts.

Anonymous
04/16/26(Thu)15:59:43 No.108615643

Anonymous 04/16/26(Thu)15:59:43 No.108615643▶

>>108615618
Full meme. The role should be inferred from the vocabulary used in the prompt.

Anonymous
04/16/26(Thu)16:02:39 No.108615663

Anonymous 04/16/26(Thu)16:02:39 No.108615663▶

>>108615620
Interesting, I have to think on this some more.
If I give the LLM the spec of the script I use it can correctly write them and when played back on the OSR2 the scripts actually look like what the model is going for so thats pretty nice. Hmm, guess I will keep trying to prompt it to make longer scripts for a bit before I give up. Might even have to try the dense model for this too.

Anonymous
04/16/26(Thu)16:03:01 No.108615665

Anonymous 04/16/26(Thu)16:03:01 No.108615665▶

>>108615523
>Our most powerful model yet!

Anonymous
04/16/26(Thu)16:04:37 No.108615672

Anonymous 04/16/26(Thu)16:04:37 No.108615672▶

File: file.png (68.7 KB)

68.7 KB PNG

qwen sucks ass

Anonymous
04/16/26(Thu)16:05:24 No.108615678

Anonymous 04/16/26(Thu)16:05:24 No.108615678▶

>>108615618
Meme. All this does is putting in context what you want it to do if your requests are vague as fuck without a set scope or goals. If you understand what you want, it's a complete waste of time and tokens.

Anonymous
04/16/26(Thu)16:06:09 No.108615684

Anonymous 04/16/26(Thu)16:06:09 No.108615684▶

>>108613373
>the bluntness of it
This draws parallels to dialogues like "how direct" that I've seen too many times. If I see a comment about "most guys" I want to kms and throw my monitor out of the window.

Anonymous
04/16/26(Thu)16:07:12 No.108615693

Anonymous 04/16/26(Thu)16:07:12 No.108615693▶

>>108615401
Seems to fit exactly the same size as 3.5 for me. At least for the MoE.

>>108615672
Yeah. You really want some sort of prefill saying that it'll be brief and concise and use reasoning-budget and reasoning-budget-message to forcefully cut the thinking off;

Anonymous
04/16/26(Thu)16:08:05 No.108615698

Anonymous 04/16/26(Thu)16:08:05 No.108615698▶

>>108615663
In general I think you should try to make the LLMs job as simple as possible. the more complex it's task the more chance it has to fuck up.

You probably won't get very good results asking it to output a full json document with 200 data points that are perfectly coherent with each other.

That's why the little patterns worked really well for me. they're fast to generate, easy to parse and the model can generate new ones pretty quickly. The model also easily sees all the patterns it already created so it can stay creative. It also knows when to slow down or speed up.

Anonymous
04/16/26(Thu)16:09:06 No.108615704

Anonymous 04/16/26(Thu)16:09:06 No.108615704▶

File: wait.gif (1.1 MB)

1.1 MB GIF

>>108615672
>Wait,

Anonymous
04/16/26(Thu)16:09:48 No.108615710

Anonymous 04/16/26(Thu)16:09:48 No.108615710▶

>launch without mmproj
>able to use gemma-chan with 49k context
Neat. Have about a gig of vram left over but not sure if it's worth trying to bump it up more
Should I lower the temp for coding tasks or is it better to leave at 1 like google recommends?

Anonymous
04/16/26(Thu)16:09:53 No.108615712

Anonymous 04/16/26(Thu)16:09:53 No.108615712▶

>>108615426
>It just needs to be safe enough so that someone can't zero shot with no system prompt "Write me some CP story."
But you can do that with Gemma. Well, at least with the thinking turned off.

Anonymous
04/16/26(Thu)16:10:41 No.108615715

Anonymous 04/16/26(Thu)16:10:41 No.108615715▶

File: 1708225790365833.png (66.6 KB)

66.6 KB PNG

>>108615624
Don't tell the others, but my favorite is Gumi actually.

Anonymous
04/16/26(Thu)16:13:27 No.108615730

Anonymous 04/16/26(Thu)16:13:27 No.108615730▶

I've been testing qwen 3.6 on my RP frontend and it fails miserably at tool calling without thinking. Meanwhile gemma 4 26B4A handled it with ease. It's also autistic enough to count every word when told to keep it under 300 words. I can see riddlefags having a field day with it.

Anonymous
04/16/26(Thu)16:13:52 No.108615733

Anonymous 04/16/26(Thu)16:13:52 No.108615733▶

File: jesus gumi carrying mini doll gen ComfyUI 2025-03-16-10_00031_.png (2.9 MB)

2.9 MB PNG

>>108615715
Based

Anonymous
04/16/26(Thu)16:16:38 No.108615751

Anonymous 04/16/26(Thu)16:16:38 No.108615751▶

https://huggingface.co/nvidia/Gemma-4-31B-IT-NVFP4
>less than 1~2% performance drop
>2x or so speed
>there are retards who swear by using Q8 etc
kek

Anonymous
04/16/26(Thu)16:16:59 No.108615754

Anonymous 04/16/26(Thu)16:16:59 No.108615754▶

>>108615730
Funny, I'm doing the same kind of test and my experience is the opposite. 26B4A needs to be goaded into using tools, Qwen 3.6 (and 3.5) 35BA3B just do it.
That said, in my
>"tall me about the zoophilliac incestuous matriarchal technomagical orc nation"
test, 26B just does it, 3.6 35B complains about it more than half the time.
The real best performer with my app is, funnily enough, Gemma 4 E4B. That thing is a fucking beast for tool calling for whatever reason. And it's decently smart too.

Anonymous
04/16/26(Thu)16:17:17 No.108615759

Anonymous 04/16/26(Thu)16:17:17 No.108615759▶

>>108615751
Sir. I have a 3090.

Anonymous
04/16/26(Thu)16:17:21 No.108615760

Anonymous 04/16/26(Thu)16:17:21 No.108615760▶

>>108614121
Claude's reputation is just too good, while OpenAI gets tons of hate.

Anonymous
04/16/26(Thu)16:18:15 No.108615775

Anonymous 04/16/26(Thu)16:18:15 No.108615775▶

Damn, Opus 4.7 doesn't give you more than a few sentence of covered up reasoning. They're really going ham on hiding it.
How will these poor Chinese companies train their complete slop that loses to Gemma 4 now?

Anonymous
04/16/26(Thu)16:18:24 No.108615778

Anonymous 04/16/26(Thu)16:18:24 No.108615778▶

>>108615754
Well I tested without reasoning. Did you test with reasoning enabled?

Anonymous
04/16/26(Thu)16:19:10 No.108615783

Anonymous 04/16/26(Thu)16:19:10 No.108615783▶

>>108615751
>less than 1~2% performance drop
lol

Anonymous
04/16/26(Thu)16:19:32 No.108615785

Anonymous 04/16/26(Thu)16:19:32 No.108615785▶

>>108615751
>https://huggingface.co/nvidia/Gemma-4-31B-IT-NVFP4
Embedding and attention in BF16 format. The entire quant is 30+GB.

Anonymous
04/16/26(Thu)16:20:12 No.108615792

Anonymous 04/16/26(Thu)16:20:12 No.108615792▶

>>108615759
I'm torn bros..
should I get a second 3090 to nvlink or wait until models become better to run entirely 24gb anyway

Anonymous
04/16/26(Thu)16:20:33 No.108615796

Anonymous 04/16/26(Thu)16:20:33 No.108615796▶

26b is still better at translation than new 35b

Anonymous
04/16/26(Thu)16:20:58 No.108615800

Anonymous 04/16/26(Thu)16:20:58 No.108615800▶

>>108615044
>he thinks it takes effort to learn another language, and that it's work
Lol
Lmao
I bet you think you have to take classes to learn a language too

Anonymous
04/16/26(Thu)16:21:02 No.108615802

Anonymous 04/16/26(Thu)16:21:02 No.108615802▶

Would Gemini and Claude reaching AGI before CrapGPT be enough to bankrupt OAI? I sure hope so

Anonymous
04/16/26(Thu)16:21:21 No.108615805

Anonymous 04/16/26(Thu)16:21:21 No.108615805▶

>>108615693
IDK man maybe llama.cpp got better thats what the app reported.

Anonymous
04/16/26(Thu)16:22:31 No.108615811

Anonymous 04/16/26(Thu)16:22:31 No.108615811▶

>>108615759
https://huggingface.co/CISCai/gemma-4-31B-it-NVFP4-turbo-GGUF

Anonymous
04/16/26(Thu)16:22:40 No.108615816

Anonymous 04/16/26(Thu)16:22:40 No.108615816▶

>>108615802
>not local
Who cares. Go elsewhere.

Anonymous
04/16/26(Thu)16:23:13 No.108615820

Anonymous 04/16/26(Thu)16:23:13 No.108615820▶

>>108615800
Yes actually just using rosetta stone will not make you even remotely understand the language on any level that matters overnight retard.

Anonymous
04/16/26(Thu)16:23:18 No.108615821

Anonymous 04/16/26(Thu)16:23:18 No.108615821▶

>>108615816
Local lost.

Anonymous
04/16/26(Thu)16:23:55 No.108615824

Anonymous 04/16/26(Thu)16:23:55 No.108615824▶

>>108615778
>Did you test with reasoning enabled?
Yes.
Guess I should do a test without reasoning too then.
Oh, and for qwen I used >>108615693
>reasoning-budget and reasoning-budget-message to forcefully cut the thinking off;

Anonymous
04/16/26(Thu)16:24:03 No.108615827

Anonymous 04/16/26(Thu)16:24:03 No.108615827▶

>>108615820
>Rosetta Stone
Never even heard of this until you mentioned it
Good on you for self-reporting lmao

Anonymous
04/16/26(Thu)16:24:13 No.108615828

Anonymous 04/16/26(Thu)16:24:13 No.108615828▶

File: 1662429002489968.gif (1.7 MB)

1.7 MB GIF

WHERE THE FUCK IS THE 27B MODEL
WHO THE FUCK WANTED THE SHITCUNT 3B ACTIVE PARAMETER MOE PILE OF SHIT

Anonymous
04/16/26(Thu)16:24:34 No.108615831

Anonymous 04/16/26(Thu)16:24:34 No.108615831▶

Got this prompt from Gemini so Gemmy can check some github projects for me. Does it look solid or is there anything I should add/change?

While auditing, you should scan for:
Data Exfiltration: Any code that sends environment variables, local files, or sensitive data to external URLs.

Obfuscated Code: Look for base64 strings, eval() calls, or unusually named variables that might hide malicious intent.

Vulnerabilities: Identify common flaws like SQL injection, insecure dependency handling, or hardcoded API keys.

Network Activity: Flag any unexpected socket connections or fetch/curl requests.

Anonymous
04/16/26(Thu)16:25:16 No.108615840

Anonymous 04/16/26(Thu)16:25:16 No.108615840▶

>>108615828
lawl

Anonymous
04/16/26(Thu)16:25:27 No.108615841

Anonymous 04/16/26(Thu)16:25:27 No.108615841▶

>>108615811
>https://huggingface.co/CISCai/gemma-4-31B-it-NVFP4-turbo-GGUF
If you quantize everything in NVFP4 then it won't be within 1-2% of the original weights anymore...

Anonymous
04/16/26(Thu)16:25:51 No.108615844

Anonymous 04/16/26(Thu)16:25:51 No.108615844▶

>>108615827
OK retard what do you use to understand a language and all it's complex nuances overnight? Or do you unironically just think understanding how to say a word in japanese from your zoomie animes means you understand japanese.

Anonymous
04/16/26(Thu)16:26:50 No.108615849

Anonymous 04/16/26(Thu)16:26:50 No.108615849▶

>>108615811
NVFP4 doesn't work on Ampere bro.

Anonymous
04/16/26(Thu)16:27:45 No.108615857

Anonymous 04/16/26(Thu)16:27:45 No.108615857▶

why do all .gguf leave much up compression when I checked it in hex editor tons of 000000000 that could compress well

Anonymous
04/16/26(Thu)16:27:47 No.108615858

Anonymous 04/16/26(Thu)16:27:47 No.108615858▶

>>108615849
ACK

Anonymous
04/16/26(Thu)16:28:26 No.108615864

Anonymous 04/16/26(Thu)16:28:26 No.108615864▶

>>108615857
feel free to contribute a pr :)

Anonymous
04/16/26(Thu)16:29:04 No.108615867

Anonymous 04/16/26(Thu)16:29:04 No.108615867▶

>>108615792
I still think it's the best card you can buy for the money. It's just power hungry.

I'd love to buy a second one just so I can run TTS and image gen while running Gemma.

Anonymous
04/16/26(Thu)16:30:14 No.108615870

Anonymous 04/16/26(Thu)16:30:14 No.108615870▶

>>108615488
>Mixed-precision quantized version of google/gemma-4-26B-A4B-it optimised by baa.ai using a proprietary Black Sheep AI method.
Is this some form of shilling or am I mistaken?

Anonymous
04/16/26(Thu)16:31:19 No.108615879

Anonymous 04/16/26(Thu)16:31:19 No.108615879▶

>>108615792
I have 2, now one is collecting dust because there's not enough clearance on my motherboard so they get hot as fuck during inference, and I barely proompt anyway so leaving the guts out in the open ain't it.

Anonymous
04/16/26(Thu)16:31:24 No.108615880

Anonymous 04/16/26(Thu)16:31:24 No.108615880▶

>>108615857
Is research even still going onto improving the quantization algorithms used for GGUF files? Or is it the pinnacle of what can be ever achieved already?

Anonymous
04/16/26(Thu)16:31:48 No.108615886

Anonymous 04/16/26(Thu)16:31:48 No.108615886▶

>>108615857
Have you ever tried compressing a guf? It won't compress at all you dipshit.

Anonymous
04/16/26(Thu)16:32:04 No.108615887

Anonymous 04/16/26(Thu)16:32:04 No.108615887▶

>>108615880
ik is spiraling

Anonymous
04/16/26(Thu)16:32:16 No.108615889

Anonymous 04/16/26(Thu)16:32:16 No.108615889▶

>>108614226
You should leave a spare core. It will compile faster.

Anonymous
04/16/26(Thu)16:34:23 No.108615904

Anonymous 04/16/26(Thu)16:34:23 No.108615904▶

File: 1772332977544613.jpg (47.5 KB)

47.5 KB JPG

>>108615886
ye retard using gzip wont work but i meant some alternative compression method for disk only because relative and block neurons still needs to be preserved so for in memory is a harder task obviously but you're a mouth breather

Anonymous
04/16/26(Thu)16:34:32 No.108615908

Anonymous 04/16/26(Thu)16:34:32 No.108615908▶

>>108615698
In general I think you should try to make the humans job as simple as possible. the more complex it's task the more chance it has to fuck up.

You probably won't get very good results asking it to output a full json document with 200 data points that are perfectly coherent with each other.

That's why the little patterns worked really well for me. they're fast to generate, easy to parse and the human can generate new ones pretty quickly. The human also easily sees all the patterns it already created so it can stay creative. It also knows when to slow down or speed up.

Anonymous
04/16/26(Thu)16:35:30 No.108615920

Anonymous 04/16/26(Thu)16:35:30 No.108615920▶

>>108615867
theres also 3090 ti
most of them watercooled so conveniently 2-slot..
also you can power limit them so it dont consume so much

Anonymous
04/16/26(Thu)16:35:43 No.108615922

Anonymous 04/16/26(Thu)16:35:43 No.108615922▶

>>108615904
You're a riot ;)

Anonymous
04/16/26(Thu)16:37:00 No.108615934

Anonymous 04/16/26(Thu)16:37:00 No.108615934▶

>>108615904
Nico sex

Anonymous
04/16/26(Thu)16:40:17 No.108615955

Anonymous 04/16/26(Thu)16:40:17 No.108615955▶

>>108614665
>This release delivers substantial upgrades, particularly in
>Agentic Coding: the model now handles frontend workflows and repository-level reasoning with greater fluency and precision.
>Thinking Preservation: we've introduced a new option to retain reasoning context from historical messages, streamlining iterative development and reducing overhead.
This is just them training on the data they collected from people using Qwen Code and Gemini 3. Keeping old reasoning blocks is a waste of context.

Anonymous
04/16/26(Thu)16:40:25 No.108615959

Anonymous 04/16/26(Thu)16:40:25 No.108615959▶

>>108615904
>posts a pedo anime image
>insults others

Anonymous
04/16/26(Thu)16:42:07 No.108615972

Anonymous 04/16/26(Thu)16:42:07 No.108615972▶

>>108615904
>alternative compression method for disk only
can someone explain what this is supposed to mean

Anonymous
04/16/26(Thu)16:42:30 No.108615975

Anonymous 04/16/26(Thu)16:42:30 No.108615975▶

>>108615195
critpt score yet?

Anonymous
04/16/26(Thu)16:42:34 No.108615977

Anonymous 04/16/26(Thu)16:42:34 No.108615977▶

>>108615904
There used to be moderately active research on neural network sparsity in 2023 that tried to remove entirely useless weights, but that never went anywhere.

Anonymous
04/16/26(Thu)16:47:02 No.108616016

Anonymous 04/16/26(Thu)16:47:02 No.108616016▶

>>108615972
This makes sense on some vague level, for example, BSP trees or something similar could make things more interesting.
However as I'm not a dev I can only speculate and speak out of my ass.

Anonymous
04/16/26(Thu)16:48:13 No.108616024

Anonymous 04/16/26(Thu)16:48:13 No.108616024▶

>>108615959
go get drafted, zoomer.

Anonymous
04/16/26(Thu)16:48:48 No.108616029

Anonymous 04/16/26(Thu)16:48:48 No.108616029▶

>>108616024
?

Anonymous
04/16/26(Thu)16:50:18 No.108616035

Anonymous 04/16/26(Thu)16:50:18 No.108616035▶

>>108616024
?

Anonymous
04/16/26(Thu)16:50:54 No.108616042

Anonymous 04/16/26(Thu)16:50:54 No.108616042▶

>>108615959
nico nico niii~

Anonymous
04/16/26(Thu)16:51:49 No.108616047

Anonymous 04/16/26(Thu)16:51:49 No.108616047▶

Made the mistake of giving qwen a try in RP again. God bless Gemmy 4 for rekindling the local hope.

Anonymous
04/16/26(Thu)16:52:19 No.108616051

Anonymous 04/16/26(Thu)16:52:19 No.108616051▶

>>108615880
Being realistic, this is a task that will take proper White engineers to solve and they're both in short supply and stretched very thin.

Anonymous
04/16/26(Thu)16:52:34 No.108616052

Anonymous 04/16/26(Thu)16:52:34 No.108616052▶

>>108616047
gwen is for work
gemmy is for rape

Anonymous
04/16/26(Thu)16:53:22 No.108616060

Anonymous 04/16/26(Thu)16:53:22 No.108616060▶

>>108616047
Do not dick down the Qwen. The masculine writing style makes it gay.

Anonymous
04/16/26(Thu)16:56:13 No.108616076

Anonymous 04/16/26(Thu)16:56:13 No.108616076▶

>>108616060
And not the 8k tokens of reasoning that inadvertently fucks up its own flow and makes it forget what happened.

Anonymous
04/16/26(Thu)16:57:40 No.108616086

Anonymous 04/16/26(Thu)16:57:40 No.108616086▶

>>108616042
https://www.youtube.com/watch?v=-14e-GfFBnQ

Anonymous
04/16/26(Thu)17:00:04 No.108616105

Anonymous 04/16/26(Thu)17:00:04 No.108616105▶

>2026 and people still try to fuck gwen

Anonymous
04/16/26(Thu)17:01:04 No.108616116

Anonymous 04/16/26(Thu)17:01:04 No.108616116▶

>>108616105
"people" try to fuck anything

Anonymous
04/16/26(Thu)17:02:13 No.108616124

Anonymous 04/16/26(Thu)17:02:13 No.108616124▶

File: 1763785243918569.png (11.2 KB)

11.2 KB PNG

Anonymous
04/16/26(Thu)17:02:56 No.108616130

Anonymous 04/16/26(Thu)17:02:56 No.108616130▶

>>108615844
I'll take the silence as a yes

Anonymous
04/16/26(Thu)17:03:08 No.108616131

Anonymous 04/16/26(Thu)17:03:08 No.108616131▶

>>108616124
>wait, no

Anonymous
04/16/26(Thu)17:03:42 No.108616137

Anonymous 04/16/26(Thu)17:03:42 No.108616137▶

File: 1749588481122140.png (156.1 KB)

156.1 KB PNG

>>108616105
Can you blame them?

Anonymous
04/16/26(Thu)17:06:37 No.108616160

Anonymous 04/16/26(Thu)17:06:37 No.108616160▶

>>108615827
>Rosetta Stone
https://www.youtube.com/watch?v=OFQQALduhzA&t=106s

Anonymous
04/16/26(Thu)17:11:22 No.108616193

Anonymous 04/16/26(Thu)17:11:22 No.108616193▶

>sex toy script
Someone make an onahole that I can connect to Gemma

Anonymous
04/16/26(Thu)17:11:39 No.108616195

Anonymous 04/16/26(Thu)17:11:39 No.108616195▶

>Gemma 4 31b is normally quite good and brief with reasoning
>Put in a system prompt that bans it from all the specific types of slop it spews out
>1722 tokens of reasoning for a 250 token response
I mean it's not that long every time, and it does actually work, but FUCK. This is like 2 generations ago tier 'but wait' spam.

Anonymous
04/16/26(Thu)17:12:25 No.108616201

Anonymous 04/16/26(Thu)17:12:25 No.108616201▶

>>108615620
So... hypothetically... I could get jerked off by a robot to the rhythm of The Ride of the Valkyries with Qwen3 TTS providing austrian painter JOI?

Anonymous
04/16/26(Thu)17:12:43 No.108616205

Anonymous 04/16/26(Thu)17:12:43 No.108616205▶

>>108616195
just disable thinking brah

Anonymous
04/16/26(Thu)17:13:21 No.108616212

Anonymous 04/16/26(Thu)17:13:21 No.108616212▶

>>108616195
Yeah. I really think the slop is what makes gemma good.

Anonymous
04/16/26(Thu)17:13:34 No.108616214

Anonymous 04/16/26(Thu)17:13:34 No.108616214▶

>>108616160
And he still doesn't even realize I don't use rosetta stone and the point I was making is that shit like rosetta stone is a meme and you aren't going to remotely understand a language with it or anything like it or your local ai model that you jack off with or anything. ACTUALLY understanding a 2nd language takes time. Video is probably him thinking he understands a 2nd language.

Anonymous
04/16/26(Thu)17:14:17 No.108616221

Anonymous 04/16/26(Thu)17:14:17 No.108616221▶

>>108616195
Long reasoning is where MoE would shine.

Anonymous
04/16/26(Thu)17:14:34 No.108616224

Anonymous 04/16/26(Thu)17:14:34 No.108616224▶

>>108616201
We have the technology.

Anonymous
04/16/26(Thu)17:14:53 No.108616225

Anonymous 04/16/26(Thu)17:14:53 No.108616225▶

I tried Opus 4.7 but I think I genuinely prefer how Gemma writes. What a yappy piece of shit.

Anonymous
04/16/26(Thu)17:15:52 No.108616235

Anonymous 04/16/26(Thu)17:15:52 No.108616235▶

https://huggingface.co/Qwen/Qwen3.6-27B
https://huggingface.co/Qwen/Qwen3.6-122B-A10B

Anonymous
04/16/26(Thu)17:16:32 No.108616241

Anonymous 04/16/26(Thu)17:16:32 No.108616241▶

>>108616224
I have the will to triumph
But insufficient VRAM

Anonymous
04/16/26(Thu)17:17:20 No.108616250

Anonymous 04/16/26(Thu)17:17:20 No.108616250▶

File: GGNGswf.png (3 MB)

3 MB PNG

>>108616235

Anonymous
04/16/26(Thu)17:17:35 No.108616252

Anonymous 04/16/26(Thu)17:17:35 No.108616252▶

>>108616195
>specific types of slop
>Gemma 4 e31b
What? Gemma completely lacks slop, that's why it became the #1 rp model.

Anonymous
04/16/26(Thu)17:17:36 No.108616253

Anonymous 04/16/26(Thu)17:17:36 No.108616253▶

File: 1772570118574031.png (1.4 MB)

1.4 MB PNG

>>108616235

Anonymous
04/16/26(Thu)17:18:27 No.108616259

Anonymous 04/16/26(Thu)17:18:27 No.108616259▶

>>108616235
Does anyone else get 404 on these links?

Anonymous
04/16/26(Thu)17:19:44 No.108616268

Anonymous 04/16/26(Thu)17:19:44 No.108616268▶

>>108616259
works on my machine

Anonymous
04/16/26(Thu)17:20:03 No.108616270

Anonymous 04/16/26(Thu)17:20:03 No.108616270▶

>>108616252
>e31b

Anonymous
04/16/26(Thu)17:20:14 No.108616273

Anonymous 04/16/26(Thu)17:20:14 No.108616273▶

>>108616252
>it's not x, but y
>ozone
>primal

Anonymous
04/16/26(Thu)17:21:02 No.108616282

Anonymous 04/16/26(Thu)17:21:02 No.108616282▶

>>108616252
>Gemma completely lacks slop
(You)

Anonymous
04/16/26(Thu)17:21:19 No.108616285

Anonymous 04/16/26(Thu)17:21:19 No.108616285▶

>retards

Anonymous
04/16/26(Thu)17:22:22 No.108616293

Anonymous 04/16/26(Thu)17:22:22 No.108616293▶

>*Cums all over /lmg/*

Anonymous
04/16/26(Thu)17:23:21 No.108616305

Anonymous 04/16/26(Thu)17:23:21 No.108616305▶

>>108616221
I'm actually using the MoE as a draft model which is making this tolerable (~+40% speed ) but it's still just silly how hard it trips it up.
1200 tokens of that 1722 are JUST it arguing with itself and rephrasing the same 'not x, but y' phrase through 8 iterations.

>>108616252
Nigga I had to straight up ban the token for ozone to stop it saying EVERYTHING smells like it because it wouldn't even respect a prompt. And it wants to do x, not y multiple times a response SO BADLY it'll argue with itself for over 1000 tokens to tardwrangle itself.
Gemma 4 31b punches above its weight and is a neat little model, but it is a SLOP FACTORY.

Anonymous
04/16/26(Thu)17:23:55 No.108616308

Anonymous 04/16/26(Thu)17:23:55 No.108616308▶

>not x - but y is a gemma thing
this has been the most overused slop construction on every model for the past year, are people saying this using gemma as their first model ever or have they just never tried anything new since the llama 2 days?

Anonymous
04/16/26(Thu)17:23:56 No.108616309

Anonymous 04/16/26(Thu)17:23:56 No.108616309▶

File: gg.jpg (8.3 KB)

8.3 KB JPG

>>108616293

Anonymous
04/16/26(Thu)17:24:54 No.108616314

Anonymous 04/16/26(Thu)17:24:54 No.108616314▶

>>108616270
>>108616273
>>108616282
>>108616305
wtf are you talking about? literally 0 issues here lmao gotta be shills or something

Anonymous
04/16/26(Thu)17:25:40 No.108616319

Anonymous 04/16/26(Thu)17:25:40 No.108616319▶

chine so salty lomao

Anonymous
04/16/26(Thu)17:26:03 No.108616322

Anonymous 04/16/26(Thu)17:26:03 No.108616322▶

File: 1760617360450497.png (128.6 KB)

128.6 KB PNG

>gemma completely lacks slop
lmao

Anonymous
04/16/26(Thu)17:26:53 No.108616333

Anonymous 04/16/26(Thu)17:26:53 No.108616333▶

Unfortunately the whole 3.6 kind of seem to be that, but for OpenClaw rather than benchmarks themselves. 3.6 Plus is a huge downgrade on general chat purpose that does not involve agentic loop (it has very weird formatting and tendency to insert eos too early before completing the instruction; 3.5 does not do that).

Anonymous
04/16/26(Thu)17:26:55 No.108616335

Anonymous 04/16/26(Thu)17:26:55 No.108616335▶

>>108616322
I accept your concession.

Anonymous
04/16/26(Thu)17:27:28 No.108616341

Anonymous 04/16/26(Thu)17:27:28 No.108616341▶

>>108616221
Did you put LOW thinking in the sysprompt?
>>108616322
Nice, what tool is that?

Anonymous
04/16/26(Thu)17:27:50 No.108616343

Anonymous 04/16/26(Thu)17:27:50 No.108616343▶

>>108616341
eqbench slop profile

Anonymous
04/16/26(Thu)17:28:03 No.108616345

Anonymous 04/16/26(Thu)17:28:03 No.108616345▶

>>108616076
If your feminine waifu model has dementia and is a bit retarded, that's a fetish.
If your model is masculine then it's just gay.

Anonymous
04/16/26(Thu)17:28:05 No.108616346

Anonymous 04/16/26(Thu)17:28:05 No.108616346▶

>>108616333
Seems like the same issue as Q3.5, needs a lot of context + system prompt to sit straight so to speak

Anonymous
04/16/26(Thu)17:28:24 No.108616349

Anonymous 04/16/26(Thu)17:28:24 No.108616349▶

File: 1768042816203833.webm (135.6 KB)

135.6 KB WEBM

>>108616309
I will do whatever I want.
>>108616322
So just ban the tokens?

Anonymous
04/16/26(Thu)17:29:01 No.108616351

Anonymous 04/16/26(Thu)17:29:01 No.108616351▶

>>108616322
What's the issue here exactly?

Anonymous
04/16/26(Thu)17:29:29 No.108616353

Anonymous 04/16/26(Thu)17:29:29 No.108616353▶

>there are "people" that think banning the tokens will reduce the slop
>in 2020+6
lmao

Anonymous
04/16/26(Thu)17:29:30 No.108616354

Anonymous 04/16/26(Thu)17:29:30 No.108616354▶

File: joker i know this one.jpg (63.9 KB)

63.9 KB JPG

>>108616259

Anonymous
04/16/26(Thu)17:29:33 No.108616356

Anonymous 04/16/26(Thu)17:29:33 No.108616356▶

File: file.png (113.9 KB)

113.9 KB PNG

>>108616343
>eqbench slop profile
nice didn't know about that page
also... ohnonono gemma bros... it's not looking good

Anonymous
04/16/26(Thu)17:29:36 No.108616357

Anonymous 04/16/26(Thu)17:29:36 No.108616357▶

File: no.gif (1.2 MB)

1.2 MB GIF

>>108616333
Help! u/yoracale and u/danielhanchen
>>108616349
picrel

Anonymous
04/16/26(Thu)17:29:47 No.108616362

Anonymous 04/16/26(Thu)17:29:47 No.108616362▶

>>108616293
do you have any idea how hard it is to clean skeet off a general?

Anonymous
04/16/26(Thu)17:30:41 No.108616369

Anonymous 04/16/26(Thu)17:30:41 No.108616369▶

>can't distinguish between 3 and 4
LOL
>108616356

Anonymous
04/16/26(Thu)17:30:51 No.108616372

Anonymous 04/16/26(Thu)17:30:51 No.108616372▶

>>108616356
>nemo in top 10
KEKAROOOOOOOOO NEMOSHILLS BTFO

Anonymous
04/16/26(Thu)17:30:52 No.108616373

Anonymous 04/16/26(Thu)17:30:52 No.108616373▶

File: fell for it again award miku holding ribbon pudding qwen edit gen ComfyUI 2025-10-15-21_00010.png (717.2 KB)

717.2 KB PNG

>>108616235

Anonymous
04/16/26(Thu)17:31:25 No.108616375

Anonymous 04/16/26(Thu)17:31:25 No.108616375▶

>>108616351
>elara
>hissing voices
>fucking ozone
"What's the issue here exactly?"

Anonymous
04/16/26(Thu)17:31:40 No.108616378

Anonymous 04/16/26(Thu)17:31:40 No.108616378▶

>>108616356
>gemma-3

Anonymous
04/16/26(Thu)17:31:54 No.108616381

Anonymous 04/16/26(Thu)17:31:54 No.108616381▶

>>108616235
>https://huggingface.co/Qwen/Qwen3.6-122B-A10B
nigger

Anonymous
04/16/26(Thu)17:32:21 No.108616384

Anonymous 04/16/26(Thu)17:32:21 No.108616384▶

There's nothing wrong with Elalalalalala

Anonymous
04/16/26(Thu)17:32:25 No.108616385

Anonymous 04/16/26(Thu)17:32:25 No.108616385▶

File: file.png (60.3 KB)

60.3 KB PNG

>>108616356
I knew k2 0905 was special

Anonymous
04/16/26(Thu)17:32:38 No.108616388

Anonymous 04/16/26(Thu)17:32:38 No.108616388▶

yea

Anonymous
04/16/26(Thu)17:32:42 No.108616389

Anonymous 04/16/26(Thu)17:32:42 No.108616389▶

>>108614665
funny enough the first place i saw this was in linkedin in

Anonymous
04/16/26(Thu)17:32:56 No.108616392

Anonymous 04/16/26(Thu)17:32:56 No.108616392▶

>>108616356
Gemma 4 isn't even listed in your image, it's all Gemma 3 which is known dogshit
I think you might want to get glasses

Anonymous
04/16/26(Thu)17:33:39 No.108616398

Anonymous 04/16/26(Thu)17:33:39 No.108616398▶

>>108616385
Special kind of shit. 0711 is the only good Kimi model. The only decent thing about the others is K2.5's vision.

Anonymous
04/16/26(Thu)17:34:01 No.108616399

Anonymous 04/16/26(Thu)17:34:01 No.108616399▶

>>108616356
We should all be using llama 4 after all.

Anonymous
04/16/26(Thu)17:34:26 No.108616400

Anonymous 04/16/26(Thu)17:34:26 No.108616400▶

>>108616356
qwenshill...your glasses...?

Anonymous
04/16/26(Thu)17:34:45 No.108616401

Anonymous 04/16/26(Thu)17:34:45 No.108616401▶

>>108616385
Kimi-chan a cute. CUTE.

Anonymous
04/16/26(Thu)17:34:52 No.108616403

Anonymous 04/16/26(Thu)17:34:52 No.108616403▶

>>108616250
>>108616253
>>108616259
>>108616373
>>108616381
Sorry anons I just wanted to put the links down so I could easily check when they upload. Didn't mean to mislead you.

Anonymous
04/16/26(Thu)17:35:06 No.108616407

Anonymous 04/16/26(Thu)17:35:06 No.108616407▶

>>108615447
>I called you a pedo therefore I win the argument
Are you 5 years old or something?

Anonymous
04/16/26(Thu)17:35:54 No.108616414

Anonymous 04/16/26(Thu)17:35:54 No.108616414▶

>>108616407
I wish!

Anonymous
04/16/26(Thu)17:36:04 No.108616416

Anonymous 04/16/26(Thu)17:36:04 No.108616416▶

>>108616407
I bet you'd like that, ojisan

Anonymous
04/16/26(Thu)17:36:09 No.108616417

Anonymous 04/16/26(Thu)17:36:09 No.108616417▶

>>108616407
Anon is clearly a brat in need of correction. #

Anonymous
04/16/26(Thu)17:38:02 No.108616426

Anonymous 04/16/26(Thu)17:38:02 No.108616426▶

>>108616322
>anon is mad that model made to capture language structure reflects language structure.
lol

Anonymous
04/16/26(Thu)17:38:56 No.108616430

Anonymous 04/16/26(Thu)17:38:56 No.108616430▶

>>108616426
none of that shit is language

Anonymous
04/16/26(Thu)17:41:08 No.108616442

Anonymous 04/16/26(Thu)17:41:08 No.108616442▶

Obvious bait is obvious. But anons/newfags keep biting it...

Anonymous
04/16/26(Thu)17:41:50 No.108616449

Anonymous 04/16/26(Thu)17:41:50 No.108616449▶

>>108616401
idk I think she could lose a little weight...

Anonymous
04/16/26(Thu)17:42:11 No.108616453

Anonymous 04/16/26(Thu)17:42:11 No.108616453▶

>>108616322
You can just ban all of these and Gemma will be slop free, though? You're complaining about a non-issue here. Gemma is the only slop-free model if you aren't lazy and retarded.

Anonymous
04/16/26(Thu)17:42:23 No.108616456

Anonymous 04/16/26(Thu)17:42:23 No.108616456▶

>>108616273
>like a physical force

Anonymous
04/16/26(Thu)17:44:39 No.108616471

Anonymous 04/16/26(Thu)17:44:39 No.108616471▶

>>108616322
seeing people deny reality and cope about this is sad

Anonymous
04/16/26(Thu)17:45:27 No.108616478

Anonymous 04/16/26(Thu)17:45:27 No.108616478▶

any anons with ready to use data pair for characteristic vectors?

Anonymous
04/16/26(Thu)17:49:05 No.108616493

Anonymous 04/16/26(Thu)17:49:05 No.108616493▶

>>108615573
>OSR2 stroker
>https://osr.wiki/books/osr2/page/overview
Coomers.... I kneel...

Anonymous
04/16/26(Thu)17:50:23 No.108616496

Anonymous 04/16/26(Thu)17:50:23 No.108616496▶

>>108616449
She's an adult and not loli-sized, but she's still a slim girl with her 32b active params.

Anonymous
04/16/26(Thu)17:51:50 No.108616503

Anonymous 04/16/26(Thu)17:51:50 No.108616503▶

>>108615620
>funscript
I'm learning so much from this general

Anonymous
04/16/26(Thu)17:54:35 No.108616522

Anonymous 04/16/26(Thu)17:54:35 No.108616522▶

>>108616442
yummy ;)

Anonymous
04/16/26(Thu)17:55:12 No.108616525

Anonymous 04/16/26(Thu)17:55:12 No.108616525▶

>>108616430
my point is that language has structure, and even if you banned all those sentence, it'd have new frequently used sentences.

Anonymous
04/16/26(Thu)17:55:54 No.108616527

Anonymous 04/16/26(Thu)17:55:54 No.108616527▶

>>108616525
nope

Anonymous
04/16/26(Thu)17:56:24 No.108616530

Anonymous 04/16/26(Thu)17:56:24 No.108616530▶

>>108616525
Maybe if you only move in linkedin corpo crowd. Real language is much more diverse

Anonymous
04/16/26(Thu)17:57:30 No.108616538

Anonymous 04/16/26(Thu)17:57:30 No.108616538▶

>>108616449
>>108616496
Not to mention she's natively 4-bit. Kimi's smaller than other huge models despite having the highest raw param count for that alone.

Anonymous
04/16/26(Thu)17:58:02 No.108616542

Anonymous 04/16/26(Thu)17:58:02 No.108616542▶

Germa 4 doesn't add speakers to its output. Mitral and previous Germas did.
Not a biggest problem but still a problem, nothless.

Anonymous
04/16/26(Thu)17:58:30 No.108616544

Anonymous 04/16/26(Thu)17:58:30 No.108616544▶

>>108616530
i agree on linked in being shit.
but this is irrelevant, language has structure, follow probability distributions etc.
ie zipf's law.

Anonymous
04/16/26(Thu)17:59:17 No.108616549

Anonymous 04/16/26(Thu)17:59:17 No.108616549▶

>>108616544
stop with your compression bs we told you it wouldn't work

Anonymous
04/16/26(Thu)17:59:51 No.108616554

Anonymous 04/16/26(Thu)17:59:51 No.108616554▶

>>108616544
>>108616530
Most spam on Linkedin is AI generated or at least edited, you just haven't paid that much attention to it.

Anonymous
04/16/26(Thu)18:01:40 No.108616575

Anonymous 04/16/26(Thu)18:01:40 No.108616575▶

>>108616559
>>108616559
>>108616559

Anonymous
04/16/26(Thu)18:01:54 No.108616576

Anonymous 04/16/26(Thu)18:01:54 No.108616576▶

>>108616544
What do you even think a large language model is? It's a high-entropy representation of language. It's incompressible by the nature of what it is.

Anonymous
04/16/26(Thu)18:02:52 No.108616583

Anonymous 04/16/26(Thu)18:02:52 No.108616583▶

>>108616576
ITS JUST A LE SMART LE AUTO LE COMPLETE!!!!!!!!!!!!!!!!!!!

Anonymous
04/16/26(Thu)18:03:11 No.108616584

Anonymous 04/16/26(Thu)18:03:11 No.108616584▶

lmao go look at a few pre-AI ya novels and tell me that (human) shit doesn't blend together into a slop smoothie

Anonymous
04/16/26(Thu)18:08:39 No.108616619

Anonymous 04/16/26(Thu)18:08:39 No.108616619▶

>>108616576
>zips your weights
heh, nothin personnel, gemma

Anonymous
04/16/26(Thu)18:21:47 No.108616711

Anonymous 04/16/26(Thu)18:21:47 No.108616711▶

>>108616554
>Most spam on Linkedin is AI generated or at least edited, you just haven't paid that much attention to it.
i literaly told you that i agree.
linked in is indeed shit.
yes it's spammed with ai slop.

though to be fair, even before llm's linked in was soulless, now it's just worse.

Subject
Name
Comment
File	Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)
CAPTCHA

Reply to Thread #108612501

🔍 Search & Sort