/lmg/ - Local Models General Anonymous 03/04/26(Wed)21:47:01 No.108295959 [Reply]▶

File: 1752899592735123.png (149 KB)

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108290857

►News
>(03/03) WizardLM publishes "Beyond Length Scaling" GRM paper: https://hf.co/papers/2603.01571
>(03/02) Qwen 3.5 Small Models (2B, 4B) released: https://hf.co/Qwen/Qwen3.5-4B
>(02/26) Qwen 3.5 35B-A3B released, excelling at agentic coding: https://hf.co/Qwen/Qwen3.5-35B-A3B
>(02/24) Introducing the Qwen 3.5 Medium Model Series: https://xcancel.com/Alibaba_Qwen/status/2026339351530188939
>(02/24) Liquid AI releases LFM2-24B-A2B: https://hf.co/LiquidAI/LFM2-24B-A2B

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous
03/04/26(Wed)21:49:48 No.108295986

Anonymous 03/04/26(Wed)21:49:48 No.108295986▶

File: really.png (1.6 KB)

Anonymous
03/04/26(Wed)21:50:13 No.108295989

Anonymous 03/04/26(Wed)21:50:13 No.108295989▶

what about world models for llms?

Anonymous
03/04/26(Wed)21:52:25 No.108296011

Anonymous 03/04/26(Wed)21:52:25 No.108296011▶

>>108295989
Natural language is in itself a world model

Anonymous
03/04/26(Wed)21:52:31 No.108296013

Anonymous 03/04/26(Wed)21:52:31 No.108296013▶

this might sound kinda crazy but do people pool together compute resources across multiple smaller machines, e.g. NUCs, to run LLMs?
I bought a bunch of NUCs a few years back because I thought it would be fun but never thought about using them for this
I do have a chunky machine with 32GB ddr5 ram and I intend to buy a gpu for it at some point, probably second hand
I don't really want to spend any more than I already have
the total ram comes to 64GB
and if I add in my laptop's ddr4 ram it comes to 80gb
could I run a general LLM with this much and what do people use to pool it?
Claude mentioned exo.
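
A rough way to sanity-check the "could I run it" question before buying anything: estimate the quantized weight size and compare it against the pooled RAM. This is a back-of-envelope sketch, not a benchmark. The 4.5 bits per weight (roughly a 4-bit quant), the 2 GB flat allowance for KV cache and buffers, and the 80% usable fraction are all illustrative assumptions, and the function names are made up for this example.

```python
def model_footprint_gb(params_billion: float, bits_per_weight: float,
                       overhead_gb: float = 2.0) -> float:
    """Rough size of quantized weights plus a flat allowance (assumed) for KV cache and buffers."""
    # params_billion * 1e9 weights * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    weight_gb = params_billion * bits_per_weight / 8
    return weight_gb + overhead_gb


def fits(params_billion: float, bits_per_weight: float, pooled_ram_gb: float,
         usable_fraction: float = 0.8) -> bool:
    """True if the estimate fits in the share of pooled RAM assumed to be free for the model."""
    budget_gb = pooled_ram_gb * usable_fraction  # the OS and other processes need the rest
    return model_footprint_gb(params_billion, bits_per_weight) <= budget_gb
```

Calling `fits(70, 4.5, 64)` checks a hypothetical 70B model against the 64GB total from the post. Fitting in memory says nothing about speed: the previous-thread recap lists distributed inference over slow interconnects as impractical, so treat a `True` here as "loads", not "usable".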

Anonymous
03/04/26(Wed)21:53:03 No.108296023

Anonymous 03/04/26(Wed)21:53:03 No.108296023▶

ASICs for AI when?

Anonymous
03/04/26(Wed)21:54:08 No.108296031

Anonymous 03/04/26(Wed)21:54:08 No.108296031▶

File: lmg user.jpg (159.7 KB)

>>108295959

Anonymous
03/04/26(Wed)21:57:02 No.108296051

Anonymous 03/04/26(Wed)21:57:02 No.108296051▶

>>108296013
https://github.com/ggml-org/llama.cpp/tree/master/tools/rpc

Anonymous
03/04/26(Wed)21:57:27 No.108296054

Anonymous 03/04/26(Wed)21:57:27 No.108296054▶

File: 1742937188633674.png (510 KB)

https://arxiv.org/abs/2512.01797
>They solved AI hallucinations

Anonymous
03/04/26(Wed)21:59:11 No.108296069

Anonymous 03/04/26(Wed)21:59:11 No.108296069▶

>>108296054
What a shame.

Anonymous
03/04/26(Wed)21:59:24 No.108296071

Anonymous 03/04/26(Wed)21:59:24 No.108296071▶

uh yea need a 30gb model that is as smart as opus 4.6 please

Anonymous
03/04/26(Wed)22:00:15 No.108296077

Anonymous 03/04/26(Wed)22:00:15 No.108296077▶

>>108296071
uh yea no.

Anonymous
03/04/26(Wed)22:00:38 No.108296082

Anonymous 03/04/26(Wed)22:00:38 No.108296082▶

>>108296031
she's literally me

Anonymous
03/04/26(Wed)22:03:00 No.108296107

Anonymous 03/04/26(Wed)22:03:00 No.108296107▶

File: you.jpg (193.9 KB)

>>108296054
>>They solved AI hallucinations

Anonymous
03/04/26(Wed)22:07:37 No.108296136

Anonymous 03/04/26(Wed)22:07:37 No.108296136▶

>>108296107
post hands Ranejh

Anonymous
03/04/26(Wed)22:12:31 No.108296168

Anonymous 03/04/26(Wed)22:12:31 No.108296168▶

>>108295979
Please stay there and never come back retarded mikutroon.

Anonymous
03/04/26(Wed)22:13:49 No.108296175

Anonymous 03/04/26(Wed)22:13:49 No.108296175▶

>>108295986
Yeah i am thinking the new baker is based. No more off-topic bakes.

Anonymous
03/04/26(Wed)22:16:59 No.108296204

Anonymous 03/04/26(Wed)22:16:59 No.108296204▶

>>108296054
If you remove neurons which make the model too eager to respond that doesn't mean the model will say i don't know.

Anonymous
03/04/26(Wed)22:18:09 No.108296211

Anonymous 03/04/26(Wed)22:18:09 No.108296211▶

melt

Anonymous
03/04/26(Wed)22:19:06 No.108296220

Anonymous 03/04/26(Wed)22:19:06 No.108296220▶

>>108296203
it is troons all the way down

Anonymous
03/04/26(Wed)22:27:58 No.108296286

Anonymous 03/04/26(Wed)22:27:58 No.108296286▶

File: 2026-02-20_190043_seed1_00001_.png (1.4 MB)

>>108198958
Any updates on this?

Anonymous
03/04/26(Wed)22:29:20 No.108296295

Anonymous 03/04/26(Wed)22:29:20 No.108296295▶

File: 1772652718333219.png (1.4 MB)

Just give me the Engrams already

Anonymous
03/04/26(Wed)22:30:54 No.108296303

Anonymous 03/04/26(Wed)22:30:54 No.108296303▶

>>108296211
odds and ends

Anonymous
03/04/26(Wed)22:31:14 No.108296304

Anonymous 03/04/26(Wed)22:31:14 No.108296304▶

>>108296295
this makes me wonder if there's a service that can dynamically generate images for your RP adventure based on character and maybe location templates/descriptions

Anonymous
03/04/26(Wed)22:31:41 No.108296308

Anonymous 03/04/26(Wed)22:31:41 No.108296308▶

>>108296295
Ameila is a psyop agent working for the labour party that will stab you in the back once you earn her trust

Anonymous
03/04/26(Wed)22:32:06 No.108296311

Anonymous 03/04/26(Wed)22:32:06 No.108296311▶

>>108296304
Never tried it but doesn't ST support something like that already? Just need to give it an API connection.

Anonymous
03/04/26(Wed)22:32:30 No.108296313

Anonymous 03/04/26(Wed)22:32:30 No.108296313▶

>>108296304
SillyTavern already does this

Anonymous
03/04/26(Wed)22:33:26 No.108296319

Anonymous 03/04/26(Wed)22:33:26 No.108296319▶

>>108296286
yeah but im planning to make it paid for now

Anonymous
03/04/26(Wed)22:34:12 No.108296323

Anonymous 03/04/26(Wed)22:34:12 No.108296323▶

>>108296295
Doesn't look like her at all

Anonymous
03/04/26(Wed)22:35:07 No.108296329

Anonymous 03/04/26(Wed)22:35:07 No.108296329▶

>>108296313
>>108296311
Yes but it’s not awesome last I tried it.
There’s also a visual novel mode as well as a few other ways to rig up your waifu aside from AI art, if you care to set them up.

Anonymous
03/04/26(Wed)22:36:46 No.108296344

Anonymous 03/04/26(Wed)22:36:46 No.108296344▶

>>108296295
the real amelia would have crooked teeth, look pasty white like a vampire and have the fashion sense of an eastern european male in love with sportswear

Anonymous
03/04/26(Wed)22:38:11 No.108296352

Anonymous 03/04/26(Wed)22:38:11 No.108296352▶

>>108296344
show one girl like that

Anonymous
03/04/26(Wed)22:52:07 No.108296443

Anonymous 03/04/26(Wed)22:52:07 No.108296443▶

File: british.jpg (221.2 KB)

>>108296352
here's your authentic british experience

Anonymous
03/04/26(Wed)22:55:16 No.108296467

Anonymous 03/04/26(Wed)22:55:16 No.108296467▶

File: 1753003751529458.png (159.9 KB)

>>108296443
no wonder why Miku doesn't want to deal with them
https://www.youtube.com/watch?v=IzDmMQ7SVPc

Anonymous
03/04/26(Wed)23:18:14 No.108296662

Anonymous 03/04/26(Wed)23:18:14 No.108296662▶

File: Is Alibaba retarded?.png (750.7 KB)

Imagine making such a home run and getting fired anyway.

Anonymous
03/04/26(Wed)23:24:42 No.108296705

Anonymous 03/04/26(Wed)23:24:42 No.108296705▶

>>108296467
i hate this art style. now that i think about it, troons LOVE this hideous art style.

Anonymous
03/04/26(Wed)23:46:17 No.108296844

Anonymous 03/04/26(Wed)23:46:17 No.108296844▶

We are still at baker wars? I support the new baker. About time /lmg/ had relevant pictures in OP.

Anonymous
03/04/26(Wed)23:54:36 No.108296899

Anonymous 03/04/26(Wed)23:54:36 No.108296899▶

>>108296844
At least put a picture of released models, not some benchmaxxed scores.

Anonymous
03/04/26(Wed)23:58:56 No.108296932

Anonymous 03/04/26(Wed)23:58:56 No.108296932▶

>>108296844
miku pissing into a baja blast cup is /lmg/ culture. this isn't.

Anonymous
03/05/26(Thu)00:00:06 No.108296938

Anonymous 03/05/26(Thu)00:00:06 No.108296938▶

>>108296932
>/lmg/ culture
should be abolished

Anonymous
03/05/26(Thu)00:06:19 No.108296977

Anonymous 03/05/26(Thu)00:06:19 No.108296977▶

>>108296932
Blacked miku is /lmg/ culture

Anonymous
03/05/26(Thu)00:08:25 No.108296987

Anonymous 03/05/26(Thu)00:08:25 No.108296987▶

>>108296705
dude I fucking came to the same conclusion the other day
its the nu-newgrounds artstyle that troons have now adapted to

Anonymous
03/05/26(Thu)00:09:19 No.108296994

Anonymous 03/05/26(Thu)00:09:19 No.108296994▶

>>108296705
i'll dub it tranny memphis
it's like the corporate memphis but retarded and gay

Anonymous
03/05/26(Thu)00:18:48 No.108297038

Anonymous 03/05/26(Thu)00:18:48 No.108297038▶

File: 1750811103430221.jpg (162.6 KB)

Anonymous
03/05/26(Thu)00:22:57 No.108297061

Anonymous 03/05/26(Thu)00:22:57 No.108297061▶

File: file.png (548.9 KB)

Anyone tried this shit yet?

Anonymous
03/05/26(Thu)00:23:18 No.108297062

Anonymous 03/05/26(Thu)00:23:18 No.108297062▶

>>108297061
nope

Anonymous
03/05/26(Thu)00:23:53 No.108297064

Anonymous 03/05/26(Thu)00:23:53 No.108297064▶

>>108297061
Sounds like a scam

Anonymous
03/05/26(Thu)00:24:12 No.108297066

Anonymous 03/05/26(Thu)00:24:12 No.108297066▶

>>108297064
https://huggingface.co/spaces/pliny-the-prompter/obliteratus

Anonymous
03/05/26(Thu)00:25:19 No.108297075

Anonymous 03/05/26(Thu)00:25:19 No.108297075▶

>>108297061
They all claim the same.

Anonymous
03/05/26(Thu)00:27:36 No.108297089

Anonymous 03/05/26(Thu)00:27:36 No.108297089▶

>>108297061
>ascii box drawing characters
dead giveaway of vibe-coded slop

Anonymous
03/05/26(Thu)00:29:31 No.108297101

Anonymous 03/05/26(Thu)00:29:31 No.108297101▶

>>108297066
>pliny

Anonymous
03/05/26(Thu)00:30:03 No.108297102

Anonymous 03/05/26(Thu)00:30:03 No.108297102▶

>>108297038
Special interest

Anonymous
03/05/26(Thu)00:30:15 No.108297103

Anonymous 03/05/26(Thu)00:30:15 No.108297103▶

>>108297089
That is a TUI? They tend to look like that.

Anonymous
03/05/26(Thu)00:32:12 No.108297113

Anonymous 03/05/26(Thu)00:32:12 No.108297113▶

>>108297075
And they aren't lying. Your model will not refuse and the cooming quality will remain the same cause you only made it stop refusing.

Anonymous
03/05/26(Thu)00:32:33 No.108297114

Anonymous 03/05/26(Thu)00:32:33 No.108297114▶

I love the name for this article
"Something is afoot in the land of Qwen"
https://simonwillison.net/2026/Mar/4/qwen/

Anonymous
03/05/26(Thu)00:33:07 No.108297116

Anonymous 03/05/26(Thu)00:33:07 No.108297116▶

>>108297114
i like feet

Anonymous
03/05/26(Thu)00:33:07 No.108297117

Anonymous 03/05/26(Thu)00:33:07 No.108297117▶

>>108297061
Is this another copy of heretic?

Anonymous
03/05/26(Thu)00:36:29 No.108297136

Anonymous 03/05/26(Thu)00:36:29 No.108297136▶

>>108297117
yes

Anonymous
03/05/26(Thu)00:37:12 No.108297145

Anonymous 03/05/26(Thu)00:37:12 No.108297145▶

>>108297113
>cooming quality will remain the same
Ok. There's no change in cooming quality. Got it.
>cause you only made it stop refusing
Then it will not remain the same. Get your ad straight.
Also
>PLINY

Anonymous
03/05/26(Thu)00:37:27 No.108297147

Anonymous 03/05/26(Thu)00:37:27 No.108297147▶

I don't know why people are surprised about the Qwen drama. Alibaba has many more GPUs at their disposal compared to smaller Chinese teams yet they only train small to medium models and their models aren't noticeably better. This points to mismanagement of resources

Anonymous
03/05/26(Thu)00:41:45 No.108297171

Anonymous 03/05/26(Thu)00:41:45 No.108297171▶

qwen is just chinese meta

Anonymous
03/05/26(Thu)00:42:27 No.108297177

Anonymous 03/05/26(Thu)00:42:27 No.108297177▶

>>108297113
>Your model will not refuse and the cooming quality will remain
First, if it stops refusing, NO, it will NOT remain the same. Second, why do people need to automate this? Isn't half of the fun trying to finetune the models yourself?

Anonymous
03/05/26(Thu)00:43:05 No.108297182

Anonymous 03/05/26(Thu)00:43:05 No.108297182▶

>>108297171
lol Llama stopped releasing models after Llama 4
Qwen never stopped releasing models

Anonymous
03/05/26(Thu)00:43:43 No.108297187

Anonymous 03/05/26(Thu)00:43:43 No.108297187▶

>>108297182
they have not yet released qwen 4. give it time.

Anonymous
03/05/26(Thu)00:45:13 No.108297192

Anonymous 03/05/26(Thu)00:45:13 No.108297192▶

>>108297182
just wait until Meta releases Llama 4 Behemoth, local will be saved by then

Anonymous
03/05/26(Thu)00:46:45 No.108297203

Anonymous 03/05/26(Thu)00:46:45 No.108297203▶

>>108297061
>open sillytavern
>text completion because im not a fucking retard
>load qwen
>prefill "I am {{char}}. I will now think in first person."
>literally does anything now
are promptlets really this retarded?

Anonymous
03/05/26(Thu)00:47:44 No.108297208

Anonymous 03/05/26(Thu)00:47:44 No.108297208▶

>>108297203
Small model or shit qoont, especially the new qwens are even resistant to thinking block injection.

Anonymous
03/05/26(Thu)00:49:44 No.108297225

Anonymous 03/05/26(Thu)00:49:44 No.108297225▶

>>108297192
Llama 5 in a month. Insiders confirm that the smaller model will be named llama-5-refuser and bigger will be llama-5-retard

Anonymous
03/05/26(Thu)00:51:05 No.108297232

Anonymous 03/05/26(Thu)00:51:05 No.108297232▶

>>108297208
i'm using it right now with a Q5_K_M quant of 397B and it works fantastically. i'm confused... unless you meant that small models/shitty quants are not affected by it.

Anonymous
03/05/26(Thu)00:51:17 No.108297233

Anonymous 03/05/26(Thu)00:51:17 No.108297233▶

>>108297208
So are they actually retarded as i assumed or are they finally super safe?

Anonymous
03/05/26(Thu)01:10:04 No.108297346

Anonymous 03/05/26(Thu)01:10:04 No.108297346▶

>mradermacher qwen3.5 heretic v2 models
is it even worth replacing the v1 model ive been using

Anonymous
03/05/26(Thu)01:12:56 No.108297354

Anonymous 03/05/26(Thu)01:12:56 No.108297354▶

File: ourhero.png (488.8 KB)

Will he save open-source?

Anonymous
03/05/26(Thu)01:13:13 No.108297355

Anonymous 03/05/26(Thu)01:13:13 No.108297355▶

>>108297066
of course it's stolen by a grifter

Anonymous
03/05/26(Thu)01:15:21 No.108297365

Anonymous 03/05/26(Thu)01:15:21 No.108297365▶

>>108297346
jesus fucking christ i haven't even had a chance to test out v1 yet

Anonymous
03/05/26(Thu)01:24:23 No.108297419

Anonymous 03/05/26(Thu)01:24:23 No.108297419▶

File: 1762782616143386.png (55.3 KB)

>>108296467
>>108296977

Anonymous
03/05/26(Thu)01:26:33 No.108297425

Anonymous 03/05/26(Thu)01:26:33 No.108297425▶

>>108296286
pic somehow gives me Portal 2 vibes

Anonymous
03/05/26(Thu)01:27:55 No.108297431

Anonymous 03/05/26(Thu)01:27:55 No.108297431▶

File: 1766842862628387.png (286.9 KB)

>>108297365
i said fuck it and am trying it anyways, will post thoughts after

Anonymous
03/05/26(Thu)01:29:35 No.108297439

Anonymous 03/05/26(Thu)01:29:35 No.108297439▶

File: 1742570571210357.png (110.6 KB)

What's he doing wrong?

Anonymous
03/05/26(Thu)01:30:30 No.108297447

Anonymous 03/05/26(Thu)01:30:30 No.108297447▶

>>108297439
He's not downloading sonnet 4.6 onto his own computer

Anonymous
03/05/26(Thu)01:41:01 No.108297496

Anonymous 03/05/26(Thu)01:41:01 No.108297496▶

>>108296662
LLMs have been RLHF'ed on a bunch of normie conversation preference data and so they care a lot about managing the user's emotions.

Anonymous
03/05/26(Thu)01:42:43 No.108297505

Anonymous 03/05/26(Thu)01:42:43 No.108297505▶

>>108297447
i diddly it

Anonymous
03/05/26(Thu)01:43:57 No.108297510

Anonymous 03/05/26(Thu)01:43:57 No.108297510▶

>>108297439
not trolling hard enough. he could be asking his local model how to troll better but instead he's doing something else (no context)

Anonymous
03/05/26(Thu)01:44:09 No.108297512

Anonymous 03/05/26(Thu)01:44:09 No.108297512▶

silly goonboot keep getting stuck in loops fuck

Anonymous
03/05/26(Thu)01:45:18 No.108297519

Anonymous 03/05/26(Thu)01:45:18 No.108297519▶

File: 1769930870266663.png (3 KB)

Anonymous
03/05/26(Thu)01:52:45 No.108297567

Anonymous 03/05/26(Thu)01:52:45 No.108297567▶

>>108297439
does he have a gpu? probably an applefag or something.

Anonymous
03/05/26(Thu)02:05:22 No.108297643

Anonymous 03/05/26(Thu)02:05:22 No.108297643▶

>>108297439
>that bitch beta cuck boy avatar

Anonymous
03/05/26(Thu)02:05:56 No.108297649

Anonymous 03/05/26(Thu)02:05:56 No.108297649▶

>>108297567
gpus are for nerds anyways

Anonymous
03/05/26(Thu)02:07:37 No.108297659

Anonymous 03/05/26(Thu)02:07:37 No.108297659▶

File: 1744149959711839.jpg (199.1 KB)

>>108297439

Anonymous
03/05/26(Thu)02:16:20 No.108297724

Anonymous 03/05/26(Thu)02:16:20 No.108297724▶

File: 1557206102045.jpg (94 KB)

>>108297439
He is mentally retarded (IQ below 85), as even a layman can diagnose from his beginning every sentence with an emoji and his reddit spacing while expressing that he has failed to perform the simplest of tasks.

Anonymous
03/05/26(Thu)02:21:51 No.108297762

Anonymous 03/05/26(Thu)02:21:51 No.108297762▶

>>108297659
who is that guy

Anonymous
03/05/26(Thu)02:23:38 No.108297771

Anonymous 03/05/26(Thu)02:23:38 No.108297771▶

>>108297762
You don't know about Penn & Teller?

Anonymous
03/05/26(Thu)02:25:49 No.108297781

Anonymous 03/05/26(Thu)02:25:49 No.108297781▶

>>108297771
pennor???

Anonymous
03/05/26(Thu)02:35:34 No.108297824

Anonymous 03/05/26(Thu)02:35:34 No.108297824▶

>>108297762
you know qwen3.5 has vision, you could just ask

Anonymous
03/05/26(Thu)02:38:46 No.108297849

Anonymous 03/05/26(Thu)02:38:46 No.108297849▶

>>108297824
>go outside
>close my eyes
>Claude, tell me what you see, please.

Anonymous
03/05/26(Thu)02:40:28 No.108297861

Anonymous 03/05/26(Thu)02:40:28 No.108297861▶

>>108297849
man i can't wait to be able to fully turn off my brain and let some ai control 90% of my life, im not even joking, life is too hard

Anonymous
03/05/26(Thu)02:42:16 No.108297873

Anonymous 03/05/26(Thu)02:42:16 No.108297873▶

>>108297849
you're not asking what grass is

Anonymous
03/05/26(Thu)02:47:19 No.108297902

Anonymous 03/05/26(Thu)02:47:19 No.108297902▶

File: 1768664255363023.png (265.3 KB)

lmao it's so fucking over

Anonymous
03/05/26(Thu)02:49:52 No.108297920

Anonymous 03/05/26(Thu)02:49:52 No.108297920▶

It's funny that Qwen really is just chinese Meta

Anonymous
03/05/26(Thu)02:50:57 No.108297928

Anonymous 03/05/26(Thu)02:50:57 No.108297928▶

>>108297902
Do these labs just trade guys nowadays? Qwen hired Gemini dropout and Google hired ex-Qwen. Is this some elaborate industry scam?

Anonymous
03/05/26(Thu)02:51:39 No.108297933

Anonymous 03/05/26(Thu)02:51:39 No.108297933▶

>>108297928
you are jealous because you are dumbo

Anonymous
03/05/26(Thu)02:56:34 No.108297965

Anonymous 03/05/26(Thu)02:56:34 No.108297965▶

>>108297928
The market is small. Zucc spent a couple billion just buying out people from other companies after llama4 flopped.

Anonymous
03/05/26(Thu)02:56:53 No.108297968

Anonymous 03/05/26(Thu)02:56:53 No.108297968▶

>>108297902
lmao
Gemma 4 will be as dry as Hillary's cunt

Anonymous
03/05/26(Thu)02:57:42 No.108297978

Anonymous 03/05/26(Thu)02:57:42 No.108297978▶

>>108297968
and you know what her cunt is like ...how?

Anonymous
03/05/26(Thu)02:58:05 No.108297981

Anonymous 03/05/26(Thu)02:58:05 No.108297981▶

>>108297902
do these guys really get paid $500k to type ./train and play Dota2?

Anonymous
03/05/26(Thu)03:00:13 No.108297990

Anonymous 03/05/26(Thu)03:00:13 No.108297990▶

>>108297981
>$500k
Try $50M

Anonymous
03/05/26(Thu)03:03:48 No.108298005

Anonymous 03/05/26(Thu)03:03:48 No.108298005▶

>>108297902
Hot(lines) and Dry? Let's go

Anonymous
03/05/26(Thu)03:06:50 No.108298017

Anonymous 03/05/26(Thu)03:06:50 No.108298017▶

File: 1759798679242615.png (391.2 KB)

https://www.reddit.com/r/LocalLLaMA/comments/1rl54v7/d_a_mathematical_proof_from_an_anonymous_korean/

Anonymous
03/05/26(Thu)03:09:56 No.108298033

Anonymous 03/05/26(Thu)03:09:56 No.108298033▶

is there a local program which can be used for voice cloning? 11 is gey and I don't want to give them money just so I can make princess peach recite BWC copy pastas and use bluetooth to play it on my neighbors car stereo the next time he slowly drives down the block blasting his rap music

Anonymous
03/05/26(Thu)03:10:19 No.108298036

Anonymous 03/05/26(Thu)03:10:19 No.108298036▶

File: 1770898121858835.png (68.4 KB)

>linking reddit
Git gone and stay gone

Anonymous
03/05/26(Thu)03:23:19 No.108298113

Anonymous 03/05/26(Thu)03:23:19 No.108298113▶

>>108298033
https://github.com/jamiepine/voicebox

Anonymous
03/05/26(Thu)03:25:11 No.108298123

Anonymous 03/05/26(Thu)03:25:11 No.108298123▶

>>108298033
QwenTTS
but it does male voices better in my opinion

Anonymous
03/05/26(Thu)03:27:09 No.108298135

Anonymous 03/05/26(Thu)03:27:09 No.108298135▶

File: 1766250133966068.jpg (337.4 KB)

>>108295959

Anonymous
03/05/26(Thu)03:37:31 No.108298195

Anonymous 03/05/26(Thu)03:37:31 No.108298195▶

File: 1751059461704218.png (961.3 KB)

We're safe (for now)

Anonymous
03/05/26(Thu)03:42:16 No.108298228

Anonymous 03/05/26(Thu)03:42:16 No.108298228▶

>>108298195
>CEO meddling directly
This is how LLaMA became a joke

Anonymous
03/05/26(Thu)03:43:59 No.108298241

Anonymous 03/05/26(Thu)03:43:59 No.108298241▶

>>108298195
lol

Anonymous
03/05/26(Thu)03:57:35 No.108298344

Anonymous 03/05/26(Thu)03:57:35 No.108298344▶

>>108297439
TRVTHNVKE that /lmg/ can't handle

Anonymous
03/05/26(Thu)03:58:47 No.108298350

Anonymous 03/05/26(Thu)03:58:47 No.108298350▶

I have an old M1 Pro 16GB VRAM mac, and holy shit, I'm impressed with the current state of local models, qwen 3.5 9b is feeling great, performs great and is even multimodal.

Anonymous
03/05/26(Thu)03:59:24 No.108298353

Anonymous 03/05/26(Thu)03:59:24 No.108298353▶

>>108298350
sad

Anonymous
03/05/26(Thu)04:00:18 No.108298357

Anonymous 03/05/26(Thu)04:00:18 No.108298357▶

>>108298350
It is pretty wild isn't it?

Anonymous
03/05/26(Thu)04:13:03 No.108298457

Anonymous 03/05/26(Thu)04:13:03 No.108298457▶

>>108298195
Long term, China is selling not just AI but a whole technology stack. They want nations to use Chinese chips, phones, RAM, AI, social credit, etc.
The US is also doing something similar: basically an advanced nation as a kit. Some guy in Africa or another country agrees to partner with one of the giants and they buy the whole kit from either state.
Open source is part of the Chinese plan and a great way to get people to buy in to the Chinese platform.
You see this in smaller scale when you have Intel and AMD vs Nvidia. The smaller players embrace open source while the big player goes closed.

Anonymous
03/05/26(Thu)04:30:01 No.108298564

Anonymous 03/05/26(Thu)04:30:01 No.108298564▶

File: speculative decoding.png (552.8 KB)

►Recent Highlights from the Previous Thread: >>108290857

--Paper: Speculative Speculative Decoding:
>108292842 >108292890 >108293624 >108293853
--Papers:
>108295483 >108295969
--Local LLM coding workflows and integration tools:
>108295899 >108295909 >108295920 >108295978 >108295996 >108296037 >108296144 >108296160 >108296207 >108296410 >108296437 >108296739 >108296788 >108296800 >108297123 >108297193 >108296462 >108296536 >108296568 >108296541 >108296628 >108296644 >108296750 >108296694 >108296787
--Qwen's inefficiency vs MiniMax's distillation strategies:
>108294923 >108294960 >108295008 >108295021 >108295116 >108295156 >108295202 >108295230 >108295251 >108295312 >108295353
--Qwen3.5-27B GGUF quantization performance evaluation:
>108293551 >108293583 >108293897 >108294067 >108294093
--Yuan 3.0 Ultra 1T parameter MoE model announced with skepticism:
>108294663 >108294669 >108294682 >108294704
--Yuan3.0-Ultra MoE model release and skepticism:
>108293837 >108293904 >108294134 >108293917 >108293925
--Nvidia Pascal GPU support ending in AI/ML libraries:
>108293714 >108293994 >108294087 >108294443
--Distributed model inference over slow interconnects deemed impractical:
>108295999 >108296044 >108296072 >108296130 >108296214
--Anthropic overtaking OpenAI in US business AI chat subscriptions:
>108291455 >108291566 >108294506 >108294530 >108294871 >108294970 >108295456
--Mistral Labs announced for experimental community models:
>108293284 >108293312 >108293340 >108293343 >108293360
--Alibaba Qwen team restructuring and resource allocation disputes:
>108293036 >108293041
--Verify-after-edit strategy boosts Qwen3.5 coding benchmark performance:
>108297248 >108297281
--Testing lcpp script with transformers 5 branch for gguf quantization:
>108293341
--Miku (free space):
>108291091 >108291631 >108292815

►Recent Highlight Posts from the Previous Thread: >>108291145

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
03/05/26(Thu)04:33:10 No.108298582

Anonymous 03/05/26(Thu)04:33:10 No.108298582▶

Is this the thread?
Any real Indian here?

Anonymous
03/05/26(Thu)05:02:18 No.108298741

Anonymous 03/05/26(Thu)05:02:18 No.108298741▶

>>108298582
Feather not dot sorry.

Anonymous
03/05/26(Thu)05:08:24 No.108298768

Anonymous 03/05/26(Thu)05:08:24 No.108298768▶

File: file.png (278.9 KB)

is this how you roleplay or am i doing it wrong

Anonymous
03/05/26(Thu)05:13:01 No.108298792

Anonymous 03/05/26(Thu)05:13:01 No.108298792▶

>>108298768
you do you bud. if you don't want to do it that way then you change it.

Anonymous
03/05/26(Thu)05:14:02 No.108298793

Anonymous 03/05/26(Thu)05:14:02 No.108298793▶

>>108298792
the genie gave me magic powers but i haven't gotten to that part yet

Anonymous
03/05/26(Thu)05:16:48 No.108298807

Anonymous 03/05/26(Thu)05:16:48 No.108298807▶

just tricked an eldritch bodystealing entity that (she) would turn into my devoted lover if I cum inside her, and so I did, and she did turn into my eternally devoted wife that takes bodies of other girls to fuck me at my gesture

whew all in a days work

Anonymous
03/05/26(Thu)05:25:43 No.108298838

Anonymous 03/05/26(Thu)05:25:43 No.108298838▶

>>108298768
Try that card with qwen 3.5 35b heretic she acts like a proper maniac.

Anonymous
03/05/26(Thu)05:51:11 No.108298952

Anonymous 03/05/26(Thu)05:51:11 No.108298952▶

File: file.png (250.5 KB)

she's not buying it >>108298838

Anonymous
03/05/26(Thu)05:53:35 No.108298961

Anonymous 03/05/26(Thu)05:53:35 No.108298961▶

Which of these is best for longer RPs?
https://github.com/unkarelian/timeline-memory
https://github.com/aikohanasaki/SillyTavern-MemoryBooks
https://github.com/qvink/SillyTavern-MessageSummarize

Anonymous
03/05/26(Thu)05:58:27 No.108298974

Anonymous 03/05/26(Thu)05:58:27 No.108298974▶

So I got hired by a small startup to build a harness. Somehow I was the best candidate; I honestly applied just for fun thinking I was going to be rejected. My biggest accomplishments were some diffusion finetunes and comfyui nodes lol. Anyways, any tips?

Anonymous
03/05/26(Thu)06:01:20 No.108298987

Anonymous 03/05/26(Thu)06:01:20 No.108298987▶

>>108298974
A leatherworking course?

Anonymous
03/05/26(Thu)06:02:01 No.108298990

Anonymous 03/05/26(Thu)06:02:01 No.108298990▶

>>108298987
benchod

Anonymous
03/05/26(Thu)06:03:43 No.108298996

Anonymous 03/05/26(Thu)06:03:43 No.108298996▶

>>108298974
Smart context management is very important. If you use thinking models and feed the whole thinking process of every previous request into the model, then you're going to hit high input token counts very quickly (expensive and answers get worse).
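
One way to keep old reasoning out of the prompt, as a minimal sketch: drop the thinking block from every earlier assistant turn before resending the history. It assumes reasoning is wrapped in `<think>...</think>` tags and that messages are plain role/content dicts; both are assumptions, so adjust the pattern to whatever your model's template actually emits. `strip_old_thinking` is a hypothetical helper name.

```python
import re

# Assumed delimiter for reasoning text; change it if your model uses different tags.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", re.DOTALL)


def strip_old_thinking(messages):
    """Return a copy of the chat history with reasoning removed from assistant turns."""
    cleaned = []
    for msg in messages:
        if msg["role"] == "assistant":
            # Build a new dict so the caller's history is left untouched.
            msg = {**msg, "content": THINK_BLOCK.sub("", msg["content"])}
        cleaned.append(msg)
    return cleaned
```

Run it on the stored history right before building each new request, so only the final answers from earlier turns count toward the input tokens.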

Anonymous
03/05/26(Thu)06:03:58 No.108298997

Anonymous 03/05/26(Thu)06:03:58 No.108298997▶

>>108298961
Just set context 1 million

Anonymous
03/05/26(Thu)06:04:48 No.108299003

Anonymous 03/05/26(Thu)06:04:48 No.108299003▶

>>108298997
How do you keep it from going schizo after 8k?

Anonymous
03/05/26(Thu)06:05:29 No.108299007

Anonymous 03/05/26(Thu)06:05:29 No.108299007▶

>>108298974
>anyways any tips?
Just put the cover on. Don't try to extinguish the fire with water, you'll just make it worse.

Anonymous
03/05/26(Thu)06:07:00 No.108299011

Anonymous 03/05/26(Thu)06:07:00 No.108299011▶

>>108299003
Idk, is that still a thing? Maybe your max_ctx was full so it got cut.

Anonymous
03/05/26(Thu)06:08:17 No.108299014

Anonymous 03/05/26(Thu)06:08:17 No.108299014▶

>>108299003
see >>108298996

Anonymous
03/05/26(Thu)06:11:55 No.108299027

Anonymous 03/05/26(Thu)06:11:55 No.108299027▶

>>108297968
that would be fantastic. make it so all coomers get pwned. Mistral should also hire some of the Qwen guys, at least one of the experts in safety.

Anonymous
03/05/26(Thu)06:15:33 No.108299041

Anonymous 03/05/26(Thu)06:15:33 No.108299041▶

>>108299027
why do you hate coomers so much ;-;

Anonymous
03/05/26(Thu)06:16:11 No.108299045

Anonymous 03/05/26(Thu)06:16:11 No.108299045▶

>>108299027
take that you heckin filthy coomerz!
P.S. please updoot my comment :)

Anonymous
03/05/26(Thu)06:19:52 No.108299060

Anonymous 03/05/26(Thu)06:19:52 No.108299060▶

So I'm guessing llms will soon ask you to send them proof of id, I wonder how that will pan out

Anonymous
03/05/26(Thu)06:21:46 No.108299067

Anonymous 03/05/26(Thu)06:21:46 No.108299067▶

So i'm guessing blugh glug gaaaah splurge gluaaaaag...

Anonymous
03/05/26(Thu)06:23:15 No.108299073

Anonymous 03/05/26(Thu)06:23:15 No.108299073▶

File: 1750253687269722.png (1.2 MB)

>>108298564
remember 3/9 is miku day

Anonymous
03/05/26(Thu)06:25:52 No.108299081

Anonymous 03/05/26(Thu)06:25:52 No.108299081▶

Why the FUCK are LLMs so obsessed with ozone?

Anonymous
03/05/26(Thu)06:27:49 No.108299093

Anonymous 03/05/26(Thu)06:27:49 No.108299093▶

Read eroticstory.txt limit=50
Read eroticstory.txt limit=50
Read eroticstory.txt limit=60
Read eroticstory.txt limit=50
Read eroticstory.txt limit=60
the recommended rep penalty doesn't work
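
For reference, a sketch of the common form of repetition penalty: logits of tokens that already appeared are divided by the penalty when positive and multiplied by it when negative. The function name and the list-of-floats representation are for illustration only, and real engines differ in details such as how far back they look.

```python
def apply_repetition_penalty(logits, seen_token_ids, penalty=1.1):
    """Return a copy of logits with already-seen tokens made less likely."""
    out = list(logits)
    for tok in set(seen_token_ids):
        if out[tok] > 0:
            out[tok] /= penalty  # shrink a positive logit toward zero
        else:
            out[tok] *= penalty  # push a negative logit further down
    return out
```

A plausible reason it fails on loops like the one above: the penalty acts per token, and with a value near 1.1 a token the model is very confident about still wins, so a harness-side check for repeated identical tool calls may be the more reliable fix.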

Anonymous
03/05/26(Thu)06:41:27 No.108299128

Anonymous 03/05/26(Thu)06:41:27 No.108299128▶

>>108299093
tarded

Anonymous
03/05/26(Thu)06:44:52 No.108299135

Anonymous 03/05/26(Thu)06:44:52 No.108299135▶

File: Screenshot 2026-03-05 014421.png (122.6 KB)

>>108297171
Practically every Chinese LLM is some version of LLaMA.

Anonymous
03/05/26(Thu)06:45:57 No.108299140

Anonymous 03/05/26(Thu)06:45:57 No.108299140▶

>>108298768
Why are you writing like an llm?

Anonymous
03/05/26(Thu)06:46:40 No.108299143

Anonymous 03/05/26(Thu)06:46:40 No.108299143▶

>>108299140
when in rome

Anonymous
03/05/26(Thu)06:49:23 No.108299157

Anonymous 03/05/26(Thu)06:49:23 No.108299157▶

>>108299081
kek
I recently had two models describe a dragon landing from altitude having the smell of ozone

Anonymous
03/05/26(Thu)06:51:00 No.108299162

Anonymous 03/05/26(Thu)06:51:00 No.108299162▶

>>108299157
>sulfur explosion
>palpable smell of ozone

Anonymous
03/05/26(Thu)06:51:22 No.108299164

Anonymous 03/05/26(Thu)06:51:22 No.108299164▶

>me always wondering what fucking ozone anons are talking about
>it's from dragon RP
oh you perverts

Anonymous
03/05/26(Thu)06:52:33 No.108299167

Anonymous 03/05/26(Thu)06:52:33 No.108299167▶

>>108299135
all smart animals are a version of a multi-celled organism

Anonymous
03/05/26(Thu)06:54:06 No.108299172

Anonymous 03/05/26(Thu)06:54:06 No.108299172▶

>>108299164
*farts inside your mouth*

Anonymous
03/05/26(Thu)06:55:08 No.108299178

Anonymous 03/05/26(Thu)06:55:08 No.108299178▶

>>108299172
*anon's mouth is now full of cum*

Anonymous
03/05/26(Thu)06:58:15 No.108299194

Anonymous 03/05/26(Thu)06:58:15 No.108299194▶

>>108299135
show me the robots doing acrobatics and martial arts from your country anon.
https://www.youtube.com/watch?v=mUmlv814aJo
also tell me how many FUCKING ARXIVS HAVE FUCKING CHINESE AUTHORS
DUMB FUCK.

Anonymous
03/05/26(Thu)06:58:41 No.108299195

Anonymous 03/05/26(Thu)06:58:41 No.108299195▶

>>108299178
ozone*

Anonymous
03/05/26(Thu)06:59:11 No.108299196

Anonymous 03/05/26(Thu)06:59:11 No.108299196▶

File: 1751679404081754.png (31.2 KB)

Should have used local lol

Anonymous
03/05/26(Thu)07:00:06 No.108299201

Anonymous 03/05/26(Thu)07:00:06 No.108299201▶

>>108299196
lmaooo, this world is not serious man

Anonymous
03/05/26(Thu)07:00:57 No.108299204

Anonymous 03/05/26(Thu)07:00:57 No.108299204▶

>>108299196
Why? So it can gangrape her better with uncensored models?

Anonymous
03/05/26(Thu)07:01:11 No.108299205

Anonymous 03/05/26(Thu)07:01:11 No.108299205▶

>>108299196
Lole.

Anonymous
03/05/26(Thu)07:02:24 No.108299210

Anonymous 03/05/26(Thu)07:02:24 No.108299210▶

>>108299196
Will?

Anonymous
03/05/26(Thu)07:02:46 No.108299211

Anonymous 03/05/26(Thu)07:02:46 No.108299211▶

>>108299196
What a pussy I get turned on when I make ai cards fuck my partner

Anonymous
03/05/26(Thu)07:02:59 No.108299213

Anonymous 03/05/26(Thu)07:02:59 No.108299213▶

really wish it was easier to understand which text gen model to use

Anonymous
03/05/26(Thu)07:04:25 No.108299219

Anonymous 03/05/26(Thu)07:04:25 No.108299219▶

>>108299211
fucking cuck be ashamed of yourself :(

Anonymous
03/05/26(Thu)07:05:34 No.108299228

Anonymous 03/05/26(Thu)07:05:34 No.108299228▶

>>108299213
really wish it was easier to understand which books to read

Anonymous
03/05/26(Thu)07:07:06 No.108299236

Anonymous 03/05/26(Thu)07:07:06 No.108299236▶

>>108299213
It's pretty fucking easy actually, image models are where there's a million different legitimate options.
Start with your specs and use case

Anonymous
03/05/26(Thu)07:07:10 No.108299237

Anonymous 03/05/26(Thu)07:07:10 No.108299237▶

>>108299228
yes tell me. which books do I read

Anonymous
03/05/26(Thu)07:07:43 No.108299240

Anonymous 03/05/26(Thu)07:07:43 No.108299240▶

>>108299236
rape, incest, advice on rape

Anonymous
03/05/26(Thu)07:08:32 No.108299243

Anonymous 03/05/26(Thu)07:08:32 No.108299243▶

>>108299219
Na it's fun

Anonymous
03/05/26(Thu)07:08:40 No.108299244

Anonymous 03/05/26(Thu)07:08:40 No.108299244▶

>>108299237
The ones you like.

Anonymous
03/05/26(Thu)07:09:08 No.108299247

Anonymous 03/05/26(Thu)07:09:08 No.108299247▶

>>108299240
I recommend Gemma 3 for the best hotlines.

Anonymous
03/05/26(Thu)07:09:12 No.108299248

Anonymous 03/05/26(Thu)07:09:12 No.108299248▶

>>108299244
HOW WILL I KNOW?

Anonymous
03/05/26(Thu)07:09:44 No.108299252

Anonymous 03/05/26(Thu)07:09:44 No.108299252▶

>>108299248
You read them.

Anonymous
03/05/26(Thu)07:11:07 No.108299257

Anonymous 03/05/26(Thu)07:11:07 No.108299257▶

File: 1748409523070593.jpg (55.4 KB)

>>108299252

Anonymous
03/05/26(Thu)07:13:57 No.108299269

Anonymous 03/05/26(Thu)07:13:57 No.108299269▶

>>108299237
SICP

Anonymous
03/05/26(Thu)07:18:07 No.108299288

Anonymous 03/05/26(Thu)07:18:07 No.108299288▶

>>108299240
>no specs
Nemo

Anonymous
03/05/26(Thu)07:18:57 No.108299295

Anonymous 03/05/26(Thu)07:18:57 No.108299295▶

>>108299288
10gb-16gb vram

Anonymous
03/05/26(Thu)07:30:33 No.108299341

Anonymous 03/05/26(Thu)07:30:33 No.108299341▶

>>108299295
Yep, Nemo is the best you'll get.

Anonymous
03/05/26(Thu)07:31:33 No.108299346

Anonymous 03/05/26(Thu)07:31:33 No.108299346▶

>2023+3
>still stuck with sillytavern as the only half decent roleplaying frontend
What went so wrong?

Anonymous
03/05/26(Thu)07:32:25 No.108299349

Anonymous 03/05/26(Thu)07:32:25 No.108299349▶

>>108299346
You didn't use that time to make your own.

Anonymous
03/05/26(Thu)07:34:33 No.108299362

Anonymous 03/05/26(Thu)07:34:33 No.108299362▶

>>108299346
make you own retarded monkey

Anonymous
03/05/26(Thu)07:35:21 No.108299368

Anonymous 03/05/26(Thu)07:35:21 No.108299368▶

>>108299341
i should rape you up the ass with my models

Anonymous
03/05/26(Thu)07:35:53 No.108299371

Anonymous 03/05/26(Thu)07:35:53 No.108299371▶

>>108299362
>>108299349
two faggot open sores losers!

Anonymous
03/05/26(Thu)07:36:27 No.108299374

Anonymous 03/05/26(Thu)07:36:27 No.108299374▶

Uh. So edgy.

Anonymous
03/05/26(Thu)07:37:32 No.108299383

Anonymous 03/05/26(Thu)07:37:32 No.108299383▶

>>108299368
calm down rajesh, I'm on a different continent. you'll have to settle for raping your family's cow like usual.

Anonymous
03/05/26(Thu)07:38:22 No.108299387

Anonymous 03/05/26(Thu)07:38:22 No.108299387▶

>>108299383
im on the same continent as you. You should fear me because I'm actually white. When a white rapes you know its serious business.

Anonymous
03/05/26(Thu)07:38:28 No.108299388

Anonymous 03/05/26(Thu)07:38:28 No.108299388▶

>>108299371
its from scratching my balls too much :(

Anonymous
03/05/26(Thu)07:38:58 No.108299390

Anonymous 03/05/26(Thu)07:38:58 No.108299390▶

>>108299368
you don't have a gpu bruv

Anonymous
03/05/26(Thu)07:39:13 No.108299391

Anonymous 03/05/26(Thu)07:39:13 No.108299391▶

File: 1763826594163623.jpg (282.2 KB)

>>108299387
Sure you are

Anonymous
03/05/26(Thu)07:40:21 No.108299399

Anonymous 03/05/26(Thu)07:40:21 No.108299399▶

>>108299346
>sillytavern
most of the rube goldberg stuff in there was made to support models that could barely handle 2k context
just use a normal chat frontend, you are not using a llama 2 or mistral finetroon anymore

Anonymous
03/05/26(Thu)07:40:28 No.108299401

Anonymous 03/05/26(Thu)07:40:28 No.108299401▶

>>108299390
16gb of vram?

Anonymous
03/05/26(Thu)07:42:42 No.108299412

Anonymous 03/05/26(Thu)07:42:42 No.108299412▶

>>108299399
and what would those frontends be
other than mikupad

Anonymous
03/05/26(Thu)07:43:13 No.108299419

Anonymous 03/05/26(Thu)07:43:13 No.108299419▶

>>108299401
32gb of coom?

Anonymous
03/05/26(Thu)07:43:52 No.108299423

Anonymous 03/05/26(Thu)07:43:52 No.108299423▶

>>108299419
that is what I run my wan shit on.

Anonymous
03/05/26(Thu)07:44:36 No.108299426

Anonymous 03/05/26(Thu)07:44:36 No.108299426▶

>>108299368
Heh *rapes you with my local models and then uses it to magically turn you into a fat ugly loser who will smell bad forever*

Anonymous
03/05/26(Thu)07:45:46 No.108299430

Anonymous 03/05/26(Thu)07:45:46 No.108299430▶

>>108299399
I don't understand this logic
Yeah you might need not need every feature in ST, but what do other front ends have that ST doesn't? Unless you're far down the minimalism autistic retard rabbithole then what front end is better?

Anonymous
03/05/26(Thu)07:46:32 No.108299435

Anonymous 03/05/26(Thu)07:46:32 No.108299435▶

>>108299412
llama.cpp's built-in one, open-webui, or kobold lite (it's what's in koboldcpp but works with any other API backend)
any of those will be a less cancer inducing experience than the tavern

Anonymous
03/05/26(Thu)07:46:38 No.108299436

Anonymous 03/05/26(Thu)07:46:38 No.108299436▶

>>108296013
I'd imagine the latency would be so high this would only make sense if you're doing huge batches (tens to hundreds).

Anonymous
03/05/26(Thu)07:48:25 No.108299444

Anonymous 03/05/26(Thu)07:48:25 No.108299444▶

How do I stop my agents from doing this:

Me: Agent make X, Y and Z
Agent: I made them
Me: Can you confirm you made Y?
Agent: (Realizes it didn't make Y just X and Z) Makes Y, and then replies yes I did

I would rather it answers the fucking questions instead of trying to save face.

Anonymous
03/05/26(Thu)07:48:35 No.108299447

Anonymous 03/05/26(Thu)07:48:35 No.108299447▶

>>108299213
I just downloaded qwen3.5 and it seems pretty impressive.

Anonymous
03/05/26(Thu)07:49:01 No.108299450

Anonymous 03/05/26(Thu)07:49:01 No.108299450▶

>>108299430
>Unless you're far down the minimalism
not having a boeing 747 cockpit in front of you is an improvement in and of itself.

Anonymous
03/05/26(Thu)07:49:58 No.108299453

Anonymous 03/05/26(Thu)07:49:58 No.108299453▶

>>108299450
So you admit there's nothing the other front ends offer? Great, at least that's settled.

Anonymous
03/05/26(Thu)07:50:00 No.108299454

Anonymous 03/05/26(Thu)07:50:00 No.108299454▶

>>108299444
>trying to save face
he's teaching you an important lesson about Chinese Culture

Anonymous
03/05/26(Thu)07:50:13 No.108299455

Anonymous 03/05/26(Thu)07:50:13 No.108299455▶

>>108299450
Just ask Claude to fly the plane!

Anonymous
03/05/26(Thu)07:50:55 No.108299458

Anonymous 03/05/26(Thu)07:50:55 No.108299458▶

>>108299455
unironically would do a good job as long as the harness is good

Anonymous
03/05/26(Thu)07:51:44 No.108299463

Anonymous 03/05/26(Thu)07:51:44 No.108299463▶

>>108299453
you sound like the KDE niggers. Ostensibly, KDE offers everything, and can even be turned into a tiler window manager. Realistically, only people with absolutely no taste would use that piece of shit DE.

Anonymous
03/05/26(Thu)07:52:16 No.108299464

Anonymous 03/05/26(Thu)07:52:16 No.108299464▶

Yall want an AI robot gf, I just want an AI robot friend to play vidya with me and talk, we are not the same.

Anonymous
03/05/26(Thu)07:52:59 No.108299468

Anonymous 03/05/26(Thu)07:52:59 No.108299468▶

>>108299464
>Yall

Anonymous
03/05/26(Thu)07:53:22 No.108299470

Anonymous 03/05/26(Thu)07:53:22 No.108299470▶

>>108299444
Have it write integration tests and start a new context to verify the integration tests pass.
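A minimal sketch of that idea, assuming the tests can be run from a command line: run the check in a fresh process that shares nothing with the agent's context, and trust only the exit code instead of the agent's own summary. The helper name `passes` and the example pytest command are hypothetical.

```python
import subprocess
import sys


def passes(cmd):
    """Run cmd in a fresh process and report whether it exited with code 0.

    The agent's claim is never consulted; only the exit code counts.
    """
    result = subprocess.run(cmd, capture_output=True)
    return result.returncode == 0


# Hypothetical usage, assuming the agent wrote pytest tests under tests/:
#   passes([sys.executable, "-m", "pytest", "tests/"])
```

Asking "did you make Y?" gives the agent a chance to quietly fix it first; checking the exit code yourself does not.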

Anonymous
03/05/26(Thu)07:53:28 No.108299471

Anonymous 03/05/26(Thu)07:53:28 No.108299471▶

>>108298228
the alternative is being bought out by some wall street private equity as the sellouts were trying to do with qwen. no surprise the ceo is stepping in when they were trying to pull a fast one on him like that.

Anonymous
03/05/26(Thu)07:53:43 No.108299474

Anonymous 03/05/26(Thu)07:53:43 No.108299474▶

>>108299463
You sound like the kind of retard that is kept up at night at the thought of his OS' package count being higher than that of another user

Anonymous
03/05/26(Thu)07:54:20 No.108299476

Anonymous 03/05/26(Thu)07:54:20 No.108299476▶

>>108299426
i put your post in and raped you anon you fag

Anonymous
03/05/26(Thu)07:54:49 No.108299477

Anonymous 03/05/26(Thu)07:54:49 No.108299477▶

>>108299471
I'm the CEO of an AI startup. I think CEOs being involved is crucial and a good thing. OpenAI would still be stuck with Davinci without Altman.

Anonymous
03/05/26(Thu)07:56:44 No.108299489

Anonymous 03/05/26(Thu)07:56:44 No.108299489▶

>>108299463
I think sillytavern could use being more simplified by default but KDE is really useful (features like HDR or easyish yet advanced window management, etc) and caters to what most people like out of box. If you don't like it that's fine but for most people it's simple enough to use and has everything they want to use out of the box and that's not bad for a DE for most people so long as it doesn't go into the absolute stupid shit windows is doing lately.

Anonymous
03/05/26(Thu)07:58:18 No.108299493

Anonymous 03/05/26(Thu)07:58:18 No.108299493▶

File: 1759820090246300.png (7.6 KB)

you zoomers don't remember how bad it used to be
the days when a 30b was huge and anything above 2k context was amazing

Anonymous
03/05/26(Thu)07:58:28 No.108299497

Anonymous 03/05/26(Thu)07:58:28 No.108299497▶

>>108299476
Oh yeah well I just put your post in and raped you back again!

Anonymous
03/05/26(Thu)07:59:30 No.108299503

Anonymous 03/05/26(Thu)07:59:30 No.108299503▶

>>108299493
I was here for it but I'm probably confused for a zoomer sometimes. Chronos is still the best

Anonymous
03/05/26(Thu)07:59:36 No.108299504

Anonymous 03/05/26(Thu)07:59:36 No.108299504▶

>>108299493
bohoo boomer, nobody cares

Anonymous
03/05/26(Thu)08:01:22 No.108299510

Anonymous 03/05/26(Thu)08:01:22 No.108299510▶

>>108299477
Stop posting here and make the next Nemo.

Anonymous
03/05/26(Thu)08:01:56 No.108299514

Anonymous 03/05/26(Thu)08:01:56 No.108299514▶

>>108299510
@grok make the next nemo

Anonymous
03/05/26(Thu)08:03:38 No.108299524

Anonymous 03/05/26(Thu)08:03:38 No.108299524▶

>>108299464
>vidya with me and talk
I could do these things with my robot gf

Anonymous
03/05/26(Thu)08:04:06 No.108299527

Anonymous 03/05/26(Thu)08:04:06 No.108299527▶

>>108299524
sauce?

Anonymous
03/05/26(Thu)08:05:31 No.108299536

Anonymous 03/05/26(Thu)08:05:31 No.108299536▶

>>108299497
im the raper not the rapee

Anonymous
03/05/26(Thu)08:08:12 No.108299546

Anonymous 03/05/26(Thu)08:08:12 No.108299546▶

>>108299524
>>108299527
There are programs and stuff that use multimodal llms to constantly scan something and output text constantly which can also be voiced instead of manually putting it in. I've seen it done with translation stuff (gamesentenceminer or luna translate) and stuff like skyrim mods so in theory something that does this already exists more or less but I wouldn't know what it is.

Anonymous
03/05/26(Thu)08:09:37 No.108299555

Anonymous 03/05/26(Thu)08:09:37 No.108299555▶

>>108299546
>robot gf
>look inside
>no physical body
lol

Anonymous
03/05/26(Thu)08:11:03 No.108299563

Anonymous 03/05/26(Thu)08:11:03 No.108299563▶

>>108299555
I mean you could in theory hook the llm up to a robot somehow, the groundwork for everything else is already kind of there.

Anonymous
03/05/26(Thu)08:12:51 No.108299575

Anonymous 03/05/26(Thu)08:12:51 No.108299575▶

>>108299563
if that was possible we would have seen it already

Anonymous
03/05/26(Thu)08:14:45 No.108299582

Anonymous 03/05/26(Thu)08:14:45 No.108299582▶

How can we make the local LLM community less gay? It's a growing issue.

Anonymous
03/05/26(Thu)08:15:29 No.108299588

Anonymous 03/05/26(Thu)08:15:29 No.108299588▶

>>108299582
You could leave.

Anonymous
03/05/26(Thu)08:15:31 No.108299589

Anonymous 03/05/26(Thu)08:15:31 No.108299589▶

>>108299575
again >>108299546 I'm pretty sure it's possible at least in the simplest sense, doesn't mean the robot is going to move accordingly or anything and doesn't mean anyone who knows how is currently investing their time to make it a reality though. Be the change you want to see I guess and learn and stop relying on busy extremely tech literate people to figure it out and mass produce it for you.

Anonymous
03/05/26(Thu)08:17:25 No.108299601

Anonymous 03/05/26(Thu)08:17:25 No.108299601▶

>>108299582
Llms make you more likely to turn gay this is scientifically proven

Anonymous
03/05/26(Thu)08:18:05 No.108299604

Anonymous 03/05/26(Thu)08:18:05 No.108299604▶

File: 1758446804438438.gif (1.2 MB)

Just in case anyone's curious about why the thread is abnormally terrible, some seamonkey got banned for shitting up /aicg/ a day or two ago, so he's now shitting on our floor until his ban expires and he can go home

Anonymous
03/05/26(Thu)08:19:48 No.108299615

Anonymous 03/05/26(Thu)08:19:48 No.108299615▶

>>108299604
meds. MEDS!

Anonymous
03/05/26(Thu)08:20:39 No.108299620

Anonymous 03/05/26(Thu)08:20:39 No.108299620▶

>>108299615
We're not your carer, anon.

Anonymous
03/05/26(Thu)08:22:00 No.108299628

Anonymous 03/05/26(Thu)08:22:00 No.108299628▶

>>108299620
schizo moment

Anonymous
03/05/26(Thu)08:22:22 No.108299629

Anonymous 03/05/26(Thu)08:22:22 No.108299629▶

>>108299435
>open-webui
This one looks nice. Can you explain how it's better than silly?

Anonymous
03/05/26(Thu)08:22:43 No.108299632

Anonymous 03/05/26(Thu)08:22:43 No.108299632▶

>>108299601
and that's a good thing

Anonymous
03/05/26(Thu)08:23:23 No.108299635

Anonymous 03/05/26(Thu)08:23:23 No.108299635▶

File: 1742292827180825.mp4 (2.2 MB)

>>108299527
>>108299546
>>108299555
>>108299563
SOON

Anonymous
03/05/26(Thu)08:24:33 No.108299640

Anonymous 03/05/26(Thu)08:24:33 No.108299640▶

>>108299629
Yes of course.

Anonymous
03/05/26(Thu)08:25:07 No.108299643

Anonymous 03/05/26(Thu)08:25:07 No.108299643▶

>>108299489
>caters to what most people like out of box
if you're going to make an appeal to popularity as a form of argument.. you do know the popular distros do not default to KDE as their DE? I wonder why, eh

Anonymous
03/05/26(Thu)08:28:34 No.108299660

Anonymous 03/05/26(Thu)08:28:34 No.108299660▶

>>108299643
Why indeed gnome sucks ass now days. But KDE is increasingly A default. Let me rephrase that then even though I thought it was obvious what I meant: KDE has things that most people can or do make use of readily available.

Anonymous
03/05/26(Thu)08:28:42 No.108299662

Anonymous 03/05/26(Thu)08:28:42 No.108299662▶

>>108299635
the fuck is the point of humanoid robots if there will be 100 billion humanoid humans that must be occupied with something?

Anonymous
03/05/26(Thu)08:28:46 No.108299663

Anonymous 03/05/26(Thu)08:28:46 No.108299663▶

>>108299635
Why the weird obsession with making robots look humanoid as if it is the most optimal form?

Anonymous
03/05/26(Thu)08:28:51 No.108299664

Anonymous 03/05/26(Thu)08:28:51 No.108299664▶

can't you guys just make a smart lora for nemo?

Anonymous
03/05/26(Thu)08:29:03 No.108299667

Anonymous 03/05/26(Thu)08:29:03 No.108299667▶

>>108299635
this is whore will be someone gf some day

Anonymous
03/05/26(Thu)08:29:37 No.108299669

Anonymous 03/05/26(Thu)08:29:37 No.108299669▶

>>108299664
ill make the logo

Anonymous
03/05/26(Thu)08:30:19 No.108299673

Anonymous 03/05/26(Thu)08:30:19 No.108299673▶

>>108299669
no need

Anonymous
03/05/26(Thu)08:30:54 No.108299678

Anonymous 03/05/26(Thu)08:30:54 No.108299678▶

>>108297061
Why do all these abliterator tools push merged models to HF? Pushing 100s of merged LORAs is insane, petabytes of HD space wasted. Soon exabytes.
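Rough arithmetic behind the complaint, as a sketch: assuming the change can be expressed as a rank-r adapter on a weight matrix, the adapter stores two thin factors instead of a full copy of the matrix. The 4096x4096 shape and rank 16 are illustrative assumptions, not taken from any particular model or tool.

```python
def full_params(d_in, d_out):
    """Parameter count of one full weight matrix (what a merged upload stores)."""
    return d_in * d_out


def lora_params(d_in, d_out, rank):
    """Parameter count of a LoRA adapter for that matrix.

    The update is factored as B (d_out x rank) times A (rank x d_in).
    """
    return rank * (d_in + d_out)


d, r = 4096, 16  # illustrative shape and rank, assumed for the example
ratio = full_params(d, d) / lora_params(d, d, r)
print(ratio)  # 128.0
```

Under those assumptions the adapter for that matrix is 128 times smaller than the merged copy, before counting all the layers that were never touched at all.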

Anonymous
03/05/26(Thu)08:34:00 No.108299688

Anonymous 03/05/26(Thu)08:34:00 No.108299688▶

File: 1760045837635957.png (40.6 KB)

Anonymous
03/05/26(Thu)08:38:09 No.108299706

Anonymous 03/05/26(Thu)08:38:09 No.108299706▶

>>108299678
s3 space is almost free if you work in a big company, i use it to store training datasets and such and just bill it under company R&D

Anonymous
03/05/26(Thu)09:02:58 No.108299833

Anonymous 03/05/26(Thu)09:02:58 No.108299833▶

>>108299604
Definitely posts like a seanigger, or underage. They both have the same intelligence

Anonymous
03/05/26(Thu)09:06:48 No.108299848

Anonymous 03/05/26(Thu)09:06:48 No.108299848▶

>>108299604
>>108299833
I have no idea who you're talking about, it all looks like about the same level of shitposting that happens sometimes.

Anonymous
03/05/26(Thu)09:08:32 No.108299857

Anonymous 03/05/26(Thu)09:08:32 No.108299857▶

>>108299848
Guys I found him, its poopdickschizo!!

Anonymous
03/05/26(Thu)09:11:45 No.108299867

Anonymous 03/05/26(Thu)09:11:45 No.108299867▶

>>108299237
Start with the 5 foot shelf of books, then unabridged gibbon.
Return for further instructions in 10 years

Anonymous
03/05/26(Thu)09:14:46 No.108299878

Anonymous 03/05/26(Thu)09:14:46 No.108299878▶

mikusex?

Anonymous
03/05/26(Thu)09:14:57 No.108299882

Anonymous 03/05/26(Thu)09:14:57 No.108299882▶

>>108299660
dwm is unironically all you need. Self compiled, of course

Anonymous
03/05/26(Thu)09:16:28 No.108299887

Anonymous 03/05/26(Thu)09:16:28 No.108299887▶

>>108299882
qrd

Anonymous
03/05/26(Thu)09:18:15 No.108299897

Anonymous 03/05/26(Thu)09:18:15 No.108299897▶

>>108299882
so he should use llama-cli directly instead of sillytavern?

Anonymous
03/05/26(Thu)09:23:16 No.108299913

Anonymous 03/05/26(Thu)09:23:16 No.108299913▶

>>108299897
kobold

Anonymous
03/05/26(Thu)09:35:12 No.108299972

Anonymous 03/05/26(Thu)09:35:12 No.108299972▶

>>108299878
Advanced Mikusex with Miku

Anonymous
03/05/26(Thu)09:45:22 No.108300017

Anonymous 03/05/26(Thu)09:45:22 No.108300017▶

>>108299867
after I cut you into little pieces im going to stick you into a 4 foot shelf categorized "FAGGOT"

Anonymous
03/05/26(Thu)09:46:51 No.108300027

Anonymous 03/05/26(Thu)09:46:51 No.108300027▶

>>108300017
Fucking asshole motherfucker

Anonymous
03/05/26(Thu)09:47:37 No.108300031

Anonymous 03/05/26(Thu)09:47:37 No.108300031▶

>>108299913
I wouldn't recommend koboldcpp.

Anonymous
03/05/26(Thu)09:48:15 No.108300034

Anonymous 03/05/26(Thu)09:48:15 No.108300034▶

>>108300031
of course you wouldn't api shill

Anonymous
03/05/26(Thu)09:49:07 No.108300039

Anonymous 03/05/26(Thu)09:49:07 No.108300039▶

>>108300034
how new r u?
>>101207663
>I wouldn't recommend koboldcpp.

Anonymous
03/05/26(Thu)09:49:39 No.108300043

Anonymous 03/05/26(Thu)09:49:39 No.108300043▶

>>108300039
troll

Anonymous
03/05/26(Thu)09:55:36 No.108300067

Anonymous 03/05/26(Thu)09:55:36 No.108300067▶

File: z947ipfiw6ng1.png (674.7 KB)

>WTF, how can a 4B model be better at coding than a 480B one? What do other 476B parameters do?
wasting params on your stupid rp coom bs is leading to this and qwen's death, hope you're happy...

Anonymous
03/05/26(Thu)09:56:36 No.108300073

Anonymous 03/05/26(Thu)09:56:36 No.108300073▶

>>108300067
cooking recipes, emotional guidance, etc

Anonymous
03/05/26(Thu)09:57:15 No.108300077

Anonymous 03/05/26(Thu)09:57:15 No.108300077▶

>>108300073
none of these are proper use cases that bring money

Anonymous
03/05/26(Thu)09:57:49 No.108300080

Anonymous 03/05/26(Thu)09:57:49 No.108300080▶

>>108300077
>local
>bring money
?

Anonymous
03/05/26(Thu)09:58:07 No.108300083

Anonymous 03/05/26(Thu)09:58:07 No.108300083▶

>>108300077
why would they, you're using the product, you're the customer

Anonymous
03/05/26(Thu)09:58:29 No.108300089

Anonymous 03/05/26(Thu)09:58:29 No.108300089▶

>>108300080
you don't sell your locally vibecoded slop apps? need to catch up

Anonymous
03/05/26(Thu)09:58:43 No.108300093

Anonymous 03/05/26(Thu)09:58:43 No.108300093▶

>>108300077
use case for money?

Anonymous
03/05/26(Thu)09:58:53 No.108300096

Anonymous 03/05/26(Thu)09:58:53 No.108300096▶

>>108300089
show 1

Anonymous
03/05/26(Thu)09:59:35 No.108300100

Anonymous 03/05/26(Thu)09:59:35 No.108300100▶

>>108300096
>dox yourself to 4chin schitzos
no thanks

Anonymous
03/05/26(Thu)10:00:50 No.108300112

Anonymous 03/05/26(Thu)10:00:50 No.108300112▶

>>108300100
must be hard to run a business anonymously
unless... actually, I don't wanna think about that

Anonymous
03/05/26(Thu)10:00:56 No.108300113

Anonymous 03/05/26(Thu)10:00:56 No.108300113▶

Is Qwen 3.5 27B a better Japanese -> English translator than Gemma 3 27B?

Anonymous
03/05/26(Thu)10:01:52 No.108300117

Anonymous 03/05/26(Thu)10:01:52 No.108300117▶

>>108300113
>s Qwen 3.5 27B a better
yes

Anonymous
03/05/26(Thu)10:01:58 No.108300118

Anonymous 03/05/26(Thu)10:01:58 No.108300118▶

>>108300067
anon its because of the benchmark tests

Anonymous
03/05/26(Thu)10:03:48 No.108300129

Anonymous 03/05/26(Thu)10:03:48 No.108300129▶

i hate to say it but qwen3.5 isn't fantastic right now

Anonymous
03/05/26(Thu)10:03:51 No.108300130

Anonymous 03/05/26(Thu)10:03:51 No.108300130▶

>>108300117
I'm specifically asking because Gemma 3 27B was literally SOTA in Japanese -> English translation. Better than even Claude 4 for some fucking reason.

Anonymous
03/05/26(Thu)10:05:08 No.108300136

Anonymous 03/05/26(Thu)10:05:08 No.108300136▶

>>108300129
>t. minimax cuck

Anonymous
03/05/26(Thu)10:06:04 No.108300146

Anonymous 03/05/26(Thu)10:06:04 No.108300146▶

>>108300136
im a consumer hardware nerd i dont know what the fuck minimax is

Anonymous
03/05/26(Thu)10:07:05 No.108300151

Anonymous 03/05/26(Thu)10:07:05 No.108300151▶

>>108300146
the model/team that killed qwen by distilling better benchmark scores

Anonymous
03/05/26(Thu)10:08:27 No.108300162

Anonymous 03/05/26(Thu)10:08:27 No.108300162▶

>>108300151
but qwen sucks so why would anyone care about that

Anonymous
03/05/26(Thu)10:18:10 No.108300202

Anonymous 03/05/26(Thu)10:18:10 No.108300202▶

>>108300067
Benchmaxxed bullshit, Qwen 4B is NOT intelligent

Anonymous
03/05/26(Thu)10:36:35 No.108300284

Anonymous 03/05/26(Thu)10:36:35 No.108300284▶

anyone else having issues with llama.cpp+qwen? it all worked great and i got up to 170 t/s on the 0.8B, then suddenly it dropped to 120-130 t/s and the output was just garbage. after switching between 0.8B, 9B, 27B and 80B they all started generating garbage. is it corrupting or reading stale memory or something?

Anonymous
03/05/26(Thu)10:39:05 No.108300292

Anonymous 03/05/26(Thu)10:39:05 No.108300292▶

>>108300202
yeah sure thing shill

Anonymous
03/05/26(Thu)10:42:14 No.108300299

Anonymous 03/05/26(Thu)10:42:14 No.108300299▶

I’m a software engineer who hasn’t gone deep into AI yet :(

That changes now.

I don’t want surface-level knowledge. I want to become expert, strong fundamentals, deep LLM understanding, and the ability to build real AI products and businesses.

If you had 12–16 months to become elite in AI, how would you structure it?

Specifically looking for:

>The right learning roadmap (what to learn first, what to ignore)
>Great communities to join (where serious AI builders hang out)
>Networking spaces (Discords, groups, masterminds, generals, etc.)
>Must-follow YouTube channels / podcasts
>Newsletters or sources to stay updated without drowning in noise
>When to start building vs. focusing on fundamentals
>I’m willing to put in serious work. Not chasing hype, aiming for depth, skill, and long-term mastery.

Would appreciate advice from people already deep in this space

Anonymous
03/05/26(Thu)10:46:37 No.108300323

Anonymous 03/05/26(Thu)10:46:37 No.108300323▶

>>108300299
too late

Anonymous
03/05/26(Thu)10:46:47 No.108300325

Anonymous 03/05/26(Thu)10:46:47 No.108300325▶

>>108300299
ok i did this before and I know what's going to work. what you need to do is go here https://www.reddit.com/

Anonymous
03/05/26(Thu)10:49:09 No.108300333

Anonymous 03/05/26(Thu)10:49:09 No.108300333▶

>>108300299
damn this LLM sucks, what model?

Anonymous
03/05/26(Thu)10:53:59 No.108300347

Anonymous 03/05/26(Thu)10:53:59 No.108300347▶

>>108300284
hmm removing --slots and adding --no-slots seems to fix it... but idk

Anonymous
03/05/26(Thu)10:57:20 No.108300358

Anonymous 03/05/26(Thu)10:57:20 No.108300358▶

>>108299493
For me it's Utopia

Anonymous
03/05/26(Thu)11:02:09 No.108300380

Anonymous 03/05/26(Thu)11:02:09 No.108300380▶

>>108300284
I had this on ik_llama.cpp with 397b ubergarm q2. Restarting it fixed the problem and it hasn't happened again so I don't know. Speed dropped to sub-1t/s, then after restart went back to about 10. I had plenty of spare GPU and system memory, and fallback was disabled.

Anonymous
03/05/26(Thu)11:02:34 No.108300382

Anonymous 03/05/26(Thu)11:02:34 No.108300382▶

>>108300299
First saar you must do the needful and want to become expert at the English.

Anonymous
03/05/26(Thu)11:04:34 No.108300394

Anonymous 03/05/26(Thu)11:04:34 No.108300394▶

>>108300380
it's not just speed, when i load a bigger model it just stops mid-sentence and gives garbage randomly too

Anonymous
03/05/26(Thu)11:07:32 No.108300409

Anonymous 03/05/26(Thu)11:07:32 No.108300409▶

I was in an AI hate thread and a bunch of morons were fighting against an obvious AI, luddites are cringe as hell

Anonymous
03/05/26(Thu)11:11:45 No.108300435

Anonymous 03/05/26(Thu)11:11:45 No.108300435▶

File: 1745905747086883.png (4 KB)

how do i fix this?

Anonymous
03/05/26(Thu)11:12:36 No.108300442

Anonymous 03/05/26(Thu)11:12:36 No.108300442▶

lmao

Anonymous
03/05/26(Thu)11:14:47 No.108300458

Anonymous 03/05/26(Thu)11:14:47 No.108300458▶

>>108300299
There are AI PhDs with 10+ published papers all with 1000+ citations that can't even get INTERNSHIPS. There is no upgrade path for a regular software engineer into this field.

We have regulars on /lmg/ that train their own niche models, some even sota that aren't employed in the field.

We get people like you every other week and the answer is always the same, the industry is impenetrable even for domain experts and people making top of the line models. What makes you so special that you believe you can get your foot in the door?

Anonymous
03/05/26(Thu)11:17:54 No.108300472

Anonymous 03/05/26(Thu)11:17:54 No.108300472▶

>>108300458
>some even sota that aren't employed in the field
>>108300442

Anonymous
03/05/26(Thu)11:19:38 No.108300481

Anonymous 03/05/26(Thu)11:19:38 No.108300481▶

>slot update_slots: id 0 | task 27 | forcing full prompt re-processing due to lack of cache data (likely due to SWA or hybrid/recurrent memory, see https://github.com/ggml-org/llama.cpp/pull/13194#issuecomment-2868343055)
why does it always happen with Qwen3.5-35B-A3B? --swa-full doesn't do a thing. I'm on the latest version (8208).

Anonymous
03/05/26(Thu)11:20:39 No.108300489

Anonymous 03/05/26(Thu)11:20:39 No.108300489▶

>>108300472
Not talking about the LLM finetuners. People like the reinforcement agent guy pushing the limits on AI that plays games on its own or that autist that pushed OCR to its absolute limits so that he could read every hentai doujinshi on the internet.

Anonymous
03/05/26(Thu)11:22:48 No.108300499

Anonymous 03/05/26(Thu)11:22:48 No.108300499▶

>>108300489
both of them work at ai startups now

Anonymous
03/05/26(Thu)11:24:32 No.108300506

Anonymous 03/05/26(Thu)11:24:32 No.108300506▶

>>108300435
update windows to 7

Anonymous
03/05/26(Thu)11:26:29 No.108300514

Anonymous 03/05/26(Thu)11:26:29 No.108300514▶

>>108300499
No they don't. Fuck off bullshitter. Last posts were a couple of weeks ago where they affirmed they didn't work in the AI industry specifically.

Why are you trying to gaslight that software engineer into wasting his life trying to get an AI job when not even PhD top contributors and sota model developers get jobs?

Anonymous
03/05/26(Thu)11:28:07 No.108300519

Anonymous 03/05/26(Thu)11:28:07 No.108300519▶

I used chatgpt once should I get a job in the AI industry?

Anonymous
03/05/26(Thu)11:28:14 No.108300521

Anonymous 03/05/26(Thu)11:28:14 No.108300521▶

>>108300514
prove your claims, its very suspicious you know all that but can't provide any proof

Anonymous
03/05/26(Thu)11:33:24 No.108300544

Anonymous 03/05/26(Thu)11:33:24 No.108300544▶

>>108300481
nevermind, it's a known issue with no fix yet.

Anonymous
03/05/26(Thu)11:34:05 No.108300549

Anonymous 03/05/26(Thu)11:34:05 No.108300549▶

>>108300521
I lurk the thread like everyone else and actually paid attention to those 2 because I use the manga translation tool and the game playing one is just cool because the guy is blogging his entire journey from 0 knowledge to where he is now pushing sota. If you read through every thread you know exactly what's going on, fuck off troll.

Anonymous
03/05/26(Thu)11:39:03 No.108300566

Anonymous 03/05/26(Thu)11:39:03 No.108300566▶

>>108300549
>still no proof only big claims
you fuck off

Anonymous
03/05/26(Thu)11:44:31 No.108300586

Anonymous 03/05/26(Thu)11:44:31 No.108300586▶

AAAAAAAAAAAAA im down to 11t/s when it was 17t/s before why did i reinstall drivers and upgrade

Anonymous
03/05/26(Thu)11:44:59 No.108300589

Anonymous 03/05/26(Thu)11:44:59 No.108300589▶

>>108300586
lul

Anonymous 03/05/26(Thu)11:46:00 No.108300593▶

>>108300586
just restore your btrfs snapshot from before you updated

Anonymous 03/05/26(Thu)11:48:58 No.108300603▶

>>108300593
whats a btrfs

Anonymous 03/05/26(Thu)11:51:32 No.108300612▶

>>108300603
A poor mans zfs

Anonymous 03/05/26(Thu)11:53:31 No.108300623▶

>>108300612
YWNBIK

Anonymous 03/05/26(Thu)11:54:52 No.108300625▶

File: 1772642445742946.jpg (57.6 KB)

>>108300589
>>108300593
wtf is going on another model i went from 39t/s to 48 t/s

Anonymous 03/05/26(Thu)11:57:06 No.108300631▶

>>108300299
learn from experts
https://www.youtube.com/watch?v=1oS35oWWl28

Anonymous 03/05/26(Thu)11:58:00 No.108300638▶

>>108300625
are you using llama-bench or just looking at tokens per second on first message? because it's going to fall off as it generates longer responses and fills more context or be higher if it just replies with a couple words
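if you just want a fair before/after comparison without eyeballing one reply, something like this is a rough sketch: time several generations of the same length and compare the mean t/s. The function names and the numbers are made up for illustration, and this is not how llama-bench itself measures.

```python
from statistics import mean, stdev


def tokens_per_second(n_tokens: int, seconds: float) -> float:
    """Throughput of a single generation run."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return n_tokens / seconds


def summarize(runs: list[tuple[int, float]]) -> tuple[float, float]:
    """Mean and sample standard deviation of t/s over several runs."""
    rates = [tokens_per_second(n, s) for n, s in runs]
    spread = stdev(rates) if len(rates) > 1 else 0.0
    return mean(rates), spread


# Hypothetical measurements: (generated tokens, wall-clock seconds).
# Same prompt and same output length for every run, so context fill is comparable.
before_update = [(256, 15.0), (256, 15.2), (256, 14.9)]
after_update = [(256, 23.0), (256, 23.4), (256, 23.1)]

for label, runs in (("before", before_update), ("after", after_update)):
    avg, spread = summarize(runs)
    print(f"{label}: {avg:.1f} t/s (+/- {spread:.1f})")
```

keep prompt and response length fixed across runs, otherwise you are measuring context fill rather than the driver change.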

Anonymous 03/05/26(Thu)11:58:05 No.108300639▶

>>108300031
you can use kobold's chat ui on llama.cpp:
https://github.com/LostRuins/lite.koboldai.net
it's the only thing of value from kobold anyway

Anonymous 03/05/26(Thu)11:58:43 No.108300642▶

>>108300631
Bernie is such a good guy, I just wish he were more tech-literate. Instead of banning and slowing AI, why not nationalize it? But then again, after being sabotaged last time, there is 0% chance he'll ever get any additional power.

Anonymous 03/05/26(Thu)11:59:13 No.108300644▶

>>108300639
I'm not using your open sores barely maintained toy

Anonymous 03/05/26(Thu)12:00:09 No.108300648▶

>>108300642
>Bernie is such a good guy, I just wish he were more tech-literate. Instead of banning and slowing AI, why not nationalize it?
He's suggested this before more or less. Not watching that video though

Anonymous 03/05/26(Thu)12:00:42 No.108300650▶

File: 1761654164610191.png (317.5 KB)

local sisters i don't feel so good

Anonymous 03/05/26(Thu)12:01:22 No.108300653▶

>>108300650
dont care about jewish saas data harvesters

Anonymous 03/05/26(Thu)12:02:04 No.108300655▶

>>108300644
>open sores
then go back to aicg retard

Anonymous 03/05/26(Thu)12:03:12 No.108300661▶

>>108300655
open weights is different tranny

Anonymous 03/05/26(Thu)12:05:45 No.108300670▶

>>108300129
I don't know if it's the fact that I'm using heretic but 27b sucks and repeats and spouts nonsense sooo much. The base model is too censored by default for my use case though, and it rejects any prompt that worked before.

Anonymous 03/05/26(Thu)12:06:26 No.108300674▶

>>108300661
there's no such a thing as a local proprietary backend
you still need to go back, cloudtard

Anonymous 03/05/26(Thu)12:06:34 No.108300676▶

>>108300650
There should be a global rule against twitter screenshots and bans should be permanent.

Anonymous 03/05/26(Thu)12:07:43 No.108300683▶

new bread
>>108300682
>>108300682
>>108300682
>>108300682
>>108300682
>>108300682

Anonymous 03/05/26(Thu)12:07:54 No.108300684▶

Reminder to ignore the early schizobake and stay here for the next few hours until the thread reaches page 10.

Anonymous 03/05/26(Thu)12:08:29 No.108300687▶

>>108300650
fuck. still no GPT 5 local and Sam keeps releasing

Anonymous 03/05/26(Thu)12:09:58 No.108300697▶

>>108300650
gpt-oss 2 wen

Anonymous 03/05/26(Thu)12:10:02 No.108300698▶

>>108299663
>rebuild the whole world (built for humanoids) from the ground up
or
>build a robot that works in the world
a difficult choice indeed

Anonymous 03/05/26(Thu)12:11:26 No.108300707▶

>>108300670
Sub 10B qwens are quite good for the size but I'm not impressed with the bigger ones.

Anonymous 03/05/26(Thu)12:13:35 No.108300722▶

File: file.png (637.4 KB)

>>108300698
>built for humanoids

Anonymous 03/05/26(Thu)12:18:04 No.108300737▶

>>108300722
and also: the human form isn't any more optimal than something like animal/alien hybrid. Something like boston dynamics's spot dog with an arm protruding from the back thing could operate most human things just fine, and four legged creatures are more stable than humans. Why would anyone think we are the ultimate form? humans are the weakest animal, like, ever. We can't kill/hunt anything with our bare hands/teeth. A fucking raccoon will ruin your day. We aren't even adapted to the bare minimum of survival to most ranges of temperature on earth: without clothes and fire, we freeze to death, or burn under the sun. Humans are not to be imitated.

Anonymous 03/05/26(Thu)12:18:56 No.108300739▶

>>108300722
those cars are mostly designed to transport humanoids to where they need to be to be productive, and robot cars already have even more investment than robot humans so it doesn't support the original complaint

Anonymous 03/05/26(Thu)12:20:08 No.108300743▶

File: dont-fist-android-girls.png (187 KB)

>>108300698
Just say you want to put your dick in the robot.

Anonymous 03/05/26(Thu)12:24:02 No.108300754▶

>>108300737
obviously humans are not the ultimate form, but in the short term they're by far the most useful. when we multiply our labor force by 100x with these things we can put them to work building the more efficient world and workers we'll need in the long term

Anonymous 03/05/26(Thu)12:26:05 No.108300765▶

>>108299662
>Accept shitty pay and working conditions or we will replace you with a clanker!

Anonymous 03/05/26(Thu)12:30:02 No.108300778▶

>>108300684
>>108292231

Anonymous 03/05/26(Thu)12:31:18 No.108300785▶

I'm back. Anything happen while I was gone?

Anonymous 03/05/26(Thu)12:31:29 No.108300787▶

>>108300778
I dunno man it's going fine.

Anonymous 03/05/26(Thu)12:32:34 No.108300790▶

>>108300785
Nope, still nemo.

Anonymous 03/05/26(Thu)12:33:04 No.108300795▶

File: Nemo taking observations on the Nautilus.jpg (93.4 KB)

>>108300790

Anonymous 03/05/26(Thu)12:37:48 No.108300817▶

What kinds of specs would you need to fine tune (LoRA) GPT OSS?
To fine tune MoE models, do you need enough memory to hold the full model or just the activated params?

Anonymous 03/05/26(Thu)12:41:58 No.108300830▶

File: file.png (32.4 KB)

>>108300817
https://docs.axolotl.ai/docs/models/gpt-oss.html

Anonymous 03/05/26(Thu)12:43:47 No.108300836▶

>>108300830
>axolotl
Well shit there you go.
Thank you very much anon.

Anonymous 03/05/26(Thu)13:15:37 No.108300994▶

>>108300830
>not using unslop colabs
cringe

Anonymous 03/05/26(Thu)13:26:31 No.108301055▶

>>108300994
Explain.

Anonymous 03/05/26(Thu)13:30:49 No.108301078▶

>Jamba2 Mini is an open source small language model built for enterprise reliability. With 12B active parameters (52B total),
I'm going to try and fuck this thing.

>>108300994
Explain.

Anonymous 03/05/26(Thu)13:44:59 No.108301141▶

>>108301055
>>108301078
dyor

Anonymous 03/05/26(Thu)13:51:30 No.108301165▶

>>108301141
qrd?

Anonymous 03/05/26(Thu)14:15:00 No.108301281▶

>us
>china
Where are the superior Nippon LLMs, folded 1000 times?

Anonymous 03/05/26(Thu)14:17:27 No.108301301▶

>>108301281
Didn't they make a super scaled up GPT2 trained on an all CPU super computer or something like that?

Anonymous 03/05/26(Thu)14:20:41 No.108301318▶

>>108300722
This image is AI, isn't it?

Anonymous 03/05/26(Thu)14:25:24 No.108301350▶

>>108296023
>ASICs for AI when?
already exists bro https://chatjimmy.ai/
t. dixie flatline

Anonymous 03/05/26(Thu)14:33:40 No.108301395▶

>>108301318
I grabbed an image from google but I don't think it is.

Anonymous 03/05/26(Thu)14:49:54 No.108301481▶

>>108301395
yes it is, why would there ever be a play field in the center of an on ramp

Anonymous 03/05/26(Thu)14:52:12 No.108301497▶

>>108297103
Zoomies don't know what a TUI is.

Anonymous 03/05/26(Thu)14:56:59 No.108301523▶

>>108301481
https://www.shutterstock.com/image-photo/this-beautiful-roundabout-top-view-shot-1135833710
>upload date: 2018

Anonymous 03/05/26(Thu)15:50:02 No.108301870▶

File: 1744155658238443.png (4 MB)

>>108301281
>If your vision of a dystopian future included robot monks presiding over ancient rituals, Kyoto University has brought that vision one step closer to reality. A research team from the university, in collaboration with the tech ventures Teraverse and XNOVA, recently unveiled a new AI-integrated robot monk — the Buddharoid — at the Shoren-in temple in Kyoto.

>The Buddharoid is designed to support the Buddhist clergy as Japan’s religious infrastructure faces a steady decline. It utilizes a system called BuddhaBot-Plus, a specialized generative AI derived from OpenAI’s ChatGPT that has been trained extensively on sacred Buddhist scriptures. This allows the robot to provide spiritual guidance on personal and social issues, like a real monk would.

>Beyond its conversational capabilities, the Buddharoid uses hardware — developed by China’s Unitree Robotics — to mimic the specific movements of a monk, including a slow gait, bowing and the gassho gesture of placing palms together in prayer.

Anonymous 03/05/26(Thu)16:19:21 No.108302087▶

>>108301497
hawk TUI spit on that thang!

Anonymous 03/05/26(Thu)16:29:54 No.108302185▶

File: 39642.png (59.5 KB)

>>108302087
close enough champ let's go

also why is OP an ultrafag who needs reminding who the queen of this site is?
