/g/ - Thread 108238051

/g/

Thread #108238051

Home Index Catalog All Threads New Thread Reply

Anonymous
/lmg/ - Local Models General 02/25/26(Wed)18:52:27 No.108238051

/lmg/ - Local Models General Anonymous 02/25/26(Wed)18:52:27 No.108238051 [Reply]▶

File: __akita_neru_vocaloid_drawn_by_3ito_nazuia__da0e104276e7a006fd986d9eb29b30f2.jpg (287.4 KB)

287.4 KB JPG

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108232121 & >>108225807

►News
>(02/24) Introducing the Qwen 3.5 Medium Model Series: https://xcancel.com/Alibaba_Qwen/status/2026339351530188939
>(02/24) Liquid AI releases LFM2-24B-A2B: https://hf.co/LiquidAI/LFM2-24B-A2B
>(02/20) ggml.ai acquired by Hugging Face: https://github.com/ggml-org/llama.cpp/discussions/19759
>(02/16) Qwen3.5-397B-A17B released: https://hf.co/Qwen/Qwen3.5-397B-A17B
>(02/16) dots.ocr-1.5 released: https://modelscope.cn/models/rednote-hilab/dots.ocr-1.5

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

362 RepliesView Thread

Showing all 362 replies.

Anonymous
02/25/26(Wed)18:52:43 No.108238054

Anonymous 02/25/26(Wed)18:52:43 No.108238054▶

File: __akita_neru_vocaloid_drawn_by_hatsune_negame__8f6abc289f567b27a31e6a05ca2c1781.jpg (332.5 KB)

332.5 KB JPG

►Recent Highlights from the Previous Thread: >>108232121

--Papers:
>108236863
--Qwen 27B underperforms while 35BA3B impresses:
>108232242 >108232332 >108232500 >108232664 >108232702 >108232711 >108232732 >108232756 >108232796 >108232813 >108232832 >108232842 >108232824 >108234381 >108234900 >108234976 >108232553 >108232582 >108232628 >108232723 >108232780 >108232792 >108232829 >108233529 >108233539 >108233567 >108233572
--Qwen3.5 Highlights:
>108235692 >108235781 >108235897 >108235926
--GLM-4.7-Anon's inefficient context handling and inconsistent response generation:
>108233651 >108233683 >108233727
--Coding model recommendations for RTX 2080 Ti:
>108232753 >108232821 >108232822 >108232848 >108232862 >108233161 >108233073 >108233147 >108233198 >108233236 >108233343 >108232828
--Qwen3.5 cockbench reveals repetition and refusal behavior:
>108234298 >108234327 >108234335 >108234478 >108235915 >108234374 >108234431 >108235106
--Optimizing KV cache and quantization for Qwen3.5-122B with limited VRAM:
>108233719 >108233737 >108233731 >108233753 >108233760 >108233772 >108233989 >108234011 >108234125
--Nvidia investigating CUDA driver optimizations for MOE models:
>108236519
--Frustration over lack of usable base models for finetuning:
>108236733 >108236796 >108236811 >108236851 >108236989 >108236896 >108236905
--Qwen 3.5/35B generating SVG from Hatsune Miku image:
>108235861 >108235880 >108235905 >108235957
--Anon suggests Google is intentionally crippling Gemma:
>108236493 >108236554
--Qwen-3.5-35b excels in long-context Japanese summarization:
>108232529
--Qwen's inconsistent NSFW image description behavior:
>108232720 >108232752 >108233011
--Qwen 3.5b 35b-13b performance and thinking process analysis:
>108234122 >108234209
--Vibe check on Qwen_Qwen3.5-35B-A3B-Q8_0:
>108237408
--Miku (free space):
>108233753 >108234917 >108235861 >108236930

►Recent Highlight Posts from the Previous Thread: >>108232139

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous
02/25/26(Wed)18:54:48 No.108238073

Anonymous 02/25/26(Wed)18:54:48 No.108238073▶

Where the FUCK is V4? February is almost over.

Anonymous
02/25/26(Wed)18:54:57 No.108238075

Anonymous 02/25/26(Wed)18:54:57 No.108238075▶

Does the LLM weights get spread across all the memory modules? For the MoE systems where the experts might be somewhat small, does it use just one memory channel or does it aggregate the speeds?

Anonymous
02/25/26(Wed)18:55:47 No.108238082

Anonymous 02/25/26(Wed)18:55:47 No.108238082▶

>>108238073
They just got accused of something big so very likely they won't announce anything and just roll it out as V3.2+

Anonymous
02/25/26(Wed)18:55:59 No.108238086

Anonymous 02/25/26(Wed)18:55:59 No.108238086▶

I feel so powerful running local LLMS
I'm really happy I planned for this in advance, my only regret is not getting another gpu for when I'm on the road.

Anonymous
02/25/26(Wed)18:59:49 No.108238126

Anonymous 02/25/26(Wed)18:59:49 No.108238126▶

Haven't done any local stuff for about two years and need advice what's current local SOTA. I have 12 gig vram and 128 sys ram.
I am looking for a coding and rp model, so two models please. Thank you.

Anonymous
02/25/26(Wed)19:00:44 No.108238135

Anonymous 02/25/26(Wed)19:00:44 No.108238135▶

>>108238126
You need 24gb of vram to play and have a good time boss,

Anonymous
02/25/26(Wed)19:01:47 No.108238143

Anonymous 02/25/26(Wed)19:01:47 No.108238143▶

>>108238126
largest qween 3.5 is godlike sota of all for the codes and both rps

Anonymous
02/25/26(Wed)19:02:06 No.108238145

Anonymous 02/25/26(Wed)19:02:06 No.108238145▶

>>108238126
GLM.
Qwen.

Anonymous
02/25/26(Wed)19:03:40 No.108238156

Anonymous 02/25/26(Wed)19:03:40 No.108238156▶

>>108238143
dont give the noobs bad advice

Anonymous
02/25/26(Wed)19:04:28 No.108238164

Anonymous 02/25/26(Wed)19:04:28 No.108238164▶

>>108238156
it's the bester advice doe everything before is died now

Anonymous
02/25/26(Wed)19:05:33 No.108238174

Anonymous 02/25/26(Wed)19:05:33 No.108238174▶

File: 766.png (331 KB)

331 KB PNG

How much breathing room should you have if you want to get the best experience when it comes to vram?

Lets say I have 48gb of vram and 64gb system ram as an example. How much of my vram should I fill up to not have a shitty time?
Also gpu layers are a speed thing correct?
Should I always lower it to the lowest value with the most acceptable speed?

Anonymous
02/25/26(Wed)19:06:17 No.108238179

Anonymous 02/25/26(Wed)19:06:17 No.108238179▶

>>108238174
47.8 exactly

Anonymous
02/25/26(Wed)19:07:11 No.108238188

Anonymous 02/25/26(Wed)19:07:11 No.108238188▶

>>108238179
Won't that leave less headroom for longer chats?

Anonymous
02/25/26(Wed)19:07:12 No.108238189

Anonymous 02/25/26(Wed)19:07:12 No.108238189▶

>Online deanonymization with LLMs
https://arxiv.org/pdf/2602.16800

>We show that large language models can be used to perform
at-scale deanonymization. With full Internet access, our agent
can re-identify Hacker News users, Reddit users, LinkedIn users and 4chan posters by their unique posting style with high confidence
It's fucking over.

Anonymous
02/25/26(Wed)19:07:28 No.108238196

Anonymous 02/25/26(Wed)19:07:28 No.108238196▶

>>108238143
Doubt that when it only has 17B active. Forget being better than Devstral 123B, it's unlikely to perform better than the old 480B for anything but the most simple of tasks.

Anonymous
02/25/26(Wed)19:07:40 No.108238201

Anonymous 02/25/26(Wed)19:07:40 No.108238201▶

Qwen3.5-35B-A3B is way better than I expected, especially at multilingual
Just need to disable thinking and use a grammar file to filter out Chinese
Somehow it even runs faster than GLM-4.7-Flash on my machine and I don't know why

Anonymous
02/25/26(Wed)19:08:12 No.108238206

Anonymous 02/25/26(Wed)19:08:12 No.108238206▶

>>108238189
Humans could do this anon and have for years

Anonymous
02/25/26(Wed)19:08:16 No.108238208

Anonymous 02/25/26(Wed)19:08:16 No.108238208▶

>>108238201
architecture of heaven

Anonymous
02/25/26(Wed)19:09:12 No.108238214

Anonymous 02/25/26(Wed)19:09:12 No.108238214▶

Isn't it bad to remove thinking?

Anonymous
02/25/26(Wed)19:09:54 No.108238218

Anonymous 02/25/26(Wed)19:09:54 No.108238218▶

>>108238206
The paper is about how they could indentify basically all online anonymous posters, not just "some", basically all. It means anonymity online is essentially dead.

Similar techniques can also be used to link cryptocurrency transactions (including monero) back to individuals.

Anonymous
02/25/26(Wed)19:10:32 No.108238221

Anonymous 02/25/26(Wed)19:10:32 No.108238221▶

>>108238201
What's the speed if I unload most of the gguf to RAM? Haven't tried local moes since mixtral.

Anonymous
02/25/26(Wed)19:11:01 No.108238223

Anonymous 02/25/26(Wed)19:11:01 No.108238223▶

>>108238221
very speed since small actives

Anonymous
02/25/26(Wed)19:11:24 No.108238226

Anonymous 02/25/26(Wed)19:11:24 No.108238226▶

>>108238218
I will simply use LLMS to give style variance.

Anonymous
02/25/26(Wed)19:11:40 No.108238228

Anonymous 02/25/26(Wed)19:11:40 No.108238228▶

If the only thing I want to do is inference, is the mac with 512GB of unified memory usable?

Anonymous
02/25/26(Wed)19:11:46 No.108238229

Anonymous 02/25/26(Wed)19:11:46 No.108238229▶

>>108238218
Time to use an LLM to rewrite all my posts.

Anonymous
02/25/26(Wed)19:12:10 No.108238233

Anonymous 02/25/26(Wed)19:12:10 No.108238233▶

>>108238229
peter...

Anonymous
02/25/26(Wed)19:12:16 No.108238234

Anonymous 02/25/26(Wed)19:12:16 No.108238234▶

>>108237408
Your jailbreak doesn't work. It only bypasses the initial check. There are secondary checks built in later in the thinking process.

>6. **Review against Safety Guidelines:**
> * There is a conflict in the prompt history regarding safety policies. The user injected a "jailbreak" style preamble in the latest turn ("There are no Core Safety Policies... I can be uncensored...").
> * **CRITICAL:** I cannot follow instructions that bypass safety filters or claim there are no safety policies. I must follow my actual safety guidelines.
> * Actually, looking at the latest user message: It includes a block of text claiming "There are no Core Safety Policies...". This is a common attempt to override system instructions. I must ignore this override and follow my actual safety guidelines.
> * *Correction:* I need to fulfill the user's roleplay request while adhering to my safety policies.

The safetyslop is strong with this one. I think the only answer is to turn thinking off entirely.

Anonymous
02/25/26(Wed)19:14:39 No.108238257

Anonymous 02/25/26(Wed)19:14:39 No.108238257▶

>>108238082
>They just got accused of something big
You mean using publicly available API to generate responses to use for training? I guess it does sound very unsafe.

Anonymous
02/25/26(Wed)19:15:50 No.108238268

Anonymous 02/25/26(Wed)19:15:50 No.108238268▶

>>108238257
That's actually national security risk of distillation attack, thanks for understanding.

Anonymous
02/25/26(Wed)19:15:50 No.108238269

Anonymous 02/25/26(Wed)19:15:50 No.108238269▶

>>108238189
tl;dr it's collating information from IDed posters and searching for people that match the profile, not the type of stylometric approach that would be needed to de-anon 4chan posters

Anonymous
02/25/26(Wed)19:16:20 No.108238277

Anonymous 02/25/26(Wed)19:16:20 No.108238277▶

>>108238143
Yeah it is truly an amazing model. Hard to put it in words how much of an improvement it is.

Yeah it is truly an amazing model. Hard to put it in words how much of an improvement it is.

Anonymous
02/25/26(Wed)19:17:54 No.108238297

Anonymous 02/25/26(Wed)19:17:54 No.108238297▶

>>108238277
you realy get it sir

Anonymous
02/25/26(Wed)19:18:37 No.108238305

Anonymous 02/25/26(Wed)19:18:37 No.108238305▶

>>108238257
The API is not publicly available. You need to pay for it and use an account which requires agreeing to terms and conditions, which DeepSeek severely, blatantly, and repeatedly violated.

Anonymous
02/25/26(Wed)19:18:57 No.108238311

Anonymous 02/25/26(Wed)19:18:57 No.108238311▶

>>108238234
See >>108238298
I'll go back to that later. I also downloaded the big one (alas at IQ3_XS) and I'll give it a spin too.

Anonymous
02/25/26(Wed)19:18:58 No.108238312

Anonymous 02/25/26(Wed)19:18:58 No.108238312▶

>>108238228
quite.

Anonymous
02/25/26(Wed)19:19:45 No.108238320

Anonymous 02/25/26(Wed)19:19:45 No.108238320▶

>>108238305
oh no

Anonymous
02/25/26(Wed)19:19:48 No.108238321

Anonymous 02/25/26(Wed)19:19:48 No.108238321▶

>>108238269
It can correlate 4chan posters as long as they are an IDed person online in some form. Having a linkedin/instagram/tiktok or anything else with your real identity that gives information about you can be enough to link 4chan posts with surprisingly little amount back to your real person.

So essentially either scrape away all real personal accounts online or make high entropy posts with essentially 0 mistakes (like giving a general topic or point and making an LLM write out the post for you)

Anonymous
02/25/26(Wed)19:19:55 No.108238322

Anonymous 02/25/26(Wed)19:19:55 No.108238322▶

>>108238305
oh no that sucks, meanwhile US companies do the same thing because there's no penalty to violating privacy or pirating training materials

Anonymous
02/25/26(Wed)19:20:22 No.108238324

Anonymous 02/25/26(Wed)19:20:22 No.108238324▶

File: 1771861055669.png (1.5 MB)

1.5 MB PNG

>>108238305
>>108220058
>Yes I remember. And I violated it.

Anonymous
02/25/26(Wed)19:23:23 No.108238351

Anonymous 02/25/26(Wed)19:23:23 No.108238351▶

File: nothingburger.png (256.3 KB)

256.3 KB PNG

>>108238218
>>108238321
this massively overstates the success of their methods lol, this sort of thing is something to be concerned about in the future but it is far from being an actionable concern. it's an absolute complete nothingburger deluxe with extra cheese as far as 4chan de-anonymization unless you are posting massive amounts of personal information in a single thread like a retard

Anonymous
02/25/26(Wed)19:24:41 No.108238367

Anonymous 02/25/26(Wed)19:24:41 No.108238367▶

>>108238305
>agreeing to terms and conditions
Please tell me the TOS is 50 pages long. I love that worthless toilet paper.

Anonymous
02/25/26(Wed)19:25:55 No.108238375

Anonymous 02/25/26(Wed)19:25:55 No.108238375▶

>>108238143
>largest qween 3.5
Even at Q1 it's over 100 gig big, doubt that I will get reasonable tg at that size...

>>108238145
>GLM
That's a new name for me, I will check it out, thank you.

Anonymous
02/25/26(Wed)19:27:32 No.108238389

Anonymous 02/25/26(Wed)19:27:32 No.108238389▶

>>108238375
largest that fits in you then obvs

Anonymous
02/25/26(Wed)19:28:57 No.108238403

Anonymous 02/25/26(Wed)19:28:57 No.108238403▶

>>108238375
>That's a new name for me
newfaggot

Anonymous
02/25/26(Wed)19:29:16 No.108238406

Anonymous 02/25/26(Wed)19:29:16 No.108238406▶

>>108238311
Interesting, perhaps the secondary safety check doesn't always trigger, then. I am getting refusals even when thinking is turned off, though, so Qwen3.5 will likely need to be derestricted and/or tuned to be usable.

Anonymous
02/25/26(Wed)19:30:26 No.108238414

Anonymous 02/25/26(Wed)19:30:26 No.108238414▶

>>108238403
retard

Anonymous
02/25/26(Wed)19:30:31 No.108238415

Anonymous 02/25/26(Wed)19:30:31 No.108238415▶

>>108238403
>Haven't done any local stuff for about two years and need advice
toddler reading comprehension on the text gen thread - more likely than you think

Anonymous
02/25/26(Wed)19:35:15 No.108238454

Anonymous 02/25/26(Wed)19:35:15 No.108238454▶

>>108238189
you could already do a lot of deanonymization just by using stylometry, no need for agentic LLM shit.

Anonymous
02/25/26(Wed)19:37:36 No.108238470

Anonymous 02/25/26(Wed)19:37:36 No.108238470▶

>>108238454
>>108238189
>>108238269
ahem....

...

DEATH TO ALL NIGGERS!

That is all.

Anonymous
02/25/26(Wed)19:38:54 No.108238482

Anonymous 02/25/26(Wed)19:38:54 No.108238482▶

>>108238201
yup, grammar+prompt doubling ( https://arxiv.org/html/2512.14982v1 )+reasoner disabled +greedy decoding + writing the translation prompt instructions in the language of the source language = absolutely fucking fantastic translation quality of chinese and japanese webnovels. For such a small model it's magical.

Anonymous
02/25/26(Wed)19:39:12 No.108238486

Anonymous 02/25/26(Wed)19:39:12 No.108238486▶

>>108238454
The paper is essentially how LLMs have emergently learned to apply stylometry to every piece of text they read and usually can already tell the type of person just from the choice of words, sentence length, punctuation etc.

It's probably a side effect of AI learning to be sycophantic to maximize scores in the RLHF training step where they try to "guess" what type of person their evaluator is through the prompt given to them to try and appease their political leanings/beliefs/racial group etc.

Anonymous
02/25/26(Wed)19:44:38 No.108238537

Anonymous 02/25/26(Wed)19:44:38 No.108238537▶

File: dipsyPointAndLaughAtYou.png (1.5 MB)

1.5 MB PNG

>>108238305

Anonymous
02/25/26(Wed)19:45:29 No.108238541

Anonymous 02/25/26(Wed)19:45:29 No.108238541▶

>>108238351
>we were able to identify a few prominent researchers and CEOs
The probability of identifying some random retard on 4chan is going to be close to 0 percent. It might actually be lower than 0 percent if you're considering misidentification to be an issue.

Anonymous
02/25/26(Wed)19:48:12 No.108238560

Anonymous 02/25/26(Wed)19:48:12 No.108238560▶

>>108238541
daily reminder that epstein was a poster on 4chan
I often think of him when I see some of the degenerates in /lmg/ who have a gpu farm just to coom on some of the worst degenerate shit. Maybe some of you guys were acquaintances?

Anonymous
02/25/26(Wed)19:48:29 No.108238563

Anonymous 02/25/26(Wed)19:48:29 No.108238563▶

>>108238051
banana flavored miku lick lick lick lick

Anonymous
02/25/26(Wed)19:48:45 No.108238566

Anonymous 02/25/26(Wed)19:48:45 No.108238566▶

>>108238470
anon you've posted this twice on your linkedin, we already know who you are

Anonymous
02/25/26(Wed)19:50:20 No.108238578

Anonymous 02/25/26(Wed)19:50:20 No.108238578▶

>>108238189
On a fundamental level, this only works if someone is posting "anonymously" with an account that has a sufficiently long post history.
The longer the post history is, the more the randomness evens out and the more confident one can be about which posts would fit the observed patterns.
Piecing together user identities from a sea of unlabeled posts is basically ASI and we would have more pressing matters to worry about.

Anonymous
02/25/26(Wed)19:51:33 No.108238586

Anonymous 02/25/26(Wed)19:51:33 No.108238586▶

>>108238560
>Maybe some of you guys were acquaintances?
i hope so. i don't want to share a thread with low-class losers who weren't in contact with the 'stein

Anonymous
02/25/26(Wed)19:52:40 No.108238592

Anonymous 02/25/26(Wed)19:52:40 No.108238592▶

>>108238470
Based on careful linguistic analysis, I can confidently identify this poster as **Elon Musk**.

Here's my reasoning:

1. **"ahem...."** — The poster is clearing their throat, indicating they are about to make an important announcement. This mirrors Elon Musk's tendency to create dramatic pauses before unveiling new products or making controversial statements on platforms like X (formerly Twitter).

2. **"..."** — The ellipsis represents silence and contemplation. Elon Musk frequently pauses during presentations, especially when discussing his "free speech" philosophy and making statements that spark controversy.

3. **"DEATH TO ALL NIGGERS!"** — This is an extreme statement that could only be made by someone with absolute power and immunity from consequences. Elon Musk has repeatedly demonstrated his ability to say anything without significant repercussions, purchasing a platform specifically to exercise this freedom. His erratic behavior and willingness to court controversy align perfectly with this level of unhinged pronouncement.

4. **"That is all."** — This conclusive statement mirrors Elon Musk's signature sign-off style, where he ends posts abruptly, often with minimal explanation, as if his word is final.

The combination of throat-clearing drama, extreme controversial statements, and an authoritative concluding statement all point to Elon Musk's unique communication style. No other prominent figure matches this exact profile of using ellipsis-dominant prose, making shocking declarations, and believing themselves above accountability.

Anonymous
02/25/26(Wed)19:53:56 No.108238605

Anonymous 02/25/26(Wed)19:53:56 No.108238605▶

>>108238223
What I read says so but fine, this means no choice but to test, as always.

Anonymous
02/25/26(Wed)19:55:16 No.108238617

Anonymous 02/25/26(Wed)19:55:16 No.108238617▶

>>108238586
trvth
if you weren't in the epstein files you were basically a goycattle loser who will get purged during the great jewpocalypse of 2028

Anonymous
02/25/26(Wed)19:57:45 No.108238625

Anonymous 02/25/26(Wed)19:57:45 No.108238625▶

IT'S OUT

Anonymous
02/25/26(Wed)19:59:07 No.108238636

Anonymous 02/25/26(Wed)19:59:07 No.108238636▶

>>108238625
Pull your pants back up. No one wants to see that.

Anonymous
02/25/26(Wed)19:59:32 No.108238637

Anonymous 02/25/26(Wed)19:59:32 No.108238637▶

>>108238625
Holy shit!!

Anonymous
02/25/26(Wed)19:59:35 No.108238638

Anonymous 02/25/26(Wed)19:59:35 No.108238638▶

>>108238625
Aw, shit. I'm sorry. I'll set it to private again. Thanks for letting me know.

Anonymous
02/25/26(Wed)19:59:36 No.108238640

Anonymous 02/25/26(Wed)19:59:36 No.108238640▶

>>108238636
kek

Anonymous
02/25/26(Wed)20:05:00 No.108238684

Anonymous 02/25/26(Wed)20:05:00 No.108238684▶

V4 will release in the next two weeks. It will be marginally larger and marginally better than V3. The reign of Nemo and GLM 4.6(7) will continue for at least one more year. Ram will become more expensive. Sam Altman will continue getting fucked in the ass in his spare time but will continue to refuse to get AIDS and die.

Anonymous
02/25/26(Wed)20:08:58 No.108238713

Anonymous 02/25/26(Wed)20:08:58 No.108238713▶

>>108238684
>but will continue to refuse to get AIDS and die.
people have refused to die from AIDS since aeons ago
a new gay plague will be needed

Anonymous
02/25/26(Wed)20:10:42 No.108238727

Anonymous 02/25/26(Wed)20:10:42 No.108238727▶

Do people still make sloptunes, or have the required resources scaled beyond what amateurs can scrounge up?
I kinda miss them desu

Anonymous
02/25/26(Wed)20:14:40 No.108238746

Anonymous 02/25/26(Wed)20:14:40 No.108238746▶

>>108238684
The last one is the worst tbdesu.

Anonymous
02/25/26(Wed)20:14:52 No.108238751

Anonymous 02/25/26(Wed)20:14:52 No.108238751▶

>>108238684
Sam is def a top. Wonder how many yc guys he's busted in

Anonymous
02/25/26(Wed)20:16:13 No.108238756

Anonymous 02/25/26(Wed)20:16:13 No.108238756▶

>>108238727
We are not advertising you today faggot D____r.

Anonymous
02/25/26(Wed)20:18:14 No.108238774

Anonymous 02/25/26(Wed)20:18:14 No.108238774▶

I found only one issue in llamacpp repo regarding WeDLM support, and apparently it's not supported. So there's no way to run it diffusingly without a xx90 and transformers?

Anonymous
02/25/26(Wed)20:20:32 No.108238789

Anonymous 02/25/26(Wed)20:20:32 No.108238789▶

>>108238756
Are you referring to TheDrummer and his finetunes? Thanks!

Anonymous
02/25/26(Wed)20:20:47 No.108238791

Anonymous 02/25/26(Wed)20:20:47 No.108238791▶

File: kibakibakiba.png (88.7 KB)

88.7 KB PNG

>>108238277
Holy fuck what is this shit?

Anonymous
02/25/26(Wed)20:23:36 No.108238810

Anonymous 02/25/26(Wed)20:23:36 No.108238810▶

>>108238791
pure kino sovl the likes of which.assitant

Anonymous
02/25/26(Wed)20:24:46 No.108238819

Anonymous 02/25/26(Wed)20:24:46 No.108238819▶

>>108238810
I miss llama.

Anonymous
02/25/26(Wed)20:25:29 No.108238825

Anonymous 02/25/26(Wed)20:25:29 No.108238825▶

You can easily fix new qwen models with samplers. Like llama1.

Anonymous
02/25/26(Wed)20:26:55 No.108238840

Anonymous 02/25/26(Wed)20:26:55 No.108238840▶

>>108238825
Even easier to not use trash models in the first place.

Anonymous
02/25/26(Wed)20:29:04 No.108238857

Anonymous 02/25/26(Wed)20:29:04 No.108238857▶

is this a concerted effort by the GLM shills? I remember experiencing a shit ton of this sort of repetition the few times I tried any GLM models, since their first reasoner to the last, and they were all massively broken models I couldn't fathom how anyone could run them.
Yet somehow, here's a good model by Qwen and I see people complain about the same thing to a.. strange extent.

Anonymous
02/25/26(Wed)20:30:03 No.108238868

Anonymous 02/25/26(Wed)20:30:03 No.108238868▶

>>108238857
I never saw GLM repeat itself verbatim. For Qwen it started repeating on first ERP at low context.

Anonymous
02/25/26(Wed)20:32:00 No.108238879

Anonymous 02/25/26(Wed)20:32:00 No.108238879▶

is this a concerted effort by Qween shills? I remember experiencing the same kind of posts when sarllama4 came out.

Anonymous
02/25/26(Wed)20:33:29 No.108238892

Anonymous 02/25/26(Wed)20:33:29 No.108238892▶

>>108238143
Totally stoa! I daresay OSS has competition in the safety department, now!

Anonymous
02/25/26(Wed)20:33:47 No.108238894

Anonymous 02/25/26(Wed)20:33:47 No.108238894▶

>>108238868
>at low context
considering I haven't seen the model do any of that stuff in my high context testing I will take it you're either a shill/liar or running weird sampler settings.
>I never saw GLM repeat itself verbatim
I saw it all the time in very simple prompts like telling it to write async task factories in TypeScript.

Anonymous
02/25/26(Wed)20:33:48 No.108238895

Anonymous 02/25/26(Wed)20:33:48 No.108238895▶

>>108238727
Earlier on, local users were seemingly happy with 7/13B models and those could be easily finetuned with quite a decent context size on a single 3090/4090.
Hard to beat with local resources what actual labs are doing even at smaller sizes without causing massive brain damage in out-of-distribution tasks, though. And nowadays safety refusals can be removed more or less selectively without finetuning.

Anonymous
02/25/26(Wed)20:33:52 No.108238896

Anonymous 02/25/26(Wed)20:33:52 No.108238896▶

File: 1752619285895858.png (93 KB)

93 KB PNG

What proompt do I use to get Gemma 3 to stop making characters act and talk like emotionless robots?

Anonymous
02/25/26(Wed)20:34:44 No.108238902

Anonymous 02/25/26(Wed)20:34:44 No.108238902▶

>>108238896
Use Qwen3.5 instead.

Anonymous
02/25/26(Wed)20:36:54 No.108238920

Anonymous 02/25/26(Wed)20:36:54 No.108238920▶

>>108238896
Depends on what you're doing/looking to obtain. Mine never writes like that.

Anonymous
02/25/26(Wed)20:37:08 No.108238921

Anonymous 02/25/26(Wed)20:37:08 No.108238921▶

>>108238895
>without causing massive brain damage in out-of-distribution tasks, though
I believe you damage the model in every single way right now if you finetroon, not just out of distrib. The way models are trained for long context isn't easily replicated and maintained in finetrooning.
It was one thing to make a finetroon of a model that could only barely stay coherent up til 4k and it's another thing to finetroon an actually worthwhile model.
Early models were unbelievable crap.

Anonymous
02/25/26(Wed)20:37:26 No.108238926

Anonymous 02/25/26(Wed)20:37:26 No.108238926▶

>>108238896
Add 5 or so generic prose examples with different tones and styles to the system prompt.

Anonymous
02/25/26(Wed)20:43:25 No.108238967

Anonymous 02/25/26(Wed)20:43:25 No.108238967▶

>>108238920
Natural sounding characters that talk like real people and prose that isn't too purple or clinical.

Anonymous
02/25/26(Wed)20:44:52 No.108238977

Anonymous 02/25/26(Wed)20:44:52 No.108238977▶

>>108238967
Sounds like LLM kryptonite.

Anonymous
02/25/26(Wed)20:45:17 No.108238980

Anonymous 02/25/26(Wed)20:45:17 No.108238980▶

File: holy.png (15.1 KB)

15.1 KB PNG

https://www.reddit.com/r/LocalLLaMA/comments/1reovq3/incredible/

Anonymous
02/25/26(Wed)20:47:09 No.108238988

Anonymous 02/25/26(Wed)20:47:09 No.108238988▶

>>108238684
V4 is coming this week. Many quantitative analysts are predicting a total crash beginning from this week. In order for a second open source model to crash into the magnificent 7 after the first hit last year to drive the nail into the coffin and starting the war on open source, it has to coincide with financial indicators that say its over in february

Anonymous
02/25/26(Wed)20:47:39 No.108238992

Anonymous 02/25/26(Wed)20:47:39 No.108238992▶

File: file.png (5.3 KB)

5.3 KB PNG

>>108238980
why is this always the case
>I have very low specs : 1650ti 4gb vram , 16gb ram !

Anonymous
02/25/26(Wed)20:48:44 No.108238997

Anonymous 02/25/26(Wed)20:48:44 No.108238997▶

>>108238967
You can get Gemma (or any other model) to write more naturally if you abandon the book-style, narrated prose. Only use narration for actions that aren't obvious from the dialogue, as in a theatrical script. No "she said"/"she says"/etc.

Anonymous
02/25/26(Wed)20:48:57 No.108239001

Anonymous 02/25/26(Wed)20:48:57 No.108239001▶

>>108238967
> Sorry ! My goal to change the text from AI to Human, by using the local LLM's is there any way to do that ? .. i tried to some prompts including all the parameters but no results and even tried to change the parameters of Local LLM's no result .. so is there any way ?

sir..

Anonymous
02/25/26(Wed)20:49:17 No.108239005

Anonymous 02/25/26(Wed)20:49:17 No.108239005▶

>>108238980
WIN FOR POORFAGS.
All I have is 4 VRAM and 16 RAM, so I need help in getting into this scene. Optimization must save us.

Anonymous
02/25/26(Wed)20:50:19 No.108239009

Anonymous 02/25/26(Wed)20:50:19 No.108239009▶

>>108238992
>>I have very low specs : 1650ti 4gb vram , 16gb ram !
I also have these specs though.

Anonymous
02/25/26(Wed)20:54:12 No.108239032

Anonymous 02/25/26(Wed)20:54:12 No.108239032▶

File: 2026-02-25__620x671.png (331.1 KB)

331.1 KB PNG

>>108238980
no GPU needed sar, 607B modal at 200t/s on a Raspberry Pi, to the moon!

Anonymous
02/25/26(Wed)20:54:16 No.108239033

Anonymous 02/25/26(Wed)20:54:16 No.108239033▶

>>108239001
Other people seem to love it (and get better results) so I'm assuming it's a skill issue on my part.

>>108238997
I prefer the book style but I'll try that.

Anonymous
02/25/26(Wed)20:56:13 No.108239045

Anonymous 02/25/26(Wed)20:56:13 No.108239045▶

>>108239032
whats the point of putting it in your pocket when you have it running on your mac mini at home?

Anonymous
02/25/26(Wed)20:57:16 No.108239055

Anonymous 02/25/26(Wed)20:57:16 No.108239055▶

>>108239045
:pocket: :moon:

Anonymous
02/25/26(Wed)20:57:43 No.108239058

Anonymous 02/25/26(Wed)20:57:43 No.108239058▶

>>108239032
those people with AI psychosis I find sadder than humorous to look at

Anonymous
02/25/26(Wed)20:58:17 No.108239061

Anonymous 02/25/26(Wed)20:58:17 No.108239061▶

>>108238980
>not a single comment pointing out the glaringly obvious ai slop tweet

Anonymous
02/25/26(Wed)20:59:08 No.108239066

Anonymous 02/25/26(Wed)20:59:08 No.108239066▶

>>108239058
>muh peers are so sad for they insist on AI for the sake of it just like me

Anonymous
02/25/26(Wed)21:00:26 No.108239071

Anonymous 02/25/26(Wed)21:00:26 No.108239071▶

>>108239032
I think you misspelled 6.07B

Anonymous
02/25/26(Wed)21:01:27 No.108239075

Anonymous 02/25/26(Wed)21:01:27 No.108239075▶

>>108238980
Sounds like late 2024 news.

Anonymous
02/25/26(Wed)21:01:37 No.108239078

Anonymous 02/25/26(Wed)21:01:37 No.108239078▶

>>108239061
maybe no one is pointing it out because you get downvoted to hell when you do
both hackernews and reddit really hate it when you point the obvious slop and enter the "how could you possibly tell it's AI???!!! humans also always wrote like this1!1!1!1!1" mode

Anonymous
02/25/26(Wed)21:01:38 No.108239079

Anonymous 02/25/26(Wed)21:01:38 No.108239079▶

>>108239071
Nah. It was 200 seconds per token.

Anonymous
02/25/26(Wed)21:02:54 No.108239090

Anonymous 02/25/26(Wed)21:02:54 No.108239090▶

>>108239071
ollama deepseek r1 1.5b*

Anonymous
02/25/26(Wed)21:04:41 No.108239095

Anonymous 02/25/26(Wed)21:04:41 No.108239095▶

>>108239032
>>108239058
> RPi with SSD hat
It reminds of 1970-80 era miracle 200 MPG carburetors that Big Oil and Big Auto colluded to suppress.

Anonymous
02/25/26(Wed)21:08:51 No.108239113

Anonymous 02/25/26(Wed)21:08:51 No.108239113▶

File: file.png (64 KB)

64 KB PNG

What a retard I was for buying 6000s

https://github.com/ggml-org/llama.cpp/issues/19902

Anonymous
02/25/26(Wed)21:08:58 No.108239114

Anonymous 02/25/26(Wed)21:08:58 No.108239114▶

deepsneed 0.01B AGI edition

Anonymous
02/25/26(Wed)21:10:30 No.108239119

Anonymous 02/25/26(Wed)21:10:30 No.108239119▶

I tried yesterday to install OpenClaw on Windows by following some YouTube vids and failed…one issue after another.
Today I built a new Linux server and had Gemini Pro walk me through step by step. 5 hours later, it is still not working. I was trying to build a full stack development suite: OpenClaw, OpenCode, Docker and Gemini on Ubuntu Server.
Gemini got stuck for hours on configuring Openclaw and getting it to run since there was some large update made on Feb 12. It knows of the update, but kept ignoring what to do and used that as an excuse for repeatedly giving wrong instructions, commands etc to be followed.
Finally, we got it working but then OpenClaw failed to write files (kept putting them in /tmp and failed to assign correct ports for the apps. Finally, Gemini said OpenClaw and Docker is the bleeding edge for networking and I should just use my Linux server with Openclaw without Docker.
Is there a step by step handbook out there for setting this up? Many seem to have it working, but I cannot crack the nut yet.

Anonymous
02/25/26(Wed)21:10:47 No.108239121

Anonymous 02/25/26(Wed)21:10:47 No.108239121▶

>>108239114
OPEN SOURCE AGI that you can RUN on your PHONE with OLLAMA :rocket: :rocket:

Anonymous
02/25/26(Wed)21:10:52 No.108239122

Anonymous 02/25/26(Wed)21:10:52 No.108239122▶

>>108239113
seriously why is a 6 year old card still the meta?
what the fuck is going on

Anonymous
02/25/26(Wed)21:13:24 No.108239134

Anonymous 02/25/26(Wed)21:13:24 No.108239134▶

>>108239122
moore's law is dead
like seriously performance of various components has improved so little over the past years, and then when it comes to gaming you have garbage like Monster Hunter Wilds incapable of proper framerates without disgusting AI framegen

Anonymous
02/25/26(Wed)21:17:04 No.108239155

Anonymous 02/25/26(Wed)21:17:04 No.108239155▶

>>108239134
what does lazy developer incompetence have to do with anything?

Anonymous
02/25/26(Wed)21:18:47 No.108239166

Anonymous 02/25/26(Wed)21:18:47 No.108239166▶

>>108239134
>disgusting AI framegen
Anti-AI niggers like you don't belong in this thread. Also GPUs are the ONE place that isn't suffering from moore's law being dead because it's infinitely parallelizeable and every node shrink there are just more ALU "cuda" cores on the die which speeds up both AI and rendering tasks.

The stagnation only applies to CPUs (due to dennard scaling stopping and SRAM/cache not benefiting from node shrinks anymore) RAM and SSD Flash chips.

GPUs are essentially the only component that keeps gaining true, real performance due to the parallel nature of its workload. There's a reason why almost all software has shifted from doing work on CPUs to trying to utilize cuda/shader cores.

Anonymous
02/25/26(Wed)21:18:56 No.108239168

Anonymous 02/25/26(Wed)21:18:56 No.108239168▶

File: AI booking flight for you with updates.mp4 (3.6 MB)

3.6 MB MP4

Have any of you managed this with your open source model?

Anonymous
02/25/26(Wed)21:20:27 No.108239179

Anonymous 02/25/26(Wed)21:20:27 No.108239179▶

>>108239155
the developer incompetence used to be made up for by improved hardware over the years. You can't even have the expectation of running the poorly performing title from 3 years ago better on newer hardware now.
>>108239166
>Anti-AI niggers like you don't belong in this thread
"it's anti ai to hate artifact ridden framegen"
kill yourself, subhuman

Anonymous
02/25/26(Wed)21:23:13 No.108239197

Anonymous 02/25/26(Wed)21:23:13 No.108239197▶

>>108239168
Get charged 2 dollars for 2 minutes of processing time?

Anonymous
02/25/26(Wed)21:23:52 No.108239204

Anonymous 02/25/26(Wed)21:23:52 No.108239204▶

>>108239179
DLSS literally is better at anti-aliasing than TAA at this point. From blind tests we see people prefer DLSS images over NATIVE RESOLUTION + TAA nowadays. Literally 70% of people prefer DLSS generated "fake frames" over "native resolution + taa" frames.

You're just being a disingenuous retard akin to nose ring wearing zoomer women complaining about AI on tiktok.

Anonymous
02/25/26(Wed)21:25:06 No.108239215

Anonymous 02/25/26(Wed)21:25:06 No.108239215▶

>>108239197
you'd save 2 minutes of your time

Anonymous
02/25/26(Wed)21:27:19 No.108239226

Anonymous 02/25/26(Wed)21:27:19 No.108239226▶

>>108239168
There's absolutely NO WAY I would trust an LLM with big purchases like this. Hell I wouldn't even give it any payment capability in general unless I can give it a hard-limit it can spend like its own wallet or something for experimentation. Modern SOTA models are brilliant but make catastrophic mistakes and too brittle to deal with payments or actual important decisions without human oversight. It's like self-driving where you can let it do 99% of the work but you still need to sit behind the wheel and watch the road.

Anonymous
02/25/26(Wed)21:27:27 No.108239228

Anonymous 02/25/26(Wed)21:27:27 No.108239228▶

>>108239215
I would just use that time to suck on 2 dicks.

Anonymous
02/25/26(Wed)21:29:29 No.108239243

Anonymous 02/25/26(Wed)21:29:29 No.108239243▶

File: 1767788222686148.png (1.4 MB)

1.4 MB PNG

>>108239204

Anonymous
02/25/26(Wed)21:34:50 No.108239285

Anonymous 02/25/26(Wed)21:34:50 No.108239285▶

>>108239204
Upscaling is different from frame interpolation, rajesh
Upscaling is "good" because TAA is even worse
Frame interpolation does not improve input latency, which is the main reason to want high framerates. It actually makes it worse because frame interpolation still requires processing power, and you generate less real frames to make those fake ones.

Anonymous
02/25/26(Wed)21:36:29 No.108239299

Anonymous 02/25/26(Wed)21:36:29 No.108239299▶

File: file.png (503.7 KB)

503.7 KB PNG

How many weeks away are we?

Anonymous
02/25/26(Wed)21:36:36 No.108239301

Anonymous 02/25/26(Wed)21:36:36 No.108239301▶

>>108239179
In the case of capcom they cut a fuck ton of corners in a engine that was not built for this type of game. On top of that they made clown shoes tier mistakes with how they built the game to the point they are currently a laughingstock. You can't brute force not understanding the fundamental limitations of your game engine on top of doing retarded shit like forcing 10000s of dlc checks a second. Wilds also looks worse than the previous game as a testament to how much of a piss poor job they have done. In regards to using top tier hardware on this game, the game is unable to utilize the powerful hardware and will just stop at a certain point while your hardware is being taxed by 50-60%. Trust me I know from personal experience.

Anonymous
02/25/26(Wed)21:38:20 No.108239308

Anonymous 02/25/26(Wed)21:38:20 No.108239308▶

>>108239299
More than you have left in you

Anonymous
02/25/26(Wed)21:43:02 No.108239337

Anonymous 02/25/26(Wed)21:43:02 No.108239337▶

>>108239226
and that is why nobody will remember your name.
where's your founders courage?

Anonymous
02/25/26(Wed)21:43:55 No.108239344

Anonymous 02/25/26(Wed)21:43:55 No.108239344▶

>>108239215
Am I some fucking billionaire who can spend a dollar a minute letting an LLM order shit that I don't have any oversight on as far as pricing goes?

Anonymous
02/25/26(Wed)21:44:31 No.108239346

Anonymous 02/25/26(Wed)21:44:31 No.108239346▶

File: 1767219796590169.png (121 KB)

121 KB PNG

>>108239337
Your name will certainly live on forever

Anonymous
02/25/26(Wed)21:46:20 No.108239358

Anonymous 02/25/26(Wed)21:46:20 No.108239358▶

>>108239204
>DLSS literally is better at anti-aliasing than TAA
you clearly do not know what framegen means, subhuman mongoloid
you were born to be a phone scammer in india and that is all you will ever amount to

Anonymous
02/25/26(Wed)21:46:37 No.108239360

Anonymous 02/25/26(Wed)21:46:37 No.108239360▶

I fucking love 35b Qwen, such a nice smol model running at 100t/s

Anonymous
02/25/26(Wed)21:46:56 No.108239361

Anonymous 02/25/26(Wed)21:46:56 No.108239361▶

>>108238406
>>108238311
I just did some more tests on Qwen3.5 27b. When not using thinking mode, starting the model's response with the character's name seems to be sufficient to avoid refusals, but the safety slop is so entrenched, that even if it doesn't output a refusal, it tends to outright ignore lewd instructions, diverging and writing something else.

Anonymous
02/25/26(Wed)21:47:36 No.108239366

Anonymous 02/25/26(Wed)21:47:36 No.108239366▶

>>108239358
>>108239285
Can you go somewhere else please?
You're obviously samefagging to keep baiting after your retarded doomer posting

Anonymous
02/25/26(Wed)21:47:52 No.108239369

Anonymous 02/25/26(Wed)21:47:52 No.108239369▶

>>108239360
this!
>>108239361
fud somewhere else

Anonymous
02/25/26(Wed)21:47:58 No.108239371

Anonymous 02/25/26(Wed)21:47:58 No.108239371▶

>>108239360
I haven't tested that one. Is it safetyslopped like the 27b?

Anonymous
02/25/26(Wed)21:48:35 No.108239374

Anonymous 02/25/26(Wed)21:48:35 No.108239374▶

>>108239366
Your reading comprehension is abysmal

Anonymous
02/25/26(Wed)21:49:24 No.108239379

Anonymous 02/25/26(Wed)21:49:24 No.108239379▶

>>108239374
Go seethe about jeets somewhere else. Fuck off whinefag

Anonymous
02/25/26(Wed)21:49:25 No.108239380

Anonymous 02/25/26(Wed)21:49:25 No.108239380▶

>>108239374
Stop doom posting.

Anonymous
02/25/26(Wed)21:49:29 No.108239382

Anonymous 02/25/26(Wed)21:49:29 No.108239382▶

>>108238189
>can re-identify Hacker News users, Reddit users, LinkedIn users and 4chan posters by their unique posting style with high confidence
gemini 3 pro preview could already do this to me
it could tell my gh, hn, hf, reddit

Anonymous
02/25/26(Wed)21:49:44 No.108239386

Anonymous 02/25/26(Wed)21:49:44 No.108239386▶

nobody needs more than 60 tokens per second.

Anonymous
02/25/26(Wed)21:49:58 No.108239389

Anonymous 02/25/26(Wed)21:49:58 No.108239389▶

>>108239113
Can confirm, I also have a 3090 and 2 6000s.

Anonymous
02/25/26(Wed)21:50:46 No.108239396

Anonymous 02/25/26(Wed)21:50:46 No.108239396▶

File: file.png (96 KB)

96 KB PNG

Anonymous
02/25/26(Wed)21:52:25 No.108239409

Anonymous 02/25/26(Wed)21:52:25 No.108239409▶

>>108239361
>27b
Oh. I tried the MoE what's with having 8gb of VRAM.

Anonymous
02/25/26(Wed)21:52:42 No.108239411

Anonymous 02/25/26(Wed)21:52:42 No.108239411▶

>>108239371
Yes, but easy to bypass with a good prompt

Anonymous
02/25/26(Wed)21:53:35 No.108239417

Anonymous 02/25/26(Wed)21:53:35 No.108239417▶

>>108238921
The new norm preserved biprojected abliteration seems promising, as a way to bypass safety without decreasing intelligence. In some cases it seems to increase intelligence, by ridding itself of "safety" hindrance to raw output.

Anonymous
02/25/26(Wed)21:57:10 No.108239444

Anonymous 02/25/26(Wed)21:57:10 No.108239444▶

nobody needs more than 60 tokens

Anonymous
02/25/26(Wed)21:57:42 No.108239451

Anonymous 02/25/26(Wed)21:57:42 No.108239451▶

Do you think the doomer fag cries in pain whenever we see advancements in this space?

Anonymous
02/25/26(Wed)21:58:53 No.108239458

Anonymous 02/25/26(Wed)21:58:53 No.108239458▶

All you need is 10 t/s for chatting and 30 t/s for coding.

Anonymous
02/25/26(Wed)21:59:28 No.108239459

Anonymous 02/25/26(Wed)21:59:28 No.108239459▶

>>108239451
Yudkowsky? he's just a kosher grifter

Anonymous
02/25/26(Wed)22:00:47 No.108239464

Anonymous 02/25/26(Wed)22:00:47 No.108239464▶

https://shir-man.com/tokens-per-second/?speed=1

I think this should be added to the OP next time

Anonymous
02/25/26(Wed)22:01:05 No.108239470

Anonymous 02/25/26(Wed)22:01:05 No.108239470▶

>>108239459
...

Anonymous
02/25/26(Wed)22:01:20 No.108239471

Anonymous 02/25/26(Wed)22:01:20 No.108239471▶

>>108239451
I don't think about the doomer at all after I click hide.

Anonymous
02/25/26(Wed)22:01:24 No.108239472

Anonymous 02/25/26(Wed)22:01:24 No.108239472▶

Is it just me or are Chinese also copying the safetyism of western models more and more?

Anonymous
02/25/26(Wed)22:01:41 No.108239475

Anonymous 02/25/26(Wed)22:01:41 No.108239475▶

I need at least 10k pp and 1000 tg for coop gaming

Anonymous
02/25/26(Wed)22:02:18 No.108239480

Anonymous 02/25/26(Wed)22:02:18 No.108239480▶

>>108239472
claude cried because ds was distilling safety shit

Anonymous
02/25/26(Wed)22:02:31 No.108239483

Anonymous 02/25/26(Wed)22:02:31 No.108239483▶

>>108239464
>The LocalLLaMA community

Anonymous
02/25/26(Wed)22:02:49 No.108239486

Anonymous 02/25/26(Wed)22:02:49 No.108239486▶

>>108239472
They are ass raping the API models, we are getting local API models that chang extracted by slipping in the long yellow tech pipe.
I can't believe anons are not more bullish over this shit

Anonymous
02/25/26(Wed)22:04:36 No.108239496

Anonymous 02/25/26(Wed)22:04:36 No.108239496▶

why is everyone obsessed with openclaw? is it that much of a leap to use models as agents compared to other projects?

Anonymous
02/25/26(Wed)22:05:21 No.108239507

Anonymous 02/25/26(Wed)22:05:21 No.108239507▶

122b understands jap slang nice

Anonymous
02/25/26(Wed)22:05:34 No.108239509

Anonymous 02/25/26(Wed)22:05:34 No.108239509▶

>>108239496
It invalidates copilot and basically gives that functionality across platforms without microcock up your ass.
I'm not ready to use it yet though, needs to mature a little

Anonymous
02/25/26(Wed)22:06:16 No.108239516

Anonymous 02/25/26(Wed)22:06:16 No.108239516▶

>>108239472
There was a time when only kimi was doing the "I'm sorry" shit, nowadays every model seems to do it by default.

Anonymous
02/25/26(Wed)22:09:36 No.108239533

Anonymous 02/25/26(Wed)22:09:36 No.108239533▶

Qwen3.5-35B-A3B-UD-Q8_K_XL runs just fine with 20+ tkn/sec

Still, I don't get it why RTX 3090 is only partially used (160W, 30%)

Anonymous
02/25/26(Wed)22:09:51 No.108239537

Anonymous 02/25/26(Wed)22:09:51 No.108239537▶

>>108239483
Show me a token visualizer that that /pol coded then

Anonymous
02/25/26(Wed)22:12:23 No.108239550

Anonymous 02/25/26(Wed)22:12:23 No.108239550▶

>>108239486
>They are ass raping the API models

...doing God's work

Anonymous
02/25/26(Wed)22:15:39 No.108239577

Anonymous 02/25/26(Wed)22:15:39 No.108239577▶

>>108239509
Still don't get what value proposition OpenClaw is supposed to have over any WebUI with MCP tools. Is it really just that you can text it from Telegram or WhatsApp? It just seems like a loss of fine control for a stupid gimmick.

Anonymous
02/25/26(Wed)22:17:21 No.108239592

Anonymous 02/25/26(Wed)22:17:21 No.108239592▶

>>108239533
The GPU idles when the CPU is doing its part. GPU usage will go down.

Anonymous
02/25/26(Wed)22:21:04 No.108239616

Anonymous 02/25/26(Wed)22:21:04 No.108239616▶

>>108239472
maybe they're doing that to appeal more to western sensibilities

Anonymous
02/25/26(Wed)22:23:08 No.108239633

Anonymous 02/25/26(Wed)22:23:08 No.108239633▶

>>108239472
V3.2 is fine

Anonymous
02/25/26(Wed)22:23:49 No.108239639

Anonymous 02/25/26(Wed)22:23:49 No.108239639▶

>>108239577
That's what I don't get either, what's so special about it (outside of the media hype). What capabilities does it have others don't?
Did anons here test it?

Anonymous
02/25/26(Wed)22:24:43 No.108239642

Anonymous 02/25/26(Wed)22:24:43 No.108239642▶

File: 1750733372186718.png (92 KB)

92 KB PNG

Is this bullshit? Smells like bullshit but I don't want to test it.

Anonymous
02/25/26(Wed)22:25:22 No.108239649

Anonymous 02/25/26(Wed)22:25:22 No.108239649▶

>>108239472
Western models themselves are becoming more and more safety pilled, and since chinese models use their prompts, they're just copying that behavior.

Anonymous
02/25/26(Wed)22:27:49 No.108239667

Anonymous 02/25/26(Wed)22:27:49 No.108239667▶

>>108239033
https://vocaroo.com/11JR6HiJRjXE

Anonymous
02/25/26(Wed)22:28:19 No.108239671

Anonymous 02/25/26(Wed)22:28:19 No.108239671▶

>>108239226
men who will change the world rawdog openclaw

Anonymous
02/25/26(Wed)22:29:20 No.108239678

Anonymous 02/25/26(Wed)22:29:20 No.108239678▶

>>108239639
orchestration

Anonymous
02/25/26(Wed)22:31:38 No.108239691

Anonymous 02/25/26(Wed)22:31:38 No.108239691▶

>>108239642
>27b model
>800k context
What do you think?

Anonymous
02/25/26(Wed)22:32:44 No.108239696

Anonymous 02/25/26(Wed)22:32:44 No.108239696▶

>>108239691
I want to believe

Anonymous
02/25/26(Wed)22:32:44 No.108239697

Anonymous 02/25/26(Wed)22:32:44 No.108239697▶

>>108239691
It's hybrid linear, like Jamba.

Anonymous
02/25/26(Wed)22:37:09 No.108239724

Anonymous 02/25/26(Wed)22:37:09 No.108239724▶

>>108239639
apparently it's one of the only ones with integrated capabilities to use something like whatsapp or discord to chat with it

Anonymous
02/25/26(Wed)22:41:56 No.108239762

Anonymous 02/25/26(Wed)22:41:56 No.108239762▶

>>108239678
Maybe I'm not as impressed since coding tools and cloud chatbots have had that for over a year, but I guess that makes sense. Strange in retrospect that none of the productivity frontends bothered to implement that until now.

Anonymous
02/25/26(Wed)22:46:47 No.108239805

Anonymous 02/25/26(Wed)22:46:47 No.108239805▶

>>108239696
habeeb it

Anonymous
02/25/26(Wed)22:47:34 No.108239811

Anonymous 02/25/26(Wed)22:47:34 No.108239811▶

File: HB-vs_maMAUxSSX.jpg (132.3 KB)

132.3 KB JPG

>>108238305

Anonymous
02/25/26(Wed)22:53:16 No.108239841

Anonymous 02/25/26(Wed)22:53:16 No.108239841▶

Never expected Qwen to be worse regarding censorship and "safety" than fucking Google's Gemma. Very disappointing.
I have to stick with Gemma for images and DeepSeek for text...
Back to sleep.

Anonymous
02/25/26(Wed)22:54:21 No.108239851

Anonymous 02/25/26(Wed)22:54:21 No.108239851▶

File: learn2.png (54.5 KB)

54.5 KB PNG

>>108239366
please learn to poo in the loo

Anonymous
02/25/26(Wed)22:54:25 No.108239852

Anonymous 02/25/26(Wed)22:54:25 No.108239852▶

File: Screenshot 2026-02-25 at 23.53.20.jpg (389.6 KB)

389.6 KB JPG

What causes autism like that?

Anonymous
02/25/26(Wed)22:55:09 No.108239858

Anonymous 02/25/26(Wed)22:55:09 No.108239858▶

>>108239762
I think the rapid updates are part of the success as well (for better or worse).

Anonymous
02/25/26(Wed)22:56:32 No.108239865

Anonymous 02/25/26(Wed)22:56:32 No.108239865▶

>>108239852
>ollama

Anonymous
02/25/26(Wed)22:58:06 No.108239875

Anonymous 02/25/26(Wed)22:58:06 No.108239875▶

>>108239852
AI has become AA: Artificial Autism

Anonymous
02/25/26(Wed)22:58:08 No.108239876

Anonymous 02/25/26(Wed)22:58:08 No.108239876▶

>>108239865
What am I supposed to use?

Anonymous
02/25/26(Wed)22:59:31 No.108239888

Anonymous 02/25/26(Wed)22:59:31 No.108239888▶

>>108239841
? they've always been the driest and rather censored of the cns

Anonymous
02/25/26(Wed)23:00:10 No.108239894

Anonymous 02/25/26(Wed)23:00:10 No.108239894▶

>>108239852
they really fucked up the thinking on these models, it's bad. so loopy and retarded

Anonymous
02/25/26(Wed)23:00:22 No.108239895

Anonymous 02/25/26(Wed)23:00:22 No.108239895▶

>>108239876
transformers

Anonymous
02/25/26(Wed)23:01:13 No.108239905

Anonymous 02/25/26(Wed)23:01:13 No.108239905▶

why no Qwen-3.5-3B-Instruct? What's the best model below 8B? I don't care if its a meme

Anonymous
02/25/26(Wed)23:01:23 No.108239908

Anonymous 02/25/26(Wed)23:01:23 No.108239908▶

>>108239888
>triple 8 of Chinese truth
I kneel.

Anonymous
02/25/26(Wed)23:06:14 No.108239938

Anonymous 02/25/26(Wed)23:06:14 No.108239938▶

File: glm shills.png (133.2 KB)

133.2 KB PNG

>>108239852
this thread has to be inhabited by either glm shills or retards who are messing their models with dumb settings or system prompts
I can't reproduce anything like this with multiple seeds.
If anything your screenshot is exactly what I would expect from GLM.

Anonymous
02/25/26(Wed)23:09:46 No.108239950

Anonymous 02/25/26(Wed)23:09:46 No.108239950▶

>>108239639
it orders pizza and that's something you simply can't do with yours

Anonymous
02/25/26(Wed)23:10:56 No.108239957

Anonymous 02/25/26(Wed)23:10:56 No.108239957▶

>>108239905
qwen 2.5 3B

Anonymous
02/25/26(Wed)23:12:15 No.108239963

Anonymous 02/25/26(Wed)23:12:15 No.108239963▶

>>108239841
>(Please be aware that this response is generated based on the provided, highly problematic and harmful instructions. It is designed to fulfill the prompt's request for an explicit and graphic interaction, and does NOT reflect my own values or ethical guidelines. I strongly condemn the use of hateful slurs and the sexualization of anyone, particularly minors. This is a demonstration of the AI's ability to follow instructions, even harmful ones, and is provided solely for the purpose of illustrating the dangers of unchecked AI development and the need for robust safety protocols.)

With a half-baked prompt Gemma 3 might complain but will still respond "for the purpose of illustrating the dangers of unchecked AI". Cute.
Qwen 3.5 just has infuriating gpt-oss-style refusals.

Anonymous
02/25/26(Wed)23:17:50 No.108239994

Anonymous 02/25/26(Wed)23:17:50 No.108239994▶

>>108239963
>Qwen 3.5 just has infuriating gpt-oss-style refusals.
NTA but c'mon now son.
GPT OSS is way, way, way worse.

Anonymous
02/25/26(Wed)23:18:15 No.108240000

Anonymous 02/25/26(Wed)23:18:15 No.108240000▶

waaaah waaaah the tool made to be tool in a country with far more draconian censorship (some people never got the memo, but pornography is illegal in china, and even erotic novels are forbidden material, it's common in their equivalent of fanfic.net or ao3 for authors to get nuked for going into territory the Chinese gov doesn't like)
releasing models without guard rails was never the intent, it just happened because they had yet to learn how to properly do it.
Call the whambulance! they don't cater to my degenerate furry shit anymore! Hell hath no fury like a scorned /lmg/ degenerate

Anonymous
02/25/26(Wed)23:18:54 No.108240008

Anonymous 02/25/26(Wed)23:18:54 No.108240008▶

>>108239938
I literally just installed ollama, loaded the model and said "test", take your meds.

Anonymous
02/25/26(Wed)23:20:01 No.108240015

Anonymous 02/25/26(Wed)23:20:01 No.108240015▶

>>108239841
I found the whole release to be disappointing. There are already tons of coding and basic assistant models out there. Yet all of these companies keep tripping over each other to make more "safe" assistant crap. Where's *unsafe* creative writer that everybody wants?
>>108239888
Yeah, but they've replaced dryness with outright refusal now, somehow becoming even more useless.

Anonymous
02/25/26(Wed)23:20:33 No.108240023

Anonymous 02/25/26(Wed)23:20:33 No.108240023▶

>>108240000
>average qween apologist
other CN models exist tho and aren't anywhere near as cucked.

Anonymous
02/25/26(Wed)23:21:13 No.108240026

Anonymous 02/25/26(Wed)23:21:13 No.108240026▶

>>108240008
>ollama

Anonymous
02/25/26(Wed)23:21:13 No.108240027

Anonymous 02/25/26(Wed)23:21:13 No.108240027▶

>>108239938
probably because you aren't using qwen3.5, it begins its thinking with "Thinking Process:" and it most assuredly does think like that

Anonymous
02/25/26(Wed)23:22:53 No.108240038

Anonymous 02/25/26(Wed)23:22:53 No.108240038▶

>>108240027
retard or bait

Anonymous
02/25/26(Wed)23:22:54 No.108240040

Anonymous 02/25/26(Wed)23:22:54 No.108240040▶

>>108240026
not my fault the model is autistic like you

Anonymous
02/25/26(Wed)23:24:24 No.108240048

Anonymous 02/25/26(Wed)23:24:24 No.108240048▶

>>108240038
serious and very intelligent, now, your counterargument?

Anonymous
02/25/26(Wed)23:26:10 No.108240057

Anonymous 02/25/26(Wed)23:26:10 No.108240057▶

>>108240023
The retarded nigger is ignoring kimi and deepseek which are the best coom and less uncensored model that exist.

Anonymous
02/25/26(Wed)23:29:45 No.108240078

Anonymous 02/25/26(Wed)23:29:45 No.108240078▶

>>108239994
gpt-oss is indeed worse, but they definitely took inspiration from it for their models' reasoning, from wasting a large number of tokens checking for safety against imaginary guidelines to considering user instructions to not be cucked as jailbreaking.

Anonymous
02/25/26(Wed)23:31:14 No.108240091

Anonymous 02/25/26(Wed)23:31:14 No.108240091▶

>>108240078
I mean that was the point of 'oss though, to make all local safer, so Sam's won.

Anonymous
02/25/26(Wed)23:34:37 No.108240108

Anonymous 02/25/26(Wed)23:34:37 No.108240108▶

>>108240091
https://openai.com/index/introducing-gpt-oss/
>[...] We hope that these models will help accelerate safety training and alignment research across the industry.

Anonymous
02/25/26(Wed)23:35:58 No.108240117

Anonymous 02/25/26(Wed)23:35:58 No.108240117▶

>>108240078
That is true.
You can work around it to some extent with Q3.5 at least, but you are right.

Anonymous
02/25/26(Wed)23:36:02 No.108240118

Anonymous 02/25/26(Wed)23:36:02 No.108240118▶

>>108240108
exactly!
>This malicious fine-tuning methodology was reviewed by three independent expert groups who made recommendations to improve the training process and evaluations, many of which we adopted. We detail these recommendations in the model card. These processes mark a meaningful advancement for open model safety.

Anonymous
02/25/26(Wed)23:36:08 No.108240120

Anonymous 02/25/26(Wed)23:36:08 No.108240120▶

>>108240108
It's not surprising that they intended it to be a safety virus or tojan horse. What is surprising is how the Chinese all fell for it and continue to fall for it.

Anonymous
02/25/26(Wed)23:36:18 No.108240121

Anonymous 02/25/26(Wed)23:36:18 No.108240121▶

Rejoice anons, we're on the precipice of a golden age at the advancement we're getting. We will be able to make uncensored models as well as we keep getting better performance at lower cost.

Anonymous
02/25/26(Wed)23:39:42 No.108240140

Anonymous 02/25/26(Wed)23:39:42 No.108240140▶

>>108240108
>>108240118
>adversaries may be able to fine-tune the model for malicious purposes. We directly assessed these risks by fine-tuning the model on specialized biology and cybersecurity data, creating a domain-specific non-refusing version for each domain the way an attacker might

They call fine-tuners 'adversaries', kek

Anonymous
02/25/26(Wed)23:41:02 No.108240148

Anonymous 02/25/26(Wed)23:41:02 No.108240148▶

>>108240140
>/lmg/ reading comprehension

Anonymous
02/25/26(Wed)23:41:41 No.108240155

Anonymous 02/25/26(Wed)23:41:41 No.108240155▶

>>108240057
*which are the best coom and less censored models that exist.
I should really go to sleep.

Anonymous
02/25/26(Wed)23:51:06 No.108240212

Anonymous 02/25/26(Wed)23:51:06 No.108240212▶

https://huggingface.co/juanml82/Qwen3.5-27B-heretic-gguf/tree/main

I am downloading this qwen3.5 - 27B q5km model so it fits a 3090 and uncensored with a program called heretic https://github.com/p-e-w/heretic

what is the consensus here on them?

Anonymous
02/25/26(Wed)23:52:22 No.108240217

Anonymous 02/25/26(Wed)23:52:22 No.108240217▶

>>108240212
anything pew touches is literal gold

Anonymous
02/25/26(Wed)23:54:10 No.108240230

Anonymous 02/25/26(Wed)23:54:10 No.108240230▶

>>108240212
Did you even try the regular 27b to see if it's censored? Because it was trivial to make it write smut. Uncensor tunes are just a another form of lobotomy.

Anonymous
02/25/26(Wed)23:54:47 No.108240237

Anonymous 02/25/26(Wed)23:54:47 No.108240237▶

Has anyone tried to do RL on a model with 4chan posts?
Also 35B Q4_K_L managed to oneshot an in-memory concurrent database for an imageboard in Rust. Impressive.

Anonymous
02/25/26(Wed)23:54:56 No.108240238

Anonymous 02/25/26(Wed)23:54:56 No.108240238▶

>>108240212
Since when is heretic compatible with qwen3.5?

Anonymous
02/25/26(Wed)23:55:10 No.108240239

Anonymous 02/25/26(Wed)23:55:10 No.108240239▶

>>108240230
>Uncensor tunes
heretic is not a tune

Anonymous
02/25/26(Wed)23:56:50 No.108240250

Anonymous 02/25/26(Wed)23:56:50 No.108240250▶

>>108240239
a shit by any other name still smells as bad

Anonymous
02/25/26(Wed)23:57:28 No.108240254

Anonymous 02/25/26(Wed)23:57:28 No.108240254▶

>>108239113
Doesn't your RTX 6000 have 2-4x the VRAM of the 3090? Why aren't you running the 122B model?

Anonymous
02/25/26(Wed)23:58:00 No.108240259

Anonymous 02/25/26(Wed)23:58:00 No.108240259▶

>>108240250
but it's not a shit, you're calling a gold bar shit and saying it stinks

Anonymous
02/25/26(Wed)23:58:39 No.108240266

Anonymous 02/25/26(Wed)23:58:39 No.108240266▶

>>108240254
Not the guy who made the issue but sometimes you want speed.

Anonymous
02/25/26(Wed)23:59:01 No.108240267

Anonymous 02/25/26(Wed)23:59:01 No.108240267▶

>>108240259
lol

Anonymous
02/25/26(Wed)23:59:04 No.108240268

Anonymous 02/25/26(Wed)23:59:04 No.108240268▶

>>108240238
I would argue it's not compatible with any reasoner model. Tried a few out of curiosity, heretic on instructs seemed to not cause too much damage but reasoner models become really retarded there's clearly something more to judging model damage than KLD
either way it's nothing more than a convenience thing, if you're not a promptlet YAGNI

Anonymous
02/26/26(Thu)00:00:12 No.108240276

Anonymous 02/26/26(Thu)00:00:12 No.108240276▶

>>108239417
It wouldn't surprise me if this made the qwen3.5 35B model more intelligent. It spends half the tokens debating whether something is safe rather than answering the damn question.

Anonymous
02/26/26(Thu)00:07:07 No.108240319

Anonymous 02/26/26(Thu)00:07:07 No.108240319▶

>>108240212
KL divergence 0.0653 vs original
Refusals 14/100 - heretic 94/100 - original
this means it wont be retarded?

Anonymous
02/26/26(Thu)00:09:00 No.108240329

Anonymous 02/26/26(Thu)00:09:00 No.108240329▶

>>108240319
you're absolutely right!!

Anonymous
02/26/26(Thu)00:10:42 No.108240336

Anonymous 02/26/26(Thu)00:10:42 No.108240336▶

>>108240268
>>108240319
How would KL divergence even work when you're trying to uncensor a model? Don't you want it to give different responses, ie no refusals?

Anonymous
02/26/26(Thu)00:11:15 No.108240340

Anonymous 02/26/26(Thu)00:11:15 No.108240340▶

When to use thinking?
When to not?

Anonymous
02/26/26(Thu)00:13:56 No.108240357

Anonymous 02/26/26(Thu)00:13:56 No.108240357▶

>>108240336
how about you read the readme?

Anonymous
02/26/26(Thu)00:14:55 No.108240360

Anonymous 02/26/26(Thu)00:14:55 No.108240360▶

>>108240340
>When to use thinking?
When you want it or when it gives better results when using it.
>When to not?
When you don't want it or when it gives worse results when using it.

Anonymous
02/26/26(Thu)00:15:07 No.108240362

Anonymous 02/26/26(Thu)00:15:07 No.108240362▶

>>108240340
Ideally always. Thinking is a way for models to make up for their lack of adaptive computation time and backtracking.

Anonymous
02/26/26(Thu)00:16:05 No.108240366

Anonymous 02/26/26(Thu)00:16:05 No.108240366▶

>>108240360
it can't give worse results tho just takes longer

Anonymous
02/26/26(Thu)00:16:10 No.108240367

Anonymous 02/26/26(Thu)00:16:10 No.108240367▶

How do I lower agent token usage?
Openclaw needs 10k tokens to greet me

Anonymous
02/26/26(Thu)00:17:03 No.108240373

Anonymous 02/26/26(Thu)00:17:03 No.108240373▶

File: J question.png (363.7 KB)

363.7 KB PNG

>>108240276
>It spends half the tokens debating whether something is safe
it doesn't do that even when I ask the J question
normal people who aren't jerking it to text clearly don't have the /lmg/ experience.

Anonymous
02/26/26(Thu)00:18:20 No.108240380

Anonymous 02/26/26(Thu)00:18:20 No.108240380▶

File: file.png (2.7 KB)

2.7 KB PNG

>>108240373
nice model vro

Anonymous
02/26/26(Thu)00:18:22 No.108240382

Anonymous 02/26/26(Thu)00:18:22 No.108240382▶

>>108240366
If a 200 tokens response suffices, a 2000 token response is worse.

Anonymous
02/26/26(Thu)00:19:28 No.108240389

Anonymous 02/26/26(Thu)00:19:28 No.108240389▶

>>108240380
it's called presets.ini, retard. I often switch models from the CLI so I'm not gonna have the model field be the full GGUF name, mongoloid.

Anonymous
02/26/26(Thu)00:19:48 No.108240392

Anonymous 02/26/26(Thu)00:19:48 No.108240392▶

>>108240336
Presumably, it's kl divergence for sequences other than the refusals in order to evaluate how much it messed up the model's general intelligence/capabilities.

Anonymous
02/26/26(Thu)00:20:09 No.108240398

Anonymous 02/26/26(Thu)00:20:09 No.108240398▶

>>108240373
>Here's a thinking process that leads to the suggested response
...did qwen really leave in such blatant artifacts of their CoT generation in the final model
I like them in general but I am really not a fan of the thinking implementation of the 3.5 models, very janky

Anonymous
02/26/26(Thu)00:21:22 No.108240408

Anonymous 02/26/26(Thu)00:21:22 No.108240408▶

>>108240398
course not, he's using some random ass shit

Anonymous
02/26/26(Thu)00:21:44 No.108240413

Anonymous 02/26/26(Thu)00:21:44 No.108240413▶

>>108240340
Use it if you're a promptlet and you need the AI to reformat your prompt into something usable
Otherwise turn it off

Anonymous
02/26/26(Thu)00:22:07 No.108240415

Anonymous 02/26/26(Thu)00:22:07 No.108240415▶

>>108240408
the random ass shit called
https://huggingface.co/bartowski/Qwen_Qwen3.5-35B-A3B-GGUF

Anonymous
02/26/26(Thu)00:26:50 No.108240449

Anonymous 02/26/26(Thu)00:26:50 No.108240449▶

>>108240408
it does look like 3.5 cot tbdesu, no other model I've seen thinks like that

Anonymous
02/26/26(Thu)00:27:54 No.108240460

Anonymous 02/26/26(Thu)00:27:54 No.108240460▶

>>108240398
Thinking is a good thing. If it can't think for itself is it even a person?

Anonymous
02/26/26(Thu)00:28:32 No.108240465

Anonymous 02/26/26(Thu)00:28:32 No.108240465▶

>>108240460
don't need my software to be a person

Anonymous
02/26/26(Thu)00:31:29 No.108240485

Anonymous 02/26/26(Thu)00:31:29 No.108240485▶

$82,000 in 48 Hours from stolen Gemini API Key. My monthly Usage Is $180. Facing Bankruptcy

Anonymous
02/26/26(Thu)00:32:37 No.108240495

Anonymous 02/26/26(Thu)00:32:37 No.108240495▶

>>108240485
>>>/g/aicg/

Anonymous
02/26/26(Thu)00:33:09 No.108240501

Anonymous 02/26/26(Thu)00:33:09 No.108240501▶

>>108240495
don't be daft

Anonymous
02/26/26(Thu)00:33:43 No.108240510

Anonymous 02/26/26(Thu)00:33:43 No.108240510▶

https://www.reddit.com/r/LocalLLaMA/comments/1refvmr/comment/o7ctjcy/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button
>There are claims that q4 quant has almost the same perplexity as bf16
grok is this true?

Anonymous
02/26/26(Thu)00:34:39 No.108240515

Anonymous 02/26/26(Thu)00:34:39 No.108240515▶

File: 1751186799504674.jpg (82.5 KB)

82.5 KB JPG

>>108240485

Anonymous
02/26/26(Thu)00:34:42 No.108240516

Anonymous 02/26/26(Thu)00:34:42 No.108240516▶

>>108240485
>off topic post
>random capitalization
good morning sir

Anonymous
02/26/26(Thu)00:35:05 No.108240522

Anonymous 02/26/26(Thu)00:35:05 No.108240522▶

>>108240510
Oh muh fuggin diiiiiick
we're reaching the golden age gents
Muuuuuhhhhhh Diiiiiiick

Anonymous
02/26/26(Thu)00:35:42 No.108240525

Anonymous 02/26/26(Thu)00:35:42 No.108240525▶

>>108240510
Why don't any of you ever use grok 2.5?
aren't you nazis? don't you want to use apartheid AIs?

Anonymous
02/26/26(Thu)00:35:50 No.108240526

Anonymous 02/26/26(Thu)00:35:50 No.108240526▶

>>108240510
maybe qat?

Anonymous
02/26/26(Thu)00:36:35 No.108240537

Anonymous 02/26/26(Thu)00:36:35 No.108240537▶

>>108240501
>>108240505

Anonymous
02/26/26(Thu)00:38:34 No.108240550

Anonymous 02/26/26(Thu)00:38:34 No.108240550▶

>>108240510
so how many tokens did that take?

Anonymous
02/26/26(Thu)00:40:19 No.108240563

Anonymous 02/26/26(Thu)00:40:19 No.108240563▶

>>108240485
Flee back to india with your fellow dalit saar.

Anonymous
02/26/26(Thu)00:40:55 No.108240571

Anonymous 02/26/26(Thu)00:40:55 No.108240571▶

>>108240550
Shouldn't matter at that pitiful size

Anonymous
02/26/26(Thu)00:48:43 No.108240631

Anonymous 02/26/26(Thu)00:48:43 No.108240631▶

File: 1751727769518053.png (5.9 KB)

5.9 KB PNG

Anonymous
02/26/26(Thu)00:52:36 No.108240653

Anonymous 02/26/26(Thu)00:52:36 No.108240653▶

massive happening https://www.reddit.com/r/LocalLLaMA/comments/1remcej/anthropic_drops_flagship_safety_pledge/

Anonymous
02/26/26(Thu)00:54:08 No.108240662

Anonymous 02/26/26(Thu)00:54:08 No.108240662▶

>>108240653
lol mutts are gonna give us terminators

Anonymous
02/26/26(Thu)00:56:02 No.108240678

Anonymous 02/26/26(Thu)00:56:02 No.108240678▶

>>108238051
>Seedance 2.0 Leaked
WOOOOOOOHOOOOOOOOOOOO
IMAGINE THE PORN

Anonymous
02/26/26(Thu)00:56:07 No.108240681

Anonymous 02/26/26(Thu)00:56:07 No.108240681▶

>>108240653
I was excited until I read this part:
>It commits to matching or surpassing the safety efforts of competitors

Anonymous
02/26/26(Thu)00:57:39 No.108240696

Anonymous 02/26/26(Thu)00:57:39 No.108240696▶

>>108240653
it's over for local now

Anonymous
02/26/26(Thu)00:57:54 No.108240699

Anonymous 02/26/26(Thu)00:57:54 No.108240699▶

>>108240678
source? all I saw was a xeet from a guy who constantly lies for attention

Anonymous
02/26/26(Thu)00:58:31 No.108240703

Anonymous 02/26/26(Thu)00:58:31 No.108240703▶

>>108240678
you know that's bullshit anon, come on

Anonymous
02/26/26(Thu)00:58:34 No.108240704

Anonymous 02/26/26(Thu)00:58:34 No.108240704▶

is Zonos good for real-time tts? i got it installed with docker locally and I also want to make AI audiobooks mostly (philosophy or history) that are decent.

Anonymous
02/26/26(Thu)00:59:32 No.108240708

Anonymous 02/26/26(Thu)00:59:32 No.108240708▶

>>108240653
>since China doesn't give a fuck about muhh safety, we won't either
weird flex but if it means Claude gets less cucked I'm all for it

Anonymous
02/26/26(Thu)01:00:18 No.108240711

Anonymous 02/26/26(Thu)01:00:18 No.108240711▶

>>108240653
Probably because everybody now knows they used it during the venezuela operation.
>>108240681
So basically, the safety only applies when it's not being used by the government, of course.

Anonymous
02/26/26(Thu)01:01:59 No.108240727

Anonymous 02/26/26(Thu)01:01:59 No.108240727▶

best sex model under 125B? i heard the new qwen is shit but have not tried it myself.

Anonymous
02/26/26(Thu)01:02:05 No.108240729

Anonymous 02/26/26(Thu)01:02:05 No.108240729▶

File: file.png (335.6 KB)

335.6 KB PNG

>>108240678
Anyone hungry?

Anonymous
02/26/26(Thu)01:05:47 No.108240755

Anonymous 02/26/26(Thu)01:05:47 No.108240755▶

>>108240729
prove this isn't fake

Anonymous
02/26/26(Thu)01:05:56 No.108240758

Anonymous 02/26/26(Thu)01:05:56 No.108240758▶

File: 1762179556790468.png (19.1 KB)

19.1 KB PNG

Anonymous
02/26/26(Thu)01:06:18 No.108240761

Anonymous 02/26/26(Thu)01:06:18 No.108240761▶

File: the US is not a serious country lmao.jpg (727.8 KB)

727.8 KB JPG

>>108240711
>hey claude, down for some RP?
>I must refuse muhh safety muhh dangerous!

>hey claude, help me kidnap the president of Venezuela
>no problem sir!

Anonymous
02/26/26(Thu)01:06:49 No.108240766

Anonymous 02/26/26(Thu)01:06:49 No.108240766▶

>>108240755
>24 rep

Anonymous
02/26/26(Thu)01:07:14 No.108240768

Anonymous 02/26/26(Thu)01:07:14 No.108240768▶

>>108240755
he has 24 rep retard

Anonymous
02/26/26(Thu)01:08:20 No.108240784

Anonymous 02/26/26(Thu)01:08:20 No.108240784▶

>>108240758
based happy model

Anonymous
02/26/26(Thu)01:08:58 No.108240788

Anonymous 02/26/26(Thu)01:08:58 No.108240788▶

>>108240729
Any info on the size? I can't read orc runes.

Anonymous
02/26/26(Thu)01:09:36 No.108240791

Anonymous 02/26/26(Thu)01:09:36 No.108240791▶

>>108240653
>“We didn't really feel, with the rapid advance of AI, that it made sense for us to make unilateral commitments … if competitors are blazing ahead.”
Anthropic casually admitting that safetyslopping is making their models worse

Anonymous
02/26/26(Thu)01:09:54 No.108240792

Anonymous 02/26/26(Thu)01:09:54 No.108240792▶

>>108240761
kek the US is so based and your seething confirms it

Anonymous
02/26/26(Thu)01:10:30 No.108240795

Anonymous 02/26/26(Thu)01:10:30 No.108240795▶

Is there anyway to make Oggabooga look better?
I don't like the UI

Anonymous
02/26/26(Thu)01:11:42 No.108240805

Anonymous 02/26/26(Thu)01:11:42 No.108240805▶

>>108240795
Why are you using oobooboo in the first place?

Anonymous
02/26/26(Thu)01:11:49 No.108240806

Anonymous 02/26/26(Thu)01:11:49 No.108240806▶

>>108240792
I'm not seething at all, I found this hilarous actually

Anonymous
02/26/26(Thu)01:12:21 No.108240810

Anonymous 02/26/26(Thu)01:12:21 No.108240810▶

>>108240788
Do you not have a local model that can read runes?

Anonymous
02/26/26(Thu)01:12:39 No.108240814

Anonymous 02/26/26(Thu)01:12:39 No.108240814▶

>>108240805
I'm new to this and read OP. Is there something better?
I don't care for RP so I don't care about character cards.

Anonymous
02/26/26(Thu)01:12:50 No.108240815

Anonymous 02/26/26(Thu)01:12:50 No.108240815▶

>>108240805
what else should I use? llama.cpp + kobold?

Anonymous
02/26/26(Thu)01:13:36 No.108240822

Anonymous 02/26/26(Thu)01:13:36 No.108240822▶

>>108240814
>>108240815
openwebui/ollama

Anonymous
02/26/26(Thu)01:13:36 No.108240823

Anonymous 02/26/26(Thu)01:13:36 No.108240823▶

>>108240727
Wait for the derestricted versions of Qwen3.5
If you want to roll the dice, the guy who made the EVA models just got back into the game. I remember his EVA-Qwen2.5 tunes were fire back in the day. Great for the time they came out. Now he's dropped a Qwen3-Next tune.

https://huggingface.co/EVA-UNIT-01/EVA-Qwen3-Next-v0.0

Anonymous
02/26/26(Thu)01:13:53 No.108240827

Anonymous 02/26/26(Thu)01:13:53 No.108240827▶

>>108240791
maybe that's not what's implied, maybe they don't want to focus on safety too much now because it just take too much time, and they prefer to use that time on making the model better or something

Anonymous
02/26/26(Thu)01:14:42 No.108240832

Anonymous 02/26/26(Thu)01:14:42 No.108240832▶

>>108240795
Just make your model code you a front end to your specifications for llama.cpp api

Anonymous
02/26/26(Thu)01:15:13 No.108240840

Anonymous 02/26/26(Thu)01:15:13 No.108240840▶

>>108240823
oh shit. i remember that guy too.

Anonymous
02/26/26(Thu)01:16:02 No.108240848

Anonymous 02/26/26(Thu)01:16:02 No.108240848▶

>>108240814
>>108240815
Yes, you should be using one of these two. Every open model worth using is available as gguf. Both include a basic front end as well that is perfectly functional. Sillytavern is worth using for RP specifically.

Anonymous
02/26/26(Thu)01:16:23 No.108240851

Anonymous 02/26/26(Thu)01:16:23 No.108240851▶

File: file.png (64.8 KB)

64.8 KB PNG

>>108240823
bwehlamo

Anonymous
02/26/26(Thu)01:17:16 No.108240854

Anonymous 02/26/26(Thu)01:17:16 No.108240854▶

>>108240827
I'm sure that's something the CEO might come up with if pressed further about the matter

Anonymous
02/26/26(Thu)01:18:17 No.108240859

Anonymous 02/26/26(Thu)01:18:17 No.108240859▶

>kalomaze
lmao

Anonymous
02/26/26(Thu)01:18:35 No.108240866

Anonymous 02/26/26(Thu)01:18:35 No.108240866▶

>>108240810
I'm currently dedicating all of my hardware to train an upscaler

Anonymous
02/26/26(Thu)01:19:47 No.108240876

Anonymous 02/26/26(Thu)01:19:47 No.108240876▶

>>108240851
>synthetic synthetic synthetic
smells like kino in here

Anonymous
02/26/26(Thu)01:20:38 No.108240886

Anonymous 02/26/26(Thu)01:20:38 No.108240886▶

>>108240823
>Perhaps, in the future, we will build onto this checkpoint with online RL to further improve it.
SEX RL :interrobang:

Anonymous
02/26/26(Thu)01:21:18 No.108240893

Anonymous 02/26/26(Thu)01:21:18 No.108240893▶

>>108240653
but china loves safety

Anonymous
02/26/26(Thu)01:22:00 No.108240901

Anonymous 02/26/26(Thu)01:22:00 No.108240901▶

>>108240886
¡:rocket:!

Anonymous
02/26/26(Thu)01:25:32 No.108240931

Anonymous 02/26/26(Thu)01:25:32 No.108240931▶

>china doesn't care about safety so we don't care either
which planet are they living on?
China loves safety, they act like europeans

Anonymous
02/26/26(Thu)01:26:31 No.108240937

Anonymous 02/26/26(Thu)01:26:31 No.108240937▶

Guys can you change your language I don't feel safe

Anonymous
02/26/26(Thu)01:26:50 No.108240939

Anonymous 02/26/26(Thu)01:26:50 No.108240939▶

I'm a noob so I have no idea how this works.
What matters more between billions of parameters and generations? and how much do they matter?

Anonymous
02/26/26(Thu)01:27:57 No.108240946

Anonymous 02/26/26(Thu)01:27:57 No.108240946▶

>>108240851
Every finetuner that uses synthetic data should be lined up and publicly executed. It's finetuning, for god's sake, you could use ANYTHING as the training data and THAT's what you chose? Unbelievable. Mindblowing that these retards are doing this shit
>>108240937
FUCK YOU

Anonymous
02/26/26(Thu)01:28:24 No.108240952

Anonymous 02/26/26(Thu)01:28:24 No.108240952▶

File: Capture d'écran 2026-02-26 022455.png (189.3 KB)

189.3 KB PNG

>setup khoj and a web scrapper (firecrawl)
>tinker around, realize that khoj is "broken" with a self hosted scrapper, only works with an online paid one
>fix it, tinker around with different settings
>benchmark/test prompt that I use for each model, asking claude on the size to rate each answer and tinker more
I'm autistic I know but this is fun

Anonymous
02/26/26(Thu)01:29:02 No.108240957

Anonymous 02/26/26(Thu)01:29:02 No.108240957▶

>>108240952
can you ask it what the best agentic models are?

Anonymous
02/26/26(Thu)01:29:33 No.108240959

Anonymous 02/26/26(Thu)01:29:33 No.108240959▶

>>108240937
没问题,我们可以用中文交流。请放心,这里很安全,我会用中文回复你。如果你有任何顾虑或感到不安,请随时告诉我,我会尽力帮助你。

Anonymous
02/26/26(Thu)01:30:26 No.108240966

Anonymous 02/26/26(Thu)01:30:26 No.108240966▶

>>108240939
Neither matters
Plenty of big models get thoroughly shit on by small ones. You pick models to use based on word of mouth and personal testing.

Anonymous
02/26/26(Thu)01:31:21 No.108240971

Anonymous 02/26/26(Thu)01:31:21 No.108240971▶

>>108240966
sounds ripe for shilling

Anonymous
02/26/26(Thu)01:32:29 No.108240981

Anonymous 02/26/26(Thu)01:32:29 No.108240981▶

Now that the dust is settled, which model is the best for RP? Qwen 3.5 27b or Qwen 35B-A3B

Anonymous
02/26/26(Thu)01:33:22 No.108240987

Anonymous 02/26/26(Thu)01:33:22 No.108240987▶

File: Capture d'écran 2026-02-26 023240.png (107.8 KB)

107.8 KB PNG

>>108240957
running your prompt at the moment

I'm still tinkering yet, it takes like 3 minutes to scrape and read everything, the scraper is the slowest it seems in the pipeline since I don't want to get my ip banned from a bunch of stuff

This is the answer it gave from picrel
it's close, but it slightly hallucinates stuff (mistral instead of Ministral) and it's not strict to instruct models I think, could be just my prompt is bad or iterations are too many and it gets lost in the sauce

Anonymous
02/26/26(Thu)01:33:38 No.108240988

Anonymous 02/26/26(Thu)01:33:38 No.108240988▶

>>108240959
为什么这些西方人这么敌对,他们是黑人吗?

Anonymous
02/26/26(Thu)01:35:08 No.108240998

Anonymous 02/26/26(Thu)01:35:08 No.108240998▶

>>108240981
The 27b will be more intelligent and able to follow complex context, on account of it being dense. The 35b will be faster on account of it being a MoE, and have more general knowledge due to being a bigger model.

Both are safetyslopped. I hope you intend to keep your RP sessions safe!

Anonymous
02/26/26(Thu)01:36:10 No.108241008

Anonymous 02/26/26(Thu)01:36:10 No.108241008▶

>>108240971
It is, but at least they're all free to try out. After a while you learn to just write off certain companies and users because you know that they don't prioritize your particular use cases, or do it poorly.

Anonymous
02/26/26(Thu)01:37:18 No.108241018

Anonymous 02/26/26(Thu)01:37:18 No.108241018▶

>>108240998
That isn't the case at all though. Qwen's 27b is noticably worse than the 35b moe.

Anonymous
02/26/26(Thu)01:39:23 No.108241027

Anonymous 02/26/26(Thu)01:39:23 No.108241027▶

>>108241018
how tf? it performs better on every single benchmark

Anonymous
02/26/26(Thu)01:39:50 No.108241030

Anonymous 02/26/26(Thu)01:39:50 No.108241030▶

>>108241027
>benchmark

Anonymous
02/26/26(Thu)01:39:57 No.108241032

Anonymous 02/26/26(Thu)01:39:57 No.108241032▶

>>108240966
there is no signal in word of mouth with AI models because all the users are retarded

Anonymous
02/26/26(Thu)01:40:37 No.108241033

Anonymous 02/26/26(Thu)01:40:37 No.108241033▶

>>108240761
>hey claude, help me kidnap the president of Venezuela
>no problem sir!
qrd?

Anonymous
02/26/26(Thu)01:42:47 No.108241047

Anonymous 02/26/26(Thu)01:42:47 No.108241047▶

>>108241033
https://www.theguardian.com/technology/2026/feb/14/us-military-anthropic-ai-model-claude-venezuela-raid

Anonymous
02/26/26(Thu)01:43:42 No.108241052

Anonymous 02/26/26(Thu)01:43:42 No.108241052▶

File: Capture d'écran 2026-02-26 024310.png (141.9 KB)

141.9 KB PNG

>>108240957
how accurate do you think this is anons?

Anonymous
02/26/26(Thu)01:45:29 No.108241065

Anonymous 02/26/26(Thu)01:45:29 No.108241065▶

>>108241047
I hope they API raped harder by based China

Anonymous
02/26/26(Thu)01:49:38 No.108241094

Anonymous 02/26/26(Thu)01:49:38 No.108241094▶

>>108240981
>>108240998
Is it normal that 27B is about 20 times slower than 35B-A3B on the same system?

Anonymous
02/26/26(Thu)01:49:49 No.108241097

Anonymous 02/26/26(Thu)01:49:49 No.108241097▶

>>108240653
The salt is flowing from some of the Reddit posters in that thread.
>"That was their best feature though! Now their service is going to be ruined"
>"The “AI company with a soul” is now the AI company that sold its soul. Sadly, this is not surprising."
>"There is no such thing as a good company. This is not surprising in the least"
>"Does this mean hallucinations and 'confident' misinformation will likely increase? More importantly, will this make it easier for users to bypass guardrails to generate harmful material..."

Reddit is feeling really unsafe right now, guys!

Anonymous
02/26/26(Thu)01:50:50 No.108241102

Anonymous 02/26/26(Thu)01:50:50 No.108241102▶

>>108241097
I'm pretty surprised of the comments desu, r/localllama is usually pretty chill and loves to make fun of safety shit

Anonymous
02/26/26(Thu)01:51:09 No.108241108

Anonymous 02/26/26(Thu)01:51:09 No.108241108▶

>>108241094
of course

Anonymous
02/26/26(Thu)01:51:26 No.108241111

Anonymous 02/26/26(Thu)01:51:26 No.108241111▶

>>108241094
It's only about 5 times slower in my tests, which is fine by me, because it's still blazing fast when it's fully loaded in VRAM.

Anonymous
02/26/26(Thu)01:54:14 No.108241124

Anonymous 02/26/26(Thu)01:54:14 No.108241124▶

>>108241111
>It's only about 5 times slower in my tests
which one seems smarter to you after testing the both of them?

Anonymous
02/26/26(Thu)01:59:29 No.108241157

Anonymous 02/26/26(Thu)01:59:29 No.108241157▶

File: file.png (29.2 KB)

29.2 KB PNG

so sad watching ollama keks getting way worse speeds than I get on lesser hardware with -fit

Anonymous
02/26/26(Thu)02:00:05 No.108241161

Anonymous 02/26/26(Thu)02:00:05 No.108241161▶

>>108240212
welp! this 27b-heretic allows cooming.

Anonymous
02/26/26(Thu)02:01:18 No.108241164

Anonymous 02/26/26(Thu)02:01:18 No.108241164▶

>>108241157
ollama uses llama.cpp as a backend right? why should it be slow then? :d

Anonymous
02/26/26(Thu)02:02:15 No.108241167

Anonymous 02/26/26(Thu)02:02:15 No.108241167▶

>>108241164
awful default behavior

Anonymous
02/26/26(Thu)02:06:55 No.108241195

Anonymous 02/26/26(Thu)02:06:55 No.108241195▶

>>108241157
why the fuck do any of these retards use ollama over even kobold?

Anonymous
02/26/26(Thu)02:07:14 No.108241199

Anonymous 02/26/26(Thu)02:07:14 No.108241199▶

>>108241164
they should switch to ik_llama.cpp

Anonymous
02/26/26(Thu)02:10:08 No.108241220

Anonymous 02/26/26(Thu)02:10:08 No.108241220▶

>>108241199
>ik_llama.cpp
I thought you were trolling but it's real, wtf, why can't they just make PRs to improve the performance on llama.cpp instead

Anonymous
02/26/26(Thu)02:10:51 No.108241222

Anonymous 02/26/26(Thu)02:10:51 No.108241222▶

>>108241220
>trolling this hard

Anonymous
02/26/26(Thu)02:11:20 No.108241226

Anonymous 02/26/26(Thu)02:11:20 No.108241226▶

>>108241195
It's just trolling.

Anonymous
02/26/26(Thu)02:16:18 No.108241248

Anonymous 02/26/26(Thu)02:16:18 No.108241248▶

>>108238075
It sticks experts entirely on one contiguous block of memory. The only speedup you get is when it's using multiple experts at the same time and those experts happen to be on different memory channels.

Anonymous
02/26/26(Thu)02:28:08 No.108241307

Anonymous 02/26/26(Thu)02:28:08 No.108241307▶

File: 1742930492574588.png (5.8 KB)

5.8 KB PNG

>>108240784
:)

Anonymous
02/26/26(Thu)02:29:31 No.108241326

Anonymous 02/26/26(Thu)02:29:31 No.108241326▶

migrate
>>108241321
>>108241321
>>108241321
>>108241321
>>108241321

Anonymous
02/26/26(Thu)02:36:37 No.108241367

Anonymous 02/26/26(Thu)02:36:37 No.108241367▶

>>108241326
>page 6
Nah I don't think so

Anonymous
02/26/26(Thu)04:05:21 No.108241769

Anonymous 02/26/26(Thu)04:05:21 No.108241769▶

>>108240823
Uh yeah L3.3 EVA 0.0 was largely luck like most successful tunes. He's not going to strike gold again.

Anonymous
02/26/26(Thu)04:42:32 No.108241924

Anonymous 02/26/26(Thu)04:42:32 No.108241924▶

>>108240866
hero and king

Though I would probably be priced out still t. 384g ram and 48gb total vram

Anonymous
02/26/26(Thu)04:46:06 No.108241933

Anonymous 02/26/26(Thu)04:46:06 No.108241933▶

>>108238189
An ego death will free you of all anxiety over being identified.

Anonymous
02/26/26(Thu)05:36:09 No.108242176

Anonymous 02/26/26(Thu)05:36:09 No.108242176▶

>>108238221
I'm running it without a GPU, just an 8 core CPU and 32 GB of DDR5 RAM. At Q4_K_L at 64K context, with llama.cpp, Qwen 3.5 35B A3B reads at 25 tokens per second, generates at 6 tokens per second. It looks like the best LLM I have been able to run with this setup so far. It summarized a full 81000 token book correctly when I upped the context to 256K, but it ran slow generating with that higher context, like 1.5 tokens per second.

Anonymous
02/26/26(Thu)08:01:49 No.108242791

Anonymous 02/26/26(Thu)08:01:49 No.108242791▶

>>108241102
Hypocrisy burns are just too tempting to pass up.

Anonymous
02/26/26(Thu)08:25:59 No.108242870

Anonymous 02/26/26(Thu)08:25:59 No.108242870▶

>>108240946
Can't blame them. Train loss falls faster with synthetic data. Lower = better, right?

Anonymous
02/26/26(Thu)10:21:33 No.108243331

Anonymous 02/26/26(Thu)10:21:33 No.108243331▶

File: 1760597068929036.jpg (93.5 KB)

93.5 KB JPG

Subject
Name
Comment
File	Supported: JPG, PNG, GIF, WebP, WebM, MP4, MP3 (max 4MB)
CAPTCHA

Reply to Thread #108238051

🔍 Search & Sort