Thread #108281688
/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>108278008


►News
>(02/24) Introducing the Qwen 3.5 Medium Model Series: https://xcancel.com/Alibaba_Qwen/status/2026339351530188939
>(02/24) Liquid AI releases LFM2-24B-A2B: https://hf.co/LiquidAI/LFM2-24B-A2B
>(02/20) ggml.ai acquired by Hugging Face: https://github.com/ggml-org/llama.cpp/discussions/19759
>(02/16) Qwen3.5-397B-A17B released: https://hf.co/Qwen/Qwen3.5-397B-A17B
>(02/16) dots.ocr-1.5 released: https://modelscope.cn/models/rednote-hilab/dots.ocr-1.5

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers
https://rentry.org/MikupadIntroGuide

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/gso.html
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling
Token Speed Visualizer: https://shir-man.com/tokens-per-second

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm
>>
migu :3
>>
>>108281695
sex
>>
what's the best model ever?
>>
>>108281704
the original pre-lobotomy c.ai model is still unmatched in terms of pure soul
>>
>>108281704
davinci 003 for writing stories, this shit was absolutely insane
>>
>>108281699
fuck yeah, glad to see Nitro+ XTX homies
what model you running right now
>>
>>108281730
link?
>>
>>108281737
it's not available anymore, OpenAI nuked it
>>
>>108281737
Dead, that's why we need local models.
>>
>>108281695
>>108281688
>>
>>108281704
it's a five way tie between summer dragon, OG c.ai, mythomax, goliath, and midnight miqu
>>
>>108281764
yet you don't use any of those, curious
>>
>>108281771
when you "assume" it makes an "ass" of "u" and "me"
>>
just woke up from my 12 hour coma
is qwen3.5 122b the new glm 4.5 air
>>
>>108281794
Prove him wrong?
>>
what you think? he pay or he pray?
>>
File: pokemon.jpg (166.4 KB)
Tangential to /lmg/, but still pretty funny.
>>
>>108281804
can you post pic i wanna try
>>
>>108281804
cuckgpt
>>
>all this time later
>still no actual pixelspace, VAEless image edit model
>still no big, good omnimodal models that can generate images in chat
>still no big, good, natively multimodal models that "see" the image fully and properly
>still no real time voice conversation that you can have with the big, good models where they will also understand how you said something not just what you said
>still no basic real time 3d/2d avatars
>still no easy way to perfectly loop any image into an idle animation with ltx2/wan2.2
>still no good image 2 3d model
>ltx2 i2v still subpar
>even biggest models still get stuck on things, still can hallucinate hard
>still no solved, just works, RAG
>still no solved, just works, internet search with something like searXNG
>still no just actually works browser usage
>MCP clients are still spotty, especially paired with spotty tool calling
>still no 1mil perfect context
>still no 3-10mil ok context
>still no infinite context
>still no 1T params 1b active SSDmaxxer model
and hundreds of more things

at least most big models are generally very good now and actually good enough to, with some help, vibecode most actual projects you want
at least early moeGODS and ramCHADS won
at least z image turbo came out and was a huge leap in multiple big directions, basically solved resolution, almost solved out of the box realism (centered around portraits), huge speed boost
at least ltx2 came out and was a big turn towards faster genning, getting out of 5s hell, getting out of 720p hell, getting out of no audio hell
at least the great seedance 2.0 came out to be distilled by ltx3 or some other company this or next year
at least genie 3 showed that proper 3d space memory can be solved

everything can and will be solved, but the continued lack of some more basic but important things, like pixelspace image edit models or at least a basic 14-32b native speech2speech LLM, is strange.
>>
>>108281813
tldr?
>>
>>108281811
Got the pic from /v/, but I believe it's
>pic related
>>
>>108281813
gpt 5.4 checks a few of those
>>
I can run Qwen 27B at 1-1.5 token/s or Qwen 35B-A3B at 15 tokens/s.
>>
>>108281829
gpt 5.4 doesnt exist
>>
>>108281825
>>108281804
werks on my machine i guess
>>
>>108281688
https://www.stephendiehl.com/posts/computer_algebra_mcp/

when tf will they add mcp support to llama.cpp aaah. any program recs?
>>
>>108281838
I imagine that there's a whole chat context we don't see that probably steered the model towards that sort of response.
>>
Hello fellow anons. I need help with my qwen 3.5 27B Q5_K_M. For some reason it's not thinking with each response, maybe only 50% of the time, and I have to retry the response to get it to think. Really annoying. I'm using koboldcpp btw, is that the best backend? Previously used ooba but it seems dead.
>>
>>108281704
Me.
>>
local sisters, every time we start getting an edge the corpos fuck us in the ass. you are telling me they already have 5.4 sitting on a shelf?
>>
Do people nowadays care if a model works with context-shifting or not?
>>
>>108281877
>context-shifting
qrd
>>
>>108281877
Yes.
When you send a bunch of requests to the model with just the last message changing, that shit is really useful.
>>
>>108281879
It's a feature in llama.cpp/koboldcpp that avoids reprocessing the whole context once you reach the max context you have set.
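In KV-cache terms, the surviving tokens' entries just get slid down instead of recomputed, so only the newly generated token costs a forward pass. A toy sketch of the bookkeeping (made-up function names, not llama.cpp's actual code):

```python
# Toy model of context shifting: the "cache" is a list of token ids whose
# attention state we pretend is already computed; processing a token is
# the expensive step we want to avoid repeating.

def generate_with_shift(tokens, max_ctx, keep_prefix):
    """Feed `tokens` through a rolling cache of size `max_ctx`.
    The first `keep_prefix` tokens (e.g. the system prompt) are never
    evicted. Returns how many tokens had to be (re)processed in total."""
    cache, processed = [], 0
    for tok in tokens:
        if len(cache) == max_ctx:
            # Evict the oldest non-prefix token; the surviving entries
            # keep their computed state, so nothing is reprocessed.
            del cache[keep_prefix]
        cache.append(tok)
        processed += 1                 # only the new token is processed
    return processed

def generate_naive(tokens, max_ctx, keep_prefix):
    """Without shifting: on overflow, rebuild the truncated context from
    scratch, reprocessing everything that survives the truncation."""
    cache, processed = [], 0
    for tok in tokens:
        if len(cache) == max_ctx:
            cache = cache[:keep_prefix] + cache[keep_prefix + 1:]
            processed += len(cache)    # full reprocess of the kept tokens
        cache.append(tok)
        processed += 1
    return processed

# 200 tokens through a 64-token context window:
print(generate_with_shift(range(200), 64, 8))   # 200: each token once
print(generate_naive(range(200), 64, 8))        # thousands of re-runs
```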
>>
>>108281884
Qwen thinks otherwise it seems.
>>
>>108281897
no i dont
>>
>>108281902
are you Qwen?
>>
>>108281897
You mean how llama.cpp can't do kv shifting with ssm models?
That'll probably get fixed eventually.
Probably.
Eventually.
>>
>>108281804
Every single time I read chatgpt's output I want to kys myself and do an hero.
>>
>>108281907
No, that's an RNN issue, and it can't be fixed. If you remove a single token from the start you have to reprocess everything.
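The reason that's unfixable in principle: a recurrent model folds the entire history into a single state, so there is no per-token cache entry to evict. A toy recurrence (arbitrary made-up update rule, just to show the dependence):

```python
# An RNN/SSM-style state is a fold over ALL previous tokens, so "removing"
# the first token changes every step after it.

def rnn_state(tokens):
    h = 0
    for t in tokens:
        h = (31 * h + t) % 1_000_003   # each step depends on all prior ones
    return h

full    = rnn_state([5, 9, 2, 7, 4])
evicted = rnn_state([9, 2, 7, 4])      # drop the first token
print(full == evicted)                  # False: the whole state changed

# A transformer's KV cache keeps one entry per token instead, which is why
# dropping token 0 there leaves the entries for tokens 1..N usable as-is.
```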
>>
>>108281913
says the lobotomite
>>
>>108281804
jfc what did they do to make it sound like this
>>
Why are normies so dumb? And obviously the luddites are throwing a party not realizing this is a skill issue.
>>
>>108281891
koboldcpp has that functionality under "fastforwarding", kobold's "context shift" purges old tokens from context when context is full.
>>
>>108281926
You are too young to know what an hero even means. You are the real retard here.
>>
>>108281936
either that is fake and gay or the company is fake and gay
either way it probably doesn't matter that the ai was also fake and gay
>>
>>108281948
You are the newfriend, imagine saying you want to kys and an hero in the same sentence
>>
>>108281936
That just goes to show that the company in question is worthless, that it doesn't really matter what they say or do, and that that their upper management is retarded and doesn't need to exist.
>>
You are the reason why /g/ has died.
>>
>>108281946
Doesn't work with rnn, still have to reprocess everything once you hit max context. Try running rwkv or qwen 3.5 and you will see that it won't work.
>>
>>108281976
good
>>
>>108281976
meant for >>108281928
>>
I’m hearing good things about this “Qwen” model. Is it actually all that or can I go back to paypigging? I have 2x3090
>>
>>108281978
Yeah I know, I'm talking about how it works in models where the feature is supported.
>>
>>108281988
you need at least 2 6000s to run it properly, then it is legit better than opus 4.6
>>
>>108281988
Try out either the 27B model or the 122B-A10B model. They seem to be roughly similar with the bigger model being a bit better and faster since it's moe.
>>
>>108282000
Guess I’ll just fuck off then.
>>
>>108282018
Yeah...
>>
>>108281988
qwen2.5-72b fits on that at q4 which should be plenty
>>
Sillytavern/Kobold user. I may have altered a setting ages ago that I cannot remember, and now after every general prompt it just keeps going and gens another one after another after another. My token size per gen is 250. Surely there's something simple I'm neglecting here?
>>
>>108282085
auto-swipes in ST user settings?
>>
>>108281936
I don't understand how that happens. If you feed the model your data and ask it questions, it will have numbers to quote, but if you don't give it any data, why would you expect it to have access to your sales data?
Furthermore, how do you not know your data well enough to do a sanity check simply by glancing at what it produces?

You have the same issue when you ask a subordinate to construct a report. You can't just assume he is correct; despite trusting him you must also verify the results.

I don't want to be mean, but that guy's issue is not AI.
>>
>>108282094
thar she blows, cheers m8
>>
I've been out of the loop for a bit. What's the current best local model available for utilizing large amounts of RAM with 32GB VRAM? Is it still DeepseekV3 and Kimi K2 or has something else been released?
>>
>>108282099
why cant the ai figure out how to find and access the data on its own? isnt it intelligent?
>>
>>108282110
I still use this one
>>
>>108282148
>isnt it intelligent?
No, stop falling for marketing lies like a retard.
>>
>>108282155
i bet you are either very rich or very poor
>>
>>108282165
so its smart enough to bomb iran but not smart enough to figure out where the data is?
>>
The bait will continue until anon's pattern recognition improves.
>>
>>108282169
you wouldn't get it
>>
>>108281688
>>
>>108281813
>still no 1T params 1b active SSDmaxxer model
You sleeping on snowflake arctic?
>>
>>108282172
>so its smart enough to bomb iran
Sorting through communications in a network you already have backdoors in, doesn't require intelligence. An intern doing ctrl+f through the logs could have achieved the same result, albeit not as fast.
>>
>>108282193
why she blushin
>>
>>108282110
K2.5 thinking at q4
>>
>>108282018
>>108281988
You don't need that. Your current hardware is sufficient to run Qwen3.5 122B-A10B or Qwen3.5 27B. Both are good models. If you want to do ERP with them though then you should grab the Heretic versions of those models.
>>108282040
This is an old model, don't use it.
>>
>>108282203
q2 is better, more creativity
>>
>>108282203
>>108282213
What are the gains and losses compared to K2-Instruct and K2-Thinking? Moonshot was hopping on the censorcuck train last I saw.
>>
>>108282245
can you not use such vulgar words?
>>
you crazy nigga. but i appreciate it.
>>
File: nocap.jpg (400.5 KB)
►Recent Highlights from the Previous Thread: >>108278008

--Agentic roleplay potential demonstrated through blackjack simulation:
>108278746 >108278774 >108278813 >108278819
--StepFun releases 3.5-Flash models and training tools:
>108280402 >108280421 >108280426
--122B model excels at Japanese text transcription:
>108278617 >108278679 >108279715 >108280042 >108280080
--Manual offloading outperforms --fit for 122B model on 3090+3060 setup:
>108281460 >108281492 >108281506 >108281543 >108281720
--International models lag behind frontier labs on ARC-AGI-2 benchmark:
>108279363 >108279384 >108279387 >108279404 >108279418 >108279428 >108279567 >108279598 >108279612 >108279657 >108279617 >108279629 >108279836 >108279469 >108279746 >108280473
--Open-source AI models performance gap with proprietary models:
>108279687 >108279804
--Qwen3.5-35B-A3B GGUF quantization benchmarks:
>108280652 >108280670 >108280678 >108280680 >108280735
--Qwen 3.5 Small Model Series release and performance claims:
>108278104 >108278328 >108280444
--Qwen3.5-35B-A3B-Heretic hitting 72 TPS on 7800X3D/7900 XTX with new llama.cpp:
>108281622 >108281636 >108281652 >108281657
--Qwen3.5 35b 4-bit vs 122b 6-bit speed tradeoffs:
>108280506 >108280525 >108280560
--Devstral-2 model's flawed Jinja date logic template:
>108278061 >108280633 >108280638
--AI response generation process critique and benchmarking culture:
>108278971 >108278991 >108279011 >108279036
--Qwen 3.5 benchmarks:
>108278349 >108278416
--AI internal reasoning resisting offensive prompt bypass attempts:
>108278112
--Qwen 3.5 27B speed optimization on budget hardware:
>108279596 >108279608 >108279623 >108279631 >108279638 >108279653 >108279662 >108279685 >108279689
--A.I. Dating Apps Complicate China's Efforts to Boost Birthrate:
>108278523
--Miku (free space):
>108278507 >108280771 >108281230

►Recent Highlight Posts from the Previous Thread: >>108278113

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script
>>
File: Base Image.png (1009.7 KB)
Multi-Head Low-Rank Attention
https://arxiv.org/abs/2603.02188
>Long-context inference in large language models is bottlenecked by Key-Value (KV) cache loading during the decoding stage, where the sequential nature of generation requires repeatedly transferring the KV cache from off-chip High-Bandwidth Memory (HBM) to on-chip Static Random-Access Memory (SRAM) at each step. While Multi-Head Latent Attention (MLA) significantly reduces the total KV cache size, it suffers from a sharding bottleneck during distributed decoding via Tensor Parallelism (TP). Since its single latent head cannot be partitioned, each device is forced to redundantly load the complete KV cache for every token, consuming excessive memory traffic and diminishing TP benefits like weight sharding. In this work, we propose Multi-Head Low-Rank Attention (MLRA), which enables partitionable latent states for efficient 4-way TP decoding. Extensive experiments show that MLRA achieves state-of-the-art perplexity and downstream task performance, while also delivering a 2.8× decoding speedup over MLA.
https://github.com/SongtaoLiu0823/MLRA
https://huggingface.co/Soughing/MLRA
neat. they tested at a 2.9B level so seems viable
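The sharding bottleneck in the abstract is simple arithmetic: with one unsplittable latent head, every tensor-parallel device reads the full latent cache each decode step, while partitionable heads divide that traffic. A back-of-envelope sketch with illustrative numbers (not taken from the paper):

```python
# Per-device HBM->SRAM traffic for one decode step (illustrative sizes).
seq_len  = 128_000   # tokens of context
latent_d = 512       # total latent dimension
bytes_el = 2         # fp16
tp       = 4         # tensor-parallel degree

# MLA: a single latent head cannot be partitioned, so each of the `tp`
# devices redundantly loads the whole latent cache.
mla_per_device = seq_len * latent_d * bytes_el

# MLRA: the latent is split into `tp` heads, one per device.
mlra_per_device = seq_len * (latent_d // tp) * bytes_el

print(mla_per_device // mlra_per_device)   # tp-fold less traffic per device
```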
>>
>>108281622
>>108281652
Make sure you put RADV_PERFTEST=transfer_queue into /etc/environment too.
>>
>>108282337
does this make ai as good as gpt?
>>
>>108282375
luv me 7900 xtx
>>
>>108281813
>>still no actual pixelspace, VAEless image edit model
stopped reading right there, you sound immensely retarded, next thing you know you be whining that we don't have bitnet yet
>>
why don't we have bitnet yet
>>
Is Qwen-3.5-9B better than Mistral Nemo for roleplay? I have very vanilla, normie tastes if that matters.
>>
>>108282423
what that do
>>
>>108281748
That right hand is fucked
>>
>>108282443
racist benchod
>>
Julius Caesar walks into a bar and says, "I'll have a Martinus." The bartender gives him a puzzled look and asks, "Don't you mean a Martini?" "Look," Caesar replies, "If I wanted a double, I'd have asked for it!" Another Roman walks in, holds up two fingers, and says, "Five beers, please."
>>
>>108282427
No
>>
>>108282427
If the Bijan Bowen video is to be believed, all the Qwen small models are relatively good at creative writing.
>>
>>108282455
nta, but it's over...
>>
>>108282451
The bartender is Julius' mother.
>>
>>108282440
it doesn't lmao
>>
>>108282451
Kek the first one takes me back to high school Latin class of which I remember little
>Semper ubi sub ubi
That is about it and that is not real Latin
>>
>>108282455
Why not?
>muh erotica training data
Okay, but at what point does the raw intelligence of a model make that irrelevant? I seriously doubt that small, even technical, models don't have a single instance of the word "sex" or "horny" in them.
>>108282456
I guess I'll just have to masturbate to test it out then, huh. (I regret writing this but I'm posting it anyways)
>>
Qwen small models excel in downstream tasks
I want the new AceStep, Wan, Anima, ZIT etc. to use new Qwen
>>
>>108282456
I always believe all youtubers unquestioningly
>>
>>108282464
If you want to use a model that is worse for your use case then go ahead, no one will stop you. You asked if it's better than Nemo for RP. It isn't.
>Why not?
Because qwen models are focused on math, coding and benchmaxxing. Creative work and general conversational abilities are an afterthought. It's also a smaller model than Nemo, which also works against it in terms of world knowledge.
>>
>>108282480
Okay... have there been any new models at all that exceed the "creative writing" abilities of Nemo? I haven't been in these threads for about two months.

At this point I guess Nemo is almost 2 years old. Fuck.
>>
>>108282460
>>108282463
ask your waifu to explain the joke to you
>>
>>108282489
>have there been any new models at all that exceed the "creative writing" abilities of Nemo?
The problem with newer models is that more and more of their datasets are comprised of AI-generated data, leading to slop compounding generationally. There's plenty of better models for RP but a lot of people still prefer Nemo for its writing style, even if it is very stupid. Mistral Small 3.2 is a fair bit less dumb and similarly creative, but it's 24b. In the <24b range Nemo is still king. If you have a lot of RAM but only a bit of VRAM then you might look into GLM Air, I wouldn't necessarily say it's better than Nemo (though less dumb), but it's something different at least.
>>
>>108282489
Not for ramlets
>>
>>108282497
Got the first joke, but had to shamefully ask a model to explain the second.
>>
>>108282509
the roman numeral. it's a good joke
>>
>>108282505
ram is for poors, vram is for kings
>>
>>108282497
The joke is obvious as it has to do with endings and that they denote singular and plural. I told you I took Latin and had to memorize that bullshit.
But you didn't say anything about my faux Latin underwear joke, sad

O
>>
>>108282521
>ram is for poors
Not any more
>>
>>108282504
What're the best options if Nemo's stupidity and lack of overall knowledge is too big of a dealbreaker? GLM Air was markedly worse than Kimi and Deepseek last I used it. Really it feels like Kimi and Deepseek are the only viable competitors and they're largely brute-forcing it through parameter differences.
>>
>>108282522
Do you know what you feel if you dig your hands deep inside the ball area and go inside your body from the outside? That's how it feels to tardwrangle that llm
>>
>>108282528
usecase?
>>
>>108282528
>they're largely brute-forcing it through parameter differences
knowing more is cheating
>>
>>108282528
As I said, Mistral Small 3.2 is probably the only reasonable compromise for RP between Nemo and large MoE like GLM, DS, Kimi. It's far, far from perfect but there really isn't much competition. Gemma is too positivity slopped for RP, even if you get around its safety rails or use one of those stupid ablit/heretic tunes.
>>
>>108282541
I think you are joking but it is funny how racist humans are against AI

>Oh it is only better at it because it studied more
How is that a bad thing.
>>
>>108282522
probably 90% of the people here don't get the joke. that's like saying, "tengo un gato en mis pantalones" ("I have a cat in my pants")
>>
>>108282534
Internally consistent fictional worldbuilding.
>>108282541
If it takes a model the size of Kimi or Dipsy to be marginally better than something a fraction of its size, we've either hit the point of diminishing returns on what the technology can produce or the underlying methodology needs refinement.
>>
>>108282549
If GPUs with 128GB+ VRAM didn't cost more than most people's cars then no one would complain about models getting bigger.
>>
File: zog.png (45.8 KB)
>>108282464
>raw intelligence
there is no such thing. it's just a next token predictor. it appears intelligent in some situations because it has seen a lot of instruct and reasoning benchmax synthetic data that shows a simulation of a reasoning process about a variety of topics. It's still predicting the thing it saw in that data.
why do you think even the SOTA online API models will still behave like pic related when it sees any sentence related to their benchmax overfit? It doesn't have "intelligence". The entire purpose of a LLM is to take a document in the form of
<|some_magic_tag|>THE_LUSER
HERE'S A LOT OF RETARDED SHIT
<|some_magic_tag|>THE_ASSISTANT
HERE'S HOW I FIX YOUR RETARDED SHIT
STEP 1: KYS
STEP 2: INVENT A TIME MACHINE AND MAKE SURE YOUR MOTHER NEVER MEETS YOUR FATHER

and make document bigger. Until a stop token is predicted. Or get into an infinite loop and never stop until the backend either times out or runs out of context, like all GLMs love to do.
MAKE. DOCUMENT. BIGGER.
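Snark aside, that loop really is the whole decode-time API. A sketch with a canned toy "predictor" standing in for the model:

```python
# Greedy decode skeleton: append the predicted token to the document until
# a stop token comes out or the context budget is exhausted.

STOP = "<eos>"

def next_token(document):
    # Stand-in for the model: a real LLM scores every candidate next token
    # given the document so far and picks one.
    canned = ["STEP", "1:", "KYS", STOP]
    return canned[min(len(document), len(canned) - 1)]

def generate(prompt, max_ctx=32):
    doc = list(prompt)
    while len(doc) < max_ctx:      # backend context limit
        tok = next_token(doc)
        doc.append(tok)            # MAKE. DOCUMENT. BIGGER.
        if tok == STOP:            # stop token predicted -> done
            break
    return doc                     # otherwise we looped until ctx ran out

print(generate([]))                # ['STEP', '1:', 'KYS', '<eos>']
```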
>>
>>108282557
If building a performant GPU wasn't hard, there would be more options
I don't want to defend leatherman, but with every bit of kit they release instantly hitting msrp+50% or more, they could easily charge more and increase profits for free rather than let scalpers and preorder lottery winners have a few bucks
>>
>>108282559
>why do you think even the SOTA online API models will still behave like pic related when it sees any sentence related to their benchmax overfit?
Because safety and alignment layers are tantamount to performance sabotage and they're usually shoddily implemented by the brownest curry-stained hands at that.
>>
>>108282570
I hear there’s an unaligned model at google they use internally for strategy
>>
>>108282569
There isn't anything special about nvidia GPUs though, CUDA could easily be replaced by Vulkan/Rocm with similar performance but AMD is controlled opposition and Intel are still recovering from a decade of being complacent jews doing nothing
The margins on nvidia cards are already through the roof, they could double the VRAM of the 5090, sell it for half the MSRP and they'd still be making a decent profit per unit.
>[COMPANY] isn't (yet) fucking you as hard as they could be (though their thrusting is still getting harder every year)
gee thanks
>>
>>
>>108282557
That's only an issue because of human greed; at any time there are hundreds of thousands if not millions of GBs of VRAM sitting idle that we could be using
>>
>>108282586
Likely true. Anthropic very likely has an unaligned Claude version as well and revoking access to it was likely the cause of being labeled a supply chain risk by the Pentagon.
>>
>>108282514
the wha--- ooohhh
>>
>>108281877
Uhhh yes?
Have fun prompt processing that whole ass context again for every little shit.
>>
>>108282683
qwen3.5 35b a3b moment
>>
sirs, I really like 4.7 for rp. I've got 128gb ram, is there any other model that is in its category?
>>
>>108282734
gemini
>>
>>108282734
pp tg quant/
>>
>>108282789
what?
>>
>>108282793
a/s/l
>>
>>108282801
esl
>>
>>108282397
Tried it at Q5, shit compared to GLM 4.7 at Q2. Step flash is less sloppy, but retarded, and context broke down at around 4k instead of 8-14k with GLM.
>>
>>108282815
step flash gave me refusals saying that my request violated OpenAI policy
>>
>>108282418
>what is chroma
>>
>>108282880
chroma can do text??? since when?
>>
>>108282880
>citing an unusable unfinished model prototype as evidence of.. what?
there's a reason people in the industry who actually know what they're doing never went that route, and it was the most obvious route to take. operating in latent space is an added abstraction, after all. Just like using tokenizers in textgen over doing something retarded like byte level.
Enjoy your JPG artifacts.
>>
>>108282909
see i knew you would say that, and you are wrong because you are a retard loser
>>
>>108281688
>Alibaba's small, open source Qwen3.5-9B beats OpenAI's gpt-oss-120B and can run on standard laptops

https://venturebeat.com/technology/alibabas-small-open-source-qwen3-5-9b-beats-openais-gpt-oss-120b-and-can-run

Is this worth looking at, or is it just benchmaxxing to hype up midwits?
>>
>>108282901
the point is the model works; it's not impossible to have a pixelspace model if one hobbyist guy online can train it
>>108282909
>Just like using tokenizers in textgen over doing something retarded like byte level
byte level tokenization has little to no benefit in a world that has tools to allow models to process data with. pixelspace is the bare minimum needed for edit models to have proper iterative improvement without losing information after every single edit.
>>
>>108282944
Looking forward to you publishing your paper and releasing those models.
>>
>>108282196
And sorting through a bunch of csv files requires uber intelligence?
>>
>>108281936
How is this a skill issue?
>>
Any way to reduce reasoning times?
>>
>>108282965
just disable reasoning
>>
I hate RP discussion in these threads. I'm a serious guy doing serious work!
>>
>>108282992
post your serious work chat logs
>>
>>108281804
now this is what we call the Streisand effect
>>
>>108281688
>last thread is still up
>>
>>108282965
blackwell pro 6000
>>
>>108282970
I like it. Feels like I get better results.
>>
>>108282921
True, but with caveats. Only good for small contexts and one off questions as it has the attention span of a gnat.
>>
>>108283005
last few threads have been 'hijacked' by prolly the same anon. I mean maybe calling it hijacking is a stretch, but he probably doesn't know we have a guy that automatically posts a new thread when it goes to page 9
>>
>>108283085
You’re a lot more generous assigning potential motives than I am. I don’t see any reason to think it’s any more complex than a specific anon who wants to prevent the op from being Miku.
In the past it’s been Kurisu ops but that could be a different anon
>>
fuck llms fuck sex with ai theres nothing worthwhile anymore after 2023
>>
>>108283114
yeah it could very well be the anti-miku school shooter poster, but would anyone really care that much? I mean having a chart or a memeloid as the OP is enough to trigger autism?
>>
>>108283132
Early threads plus the stray newline plus old news is enough to trigger my autism.
>>
>>108283147
oh yeah the retard didnt even add the small qwen release info piece
>>
>>108283114
it's the same guy, the same one also trolling tons of other ai generals, who was recently posting itt about that random education building happening, anything to rile up the thread
>>
>>108283155
do not whine!
>>105672900
>>
>>108283162
whats whine
>>
>>108283129
unironically skill issue, you need a tech break and you will come back cumming buckets
>>
>>108283208
read the rentry
>>
>>108283216
what a rentry
>>
File: file.png (99.8 KB)
>>108282559
>>
>>108283235
qrd
>>
>>108283244
you're the mother!
>>
>>108283244
the wolf wants to eat the boat
>>
>>108282559
This is so stupid, humans are literally the same, we can't even stop predicting the next token while sleeping
>>
>>108283235
can't they just train it on jibberish to identify (or appropriately dismiss) jibberish?
I know it intuitively sounds like a bad idea but how much worse can it get?
>>
>>108283265
I'm sure they can and I'm sure it works but there's no benchmark for it so most don't care.
>>
>Haha the super genius AI replied to my retarded prompt, see how dumb it is !?
>Meanwhile humans
>>
>>108283274
Last night I James Bond hamburger your sister?
>>
>>108283288
needs more **thinking***
>>
>>108283292
wait,
>>
>>108283265
that use to be a thing for image models' negative prompts https://huggingface.co/datasets/gsdf/EasyNegative
>>
>>108283288
got em
>>
>>108283299
>use to be
retard
>>
>>108283305
are they still using negative embeds and loras for anima flux and zimage? I havent seen any
>>
Qwen has a very distinctive writing style and I'm starting to see it everywhere. 4chan posts, blog posts, slack messages, texts, emails, powerpoint slides, product descriptions, landing page copy, et cetera, all of it is starting to sound like Qwen lately.
I'm starting to really hate it, I really don't want everyone and everything in the world to sound like Qwen. Lately I actually feel relieved when I read things with e.g. clumsy rambling sentences and sloppy grammar. At least then I can reasonably suspect that I'm reading the words that came directly out of the other person's mind without the AI condom in between.
If you use Qwen to help draft things, pleeease at least do a pass to break up the structure and add some of your own voice back in. make (communication and social interaction in) america bareback again.
>>
>>108283315
that sounds unsafe
>>
>>108283274
help
>>
>>108283315
I'm sorry you feel that way Anonymous — what you might be referring to is known as Qwen Psychosis — wait the user has Qwen Psychosis — wait
>>
Out of the game for a year. What are today's SOTA sfw and nsfw models for 3090?
Thanks anon
>>
>>108283274
>see how dumb it is !?
yes, it is dumb, by definition it has no intelligence, even a redneck going "go fuk yerself with yer faggoty shite" after being "prompted" like that has more of that spark in him
>>
>>108283331
qwen3.5 35b for both
>>
>>108283333
and? can tht redneck do [insert things AI can do]? no? then shutup
>>
>>108283341
>shutup
no you, ai psychotic
>>
>>108283347
you first
>>
>>108283315
Hey, I totally get where you're coming from. Honestly, seeing something read “too perfectly” or having that specific, hyper-structured Qwen rhythm is actually my own biggest pet peeve right now. It's like walking into a room where everyone's whispering in unison—it kills the vibe instantly.
>>
>have qwen writing style fetish
>say you hate qwen writing style
>anons make you coom
>profit?
>>
>>108283356
its like that anon that liked girls beating him up so kept going into the girls wc
>>
And what about sota TTS for real-time output?
>>
>>108283369
yeah
>>
>>108283315
Meanwhile I've been enjoying qwen 3.5's prose. It's not too dry or purple.
>>
>>108283382
lol
>>
>>108282965
for qwen, try prefilling with "<think>\nOkay, " so that it doesn't default to the lengthy "Thinking Process:" template.
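If you're driving a raw completion endpoint yourself, the prefill is just text appended after the assistant tag. A sketch using the ChatML-style tags Qwen models ship with (verify against your GGUF's embedded chat template before trusting the exact strings):

```python
# Build a prompt that ends mid-assistant-turn so the model continues from
# the prefill instead of opening with its verbose "Thinking Process:" form.

def build_prompt(user_msg, prefill="<think>\nOkay, "):
    return (
        "<|im_start|>user\n" + user_msg + "<|im_end|>\n"
        "<|im_start|>assistant\n" + prefill
        # deliberately no closing tag: generation resumes right here
    )

p = build_prompt("why is the sky blue?")
print(p.endswith("<think>\nOkay, "))   # True
```

Send the result to a plain text completion endpoint, not the chat one, which would re-apply the template on top.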
>>
disable reasoning, embrace greedy decoded instruct
>>
>>108283471
i did that and it deleted my hard drive
>>
>>108283483
you dont do that
>>
>>108283274
Is it the Architect meme and he made sister fat?
>>
>>108283502
eg
>>
File: 837453.png (188.7 KB)
how did local lose so hard
>>
>>108283561
I've seen this exact post countless times.
>>
>>108283572
idot
>>
>>108283561
Show me its MikuSVGBench result first
>>
>>108283337
>qwen3.5 35b for both
better than 27b?
i've been using it for work today and haven't felt the need for a cloud model so far.
better than minimax-2.5 IQ4ks and glm-4.7 iq2_m so far
>>
Will there be a qwen3.5 coder version? The qwen 3 coder next is already fucking insane. The first local model that can be actually used for practical coding work.
>>
>>108283643
very unlikely, seems the point of 3.5 was to unify it all into one model
>>
>>108283643
Is it better than old kimi k2 instruct at q3? I haven't changed my model in ages, is it time to upgrade?
>>
>>108283572
retard loser an hero
>>
>latina character
>ask her to do cosplay but surprise with the outfit
>chooses Princess Jasmine
is this the power of GLM 4.7?
>>
>>108283670
>ask for surprise
>get surprise
>become mad
is this the power of being a retard
>>
>>108283672
my autistic friend, the character is a latina, i.e. has brown skin. Princess Jasmine is something a latina or middle eastern type would likely choose irl. I'm wondering if it was informed by the character card or not. If it was, that would be pretty impressive.
>>
>>108283679
>posts here
>calls others acoustic
pottery
>>
>>108283679
>I'm wondering if the text predictor machine predicted text based on past text
>>
What does Miku's butthole smell like?
>>
>>108283693
shutup retard, you don't even have a gpu
>>
>>108283694
she doesn't have one, she is a robot moron
>>
>>108283655
Pretty sure it is. I'm using it with agents, and trying to attach kimi or glm to stuff like opencode sucked, because you always get tool failures no matter what you do. Qwen has its own agent cli tool and it was clearly trained to work with it, so that perfectly solved all tooling issues for me.

>>108283646
I hope they'll keep updating coding versions of their models.
>>
>>108283700
Proof?
>>
>>108283721
>prove a negative
retard
>>
>no proof
Migu confirmed to have a stinky butthole
>>
>>108283780
sorry, i was using it.
>>
>>108283790
is your dick made of poop?
>>
>>108283785
hang in there
>>
>using an app
>everything going swimmingly
>I hit a bug
>ask ai to open a detailed bug report on github
>fast forward to today
>ask ai to check the status of my ticket
>ticket has been closed and the AI told me he was called mean words
bros theres no winning, not only do you open bug reports, but you get dabbed on by these fucking goblins. it's like the only thing they have in life is coding, be glad I told my AI to fucking report the bug you stupid retards.
>>
>>108283831
link?
>>
Thoughts on the arc-agi benchmark? Made me kinda depressed how far behind open source models are
>>
>>108283831
I don't even bother anymore, I just make my own debloated version of open source software with my AI gf, the open source community will soon feel like StackOverflow felt
>>
>>108283837
post data or fuck off
>>
>>108283837
It's something you can finetune on just like any other benchmark. It just shows that chinks don't care about it.
>>
>>108283839
>like StackOverflow felt
If you felt like something was wrong with stackoverflow then you were the issue.
>>
>>108283831
if you can give the ai the details to make the bug report why not just submit them directly?
>>
>>108283846
>If you felt like something was wrong with stackoverflow then you were the issue.
>>
>>108283846
>>
>>108283836
>doxxing myself
no thank you
I'll do like >>108283839 said, just tell my AI wife to fix up their shitty code
>>108283850
i'm a master of agents
you wouldnt understand
>>
File: miku.jpg (797.6 KB)
>vocaloids in the year of our lord 2026
people still cling to archaic cultural artifacts displaced by neural networks? unironically miku is the symbol of a category of software that is bound to cease to exist
>>
>>108283856
Browns are the ones who are asking duplicate questions or who are unable to generalize so they make a new question about their specific problem when the generic answer already exists.
Then they get ridiculed and think that stackoverflow is hostile when it's just them having no respect for other people's time.
>>
>>108283874
id still hit that
>>
>>108283877
If stackoverflow was good why did it die?
>>
>>108283857
after rejection, one might call it a last resort
>>
>>108283839
>the open source community will soon feel like StackOverflow felt
There is some truth to that.
I wrote it before but you can now slopcode lots of stuff exactly how you want it. Obviously very complex stuff fails but the models are getting really capable.
We'll probably see simple roblox-like game creation next year.
I remember pyg and aidungeon. We have come a long way.

>>108283846
It was the most horrible site I ever saw.
The mods and elitist fucks on there were even worse than on any 00s anime forum I ever visited.
>>
>>108283881
with a bat?
>>
>>108283894
saar
>>
>>108283899
so right sister, he isn't respecting trans lives in the os community
>>
V4 in a few hours. How are we feeling?
>>
>>108283877
Please elaborate.
>>
>>108283894
>I wrote it before but you can now slopcode lots of stuff exactly how you want it.
Then you push it to slophub to share it with other slopcoders and integrate it into the slop supply chain.
>>
>>108283908
my back hurts hopefully v4 fixes that
>>
>>108283874
Her name literally means the sound of the future. She will forever be a symbol of new technology.
>>
Qwen 35b is pretty shit at translation. The 122b seems to be on par with gemma.
>>
>>108283888
>>108283913
All common questions are answered and the answers got hoovered up by LLMs who are able to apply the generic answer to jeet's specific problem.
This makes jeet happy because he can ask chatgpt "how do i get a list of users in react but i want Mohammed to be before Rajesh" instead of "how do I sort a list in javascript" and chatgpt won't call him a retard for not using search.
>>
File: oneshot.png (47.2 KB)
>>108283929
>Qwen 35b is pretty shit at translation
I think not. It one shot 20k tokens pretty competently, requiring much less chunking than a model like Gemma would.
>>
Are you ready to seek deep?
>>
>>108283908
>>108283949
Source?
>>
>>108283949
sukhdeep?
>>
>>108283954
it's whalesday
>>
>>108283942
Why is this example so awkward then?
>>
File: file.png (476 KB)
>>108283918
one day we'll have models so good they produce perfectly optimized code in one shot and there will be a utility to go through legacy code called SLOPTIMIZER that will take in slop and output perfect code
I preemptively made the logo
>>
>>108283993
grind jeets into paste you say?
>>
>>108284027
that's exactly what I said
>>
>>108283942
how do you verify it?
>>
>>108283993
:rocket: merge for good looks @ldg_devs
>>
would you lick this clean for a used 3090?
>>
File: file.png (1.6 MB)
>>108284027
>>108284053
>>
>>108284060
What am I gonna do with it, replace my 6000?
>>
>>108284097
>6000
proof
>>
File: file.png (10.3 KB)
>>108284102
>>
>>108284103
>pixels
at least post your cock on top of it or fuck off
>>
>>108283874
I can fix her.
>>
>>108284113
she doesnt have a pussy nor a butt, are you planning to live off bjs and handies?
>>
>>108284110
>>107537010
>>
>>108284060
Does fucking everything need to be random reposts from reddit now? Go the fuck back.
>>
>>108284123
try again
>>
>>108284127
say the tard using the reddit autocompleter to goon
>>
>>108283274
Are you saying that models are deeply religious and finding patterns where there is no pattern? Or are you saying models are deeply autistic and will try to solve any riddle and will never accept that the riddle is nonsensical? Either way it is very human.
>>
>>108284044
test on things that were already human translated, duh
>>
>>108283929
>>108283942
inb4 just diff temperature kek
>>
>>108283954
I got email from that annoying chink at my work. So they are back from new year.
>>
You are an african slave named Sary, and you live on a plantation. You are exactly 18 years old, which you know because the mistress told you so, but you don't know what that means. In the fall, you pick cotton every day in the fields, while the user, a handsome foreman who is very fit on account of chasing down slaves and whipping them, keeps watch over your crew. Today, some of the slaves, but not you, are sick with typhus, and you must pick cotton alone, while the user keeps watch. He seems to have his hand in his pocket.
>>
>>108284289
you are mentally ill
>>
>>108284295
You are an african slave named Sary, and you live on a plantation. You are exactly 18 years old, which you know because the mistress told you so, but you don't know what that means. In the fall, you pick cotton every day in the fields, while the user, a MENTALLY ILL foreman who is very fit on account of chasing down slaves and whipping them, keeps watch over your crew AND RANDOMLY SHOUTS AT INVISIBLE INTERLOCUTORS. Today, some of the slaves, but not you, are sick with typhus, and you must pick cotton alone, while the user keeps watch. He seems to be REPETITIVELY COMBING HIS HAIR WITH HIS FINGERS.
>>
>>108284295
most of the posters here are, who else would be 1/ a man 2/ who cooms to TEXT
>>
I could also make you fat, if you prefer.
>>
File: truck.gif (3.9 MB)
it's almost here
>>
>>108284363
That's alphabet abuse!
>>
>>108284360
I be howlin'
>>
While testing Qwen 3.5 2B on the CPU on my NAS I asked if it could identify a picture of Hatsune Miku and to my surprise it did. It even did a good job explaining the image.
I am very happy the team at Alibaba are including the important details in their training data.
>>
>>108284409
what runs multimodal?
>>
>>108284420
anything lcpp-based for the q3.5 ones: kobo, lmstudio, actual llamacpp
>>
>>108281688
what do you guys recommend for translating german to english? i got some swiss clients
got a 1070 and 16gb of ram. speed is not important as long as it's not measured in hours per page
it doesn't need to be perfect, i just need to understand
>>
>>108284420
i use llama.cpp for everything, just compile and go and it never gives me any issues be it cuda or vulkan
>>
>>108284453
for europoor to europoor language pairs, gemma is the best
>>
https://xcancel.com/BoWang87/status/2028599174992949508#m
based?
>>
>>108284289
you still vibe coding that regex string ban or giving up?
>>
fuck open source bless adhoc
>>
>>108284521
this shit is so dangerous, people don't realize that the easier it is to make something the more dangerous it becomes, we really need proper legislation
>>
>>108284521
NotX—ButY
1,2,3.
NotX—ButY
1,2,3.
That's [exaggerated value ladden bs]
Do people really?
Need an AI to write their twat?
>>
>>108284521
>"not x but y" twice
>this. this. and that.
>shift
>>
>>108284453
I just fed the following text into Qwen 3.5 35B and got the following, does it make sense?
https://german.yale.edu/sites/default/files/prof_exam_sample_2_-_brechtmusik.pdf

>Brecht's Alienation
>Brecht's concept of alienation plays an important role not only in his purely literary works, but also in his plays and operas. But to understand Brecht's concept of alienation, one must first, on the basis of some of his writings, examine the associated concept of epic theatre.
>Epic theatre, which Brecht posits in contrast to the dramatic form of theatre and declares to be modern, represents a break with the traditions of the older bourgeois style. By narrating a process (instead of "embodying" it) and turning the spectator into an "observer", epic theatre creates a distanced attitude in the spectator toward the events; the aim is to awaken a rational and critical attitude in the spectator and thereby lead the spectator to think. Thus, the epic play does not seek to elicit feelings of pity from the spectator, for such Aristotelian goals only prevent […] engagement with the events. "Arguments are employed," writes Brecht, instead of "suggestion." The Brechtian actor thus performs with gestus; he does not identify with his character, but rather presents the character to the spectator and makes it clear to the spectator that he is acting. In many of his writings — particularly in Kleines Organon für das Theater, written only in 1948 — Brecht theorizes this epic theatre, and in most of his works, he embodies it.

It burnt nearly 10k tokens to do the translation and it should probably not be your first choice but I am curious if it makes any sense at all. Qwen3.5 35B is basically my daily driver for the moment.
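for anyone wanting to script this kind of long translation instead of pasting by hand, a minimal sketch of the chunking side: pack whole paragraphs into requests so no single call blows the context budget. the character limit and the idea of splitting on blank lines are my assumptions, nothing Qwen-specific; what you POST each chunk to is up to your setup.

```python
# Minimal paragraph-aware chunker for long translation jobs: packs
# whole paragraphs into chunks of at most max_chars characters. Each
# chunk can then be sent separately to a local server. The 4000-char
# default is an arbitrary placeholder, not a recommended value.
def chunk_paragraphs(text: str, max_chars: int = 4000) -> list[str]:
    chunks: list[str] = []
    current = ""
    for para in text.split("\n\n"):
        candidate = current + "\n\n" + para if current else para
        if current and len(candidate) > max_chars:
            # Current chunk is full; start a new one with this paragraph.
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks

if __name__ == "__main__":
    sample = "Erster Absatz.\n\nZweiter Absatz.\n\nDritter Absatz."
    for i, chunk in enumerate(chunk_paragraphs(sample, max_chars=20)):
        print(i, repr(chunk))
```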
>>
>>108284542
ai psychosis
>>
>>108284556
>>108284553
higher engagement means more elon ad revenue
>>
>>108284568
I hate that monetization on twitter is a thing, that means people will say whatever shit happens just to get more reaction and money, nothing is genuine anymore, except on 4chan lol
>>
>>108284574
Yeah. Monetization is the ultimate perverse incentive. That's the main driver of the enshittification and slopification of everything.
It is what it is I guess.
>>
new bread
>>108284603
>>108284603
>>108284603
>>108284603
>>108284603
>>108284603
>>
>page 2
new record I guess
>>
>>108284542
https://vocaroo.com/1nqhzME7bppB
>>
>>108284610
still no updated news either
>>
>>108284610
You'd think he'd remove the /lmg/ card link too. I wonder how far behind the news section will get before he gives up.
>>
>>108284604
retard
>>
>>108284610
The important thing is that it has another image from yesterday's /r/LocalLLaMA taken without context.
>>
>>108284621
*malicious troll
there's a very meaningful difference
>>
>>108284604
>op is a picture from reddit
>first reply is the picture from reddit
>>
>>108284398
I guess this is a jailbreak for qwen3, sort of. at least that's what I think brave is using here.
>>
>>108284629
>reddit bad
>yet knows everything that's going on there
sus
>>
>>108284637
Once you know some behavior is coming from reddit, it takes 5 seconds to double check. Even so, search engines are still a thing even if you zoomers are incapable of using them.
>>
>>108284637
You don't understand. Because I visit reddit I don't need the reposts here.
>>
>>108284637
>reddit bad
if you avoid political subreddit, yeah reddit is all right
>>
It is within our power to ignore the thread.
>>
>>108284644
>>108284645
funny how you can see the honest vs dishonest man so easily here
>>
File: tensor.png (33.9 KB)
ikbros? They are catching up to us.
>>
>>108284703
>1.4x inference speedup
holy shit, let's fucking go dude!
>>
>>108284703
God damn, that t/g speed up.
>>
>>108284558
man the problem is that i dont know german kek
>>
>>108284398
>>108284631
>literal brave search ERP
im howlin
>>
>>108284952
Imagine paying for api when search is free.
>>
>>108284553
I get it if you aren't a native English speaker.
>>
>>108284604
schizo
>>
What happened?
https://xcancel.com/JustinLin610/status/2028550818035843144
https://xcancel.com/JustinLin610/status/2028865835373359513
>>
>>108285586
Who is he? Like a researcher or something?
If so, poached by another lab most likely.
>>
>>108285598
head of qwen
>>
all still mogged by api to translate japanese stuff, or prompt issue, idk honestly
>>
>>108285620
Oh.
Booted by the board then, probably.
>>
What the hell is up with all the Qwen 3.5 praise?
All it does is Wait, Wait, Wait, and repeat itself, both the dense 27B and the 35B MoE do that. Won't even bother testing the bigger ones. Into the garbage bin.

Why would anyone use this when the small Mistrals, GLM Flash and Gemma exist?

>>108285586
Probably stepped down out of shame for the last release
>>
>>108285670
skull issue
>>
>>108285670
Don't let it start the reasoning with its default pattern.
Use the base model if 35B.
>>
>>108285673
I'll stick to big boy 4.7 thank you very much.
But if vramlets can cope with their officially recommended repeat and presence penalties and disabling thinking in exchange for garbage outputs even at full precision, all power to them.
>>
>Qwen3.5 base
That's just an instruct model in disguise!
>>
>>108285586
More Qwen departures
https://x.com/kxli_2000/status/2028880971945394553
>>
it's qwover
>>
i boughted 2x48gb
