Thread #737137393
File: sillytavern_logo.png (115.6 KB)
GLM, Deepseek, Kimi, or Gemini Flash for filthy poorfags such as myself?
>inb4 local
I don't have the hardware to run a decent model.
27 Replies
>>737138529
31b is insanely good for a local model but it doesn't stack up to 300b+ cloud models, of course
26b4a needs abliteration for lolishit because moes are better at refusing but it's fast and light enough i'm thinking about running it 24/7 on a second card as an assistant
>>737139337
https://www.reddit.com/r/SillyTavernAI/comments/1roxt1c/freaky_frankimstein_swansong_final_kimi_k25_think/
try this preset
>>737138976
You can use smaller models for sure with 12gb vram. 24b would be pushing it but may work with cpu split. Some of the better dense models like gemma4 31b would probably be very slow unless you get at least 16gb vram or quant it into lobotomy.
If you've got the ram for it you could run mixture-of-experts models like qwen3.5 35ba3b or gemma4 26ba4b. Those aren't as smart as dense models but are much faster when offloaded onto ram.
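the "24b is pushing it on 12gb" math above can be sketched roughly: weight memory is parameter count times bits per weight, plus some slack for KV cache and buffers. this is a back-of-the-envelope estimate only; the ~4.5 bits/weight figure (typical of a Q4_K_M-class GGUF quant) and the flat overhead value are assumptions, and real usage varies with context length and runtime.

```python
def estimate_vram_gb(params_b: float, bits_per_weight: float, overhead_gb: float = 1.5) -> float:
    """Rough VRAM (GB) to hold a model's weights plus a flat overhead.

    params_b: parameter count in billions (e.g. 24 for a 24B model)
    bits_per_weight: effective bits per weight after quantization
                     (~4.5 is a common ballpark for a mid-size quant)
    overhead_gb: assumed flat allowance for KV cache and buffers
    """
    weight_gb = params_b * bits_per_weight / 8  # billions of params * bytes per param
    return weight_gb + overhead_gb

# a 24B model at ~4.5 bits/weight is ~13.5 GB of weights alone,
# already over a 12 GB card before KV cache, hence the cpu split
print(round(estimate_vram_gb(24, 4.5), 1))  # -> 15.0
```

by the same math a 12B model at ~4.5 bits/weight is ~6.8 GB of weights, which is why 12gb cards handle that size comfortably.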