Thread #108553789
File: highlights_g_108550008_1775613484_1.jpg (1.6 MB)
1.6 MB JPG
Pride, Ego, Autism, Edition
Discussion and Development of Local Image and Video Models
Previous: >>108550008
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>>
>>
>mfw Resource news
04/07/2026
>Anima preview3 released
https://huggingface.co/circlestone-labs/Anima#preview3
>FrameFusion Image Interpolation: Compact image interpolation model for generating in-between frames
https://github.com/BurguerJohn/FrameFusion-Model
>An Inside Look at OpenAI and Anthropic’s Finances Ahead of Their IPOs
https://www.wsj.com/tech/ai/openai-anthropic-ipo-finances-04b3cfb9
>PrismML debuts energy-sipping 1-bit LLM in bid to free AI from the cloud
https://www.theregister.com/2026/04/04/prismml_1bit_llm
>ComfyUI Hires Fix Ultra - All in One
https://github.com/ThetaCursed/ComfyUI-HiresFix-Ultra-AllInOne
>ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity
https://github.com/hwang-cs-ime/ATSS
>1.x-Distill: Breaking the Diversity, Quality, and Efficiency Barrier in Distribution Matching Distillation
https://thu-accdiff.github.io/1.x-distill-page
>Your Pre-trained Diffusion Model Secretly Knows Restoration
https://sudraj2002.github.io/yptpage
>Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval
https://github.com/lijun2005/CVPR26-DreamPRVR
>A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens
https://deltatok.github.io
>SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
https://github.com/EasonXiao-888/SpatialEdit
>OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
https://github.com/OpenDCAI/OpenWorldLib
>KupkaProd Cinema Pipeline: Powered by LTX 2.3
https://github.com/Matticusnicholas/KupkaProd-Cinema-Pipeline
>Wan VACE Prep: ComfyUI nodes for video editing workflows
https://github.com/stuttlepress/ComfyUI-Wan-VACE-Prep
04/06/2026
>UNICA: A Unified Neural Framework for Controllable 3D Avatars
https://github.com/zjh21/UNICA
>WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models
https://github.com/SAI-Lab-NYU/WSVD
>>
>mfw Research news
04/07/2026
>Erasure or Erosion? Evaluating Compositional Degradation in Unlearned Text-To-Image Diffusion Models
https://arxiv.org/abs/2604.04575
>Beyond Few-Step Inference: Accelerating Video Diffusion Transformer Model Serving with Inter-Request Caching Reuse
https://arxiv.org/abs/2604.04451
>Training-Free Image Editing with Visual Context Integration and Concept Alignment
https://arxiv.org/abs/2604.04487
>Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning
https://arxiv.org/abs/2604.04746
>ExpressEdit: Fast Editing of Stylized Facial Expressions with Diffusion Models in Photoshop
https://arxiv.org/abs/2604.03448
>Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision
https://hyunsoocha.github.io/vanast
>KiToke: Kernel-based Interval-aware Token Compression for Video Large Language Models
https://arxiv.org/abs/2604.03414
>DiffSparse: Accelerating Diffusion Transformers with Learned Token Sparsity
https://arxiv.org/abs/2604.03674
>Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models
https://arxiv.org/abs/2604.03556
>OP-GRPO: Efficient Off-Policy GRPO for Flow-Matching Models
https://arxiv.org/abs/2604.04142
>AvatarPointillist: AutoRegressive 4D Gaussian Avatarization
https://kumapowerliu.github.io/AvatarPointillist
>The Geometry of Robustness: Optimizing Loss Landscape Curvature and Feature Manifold Alignment for Robust Finetuning of Vision-Language Models
https://arxiv.org/abs/2603.27139
>Beauty in the Eye of AI: Aligning LLMs and Vision Models with Human Aesthetics in Network Visualization
https://arxiv.org/abs/2604.03417
>ITIScore: An Image-to-Text-to-Image Rating Framework for the Image Captioning Ability of MLLMs
https://arxiv.org/abs/2604.03765
>Banana100: Breaking NR-IQA Metrics by 100 Iterative Image Replications with Nano Banana Pro
https://arxiv.org/abs/2604.03400
>>
File: WVI2V_CC_INT_07-04-26-22-29_00001.mp4 (1.8 MB)
1.8 MB MP4
>>
>>
>>
File: 79d019ca62198ae4a19d4d28d8c3db83.jpg (186.5 KB)
186.5 KB JPG
can someone please change the writing on her shirt to debian?
>>
File: 1583345919393.jpg (113.2 KB)
113.2 KB JPG
TDRussell... please... I'm begging you... I want to see oversized insects get absolutely destroyed by futa's with huge cocks. Put it in the dataset, homie.
>>
>>
File: Oh nyo.jpg (61.9 KB)
61.9 KB JPG
>>
File: 20230314_171527.jpg (4.5 KB)
4.5 KB JPG
>>108553868
I only tried several times with Preview2 and all I could get is the futa rubbing up against their abdomen. I'm talking about t2i, i2i works but that's more effort on my part.
>>
>>
File: WVI2V_CC_INT_07-04-26-22-37_00001.mp4 (1.9 MB)
1.9 MB MP4
>>
>>
>>
>>
File: 6F8964EDD68E44059455892C81FFDC4B.jpg (42 KB)
42 KB JPG
>>108553886
Don't judge me, anon, please, my heart can't handle it.
https://files.catbox.moe/wzbh2c.png
>>
>>
File: WVI2V_CC_INT_07-04-26-22-50_00001.mp4 (1.2 MB)
1.2 MB MP4
>>
File: 4752158635.jpg (137.2 KB)
137.2 KB JPG
>>108553925
Alright here you go, anon. If I remember correctly I kept overly specifying things in the prompt because Anima was being stubborn, like turning the insect into an insect boy/girl.
https://files.catbox.moe/uk8mej.jpg
>>
File: WVI2V_CC_INT_07-04-26-22-59_00001.mp4 (858.6 KB)
858.6 KB MP4
>>
>>108553916
https://youtu.be/VliOCZKBM-w?si=si5q3thSrWTAaIkR&t=3
>>
>>
>>108553977
Anon, does catbox not have metadata on images? I specifically uploaded the original and then downloaded it off catbox to make sure it has the original metadata on it. Also I'm retarded and I don't know what the hell a futabox is and googling it gave me nothing.
>Also what do you mean by third time? That's the second link I shared
>>
File: 1766581180856600.jpg (616.5 KB)
616.5 KB JPG
>>
File: deNS_zi_00018_.png (3.6 MB)
3.6 MB PNG
>>
>>
>>
File: ComfyUI_08584_.png (801.1 KB)
801.1 KB PNG
what did anima mean by this
>>
File: this was supposed to be a snickers.png (754.4 KB)
754.4 KB PNG
New version of anima seems alright. Feels just a bit side-gradey. Still want to see that realism lora...
>>
>>
File: 1749391890567071.jpg (35.6 KB)
35.6 KB JPG
>>108554182
what a huge chocolate bar. i've never found one this size in my darn fallen eu...
>>
File: HFTMJnObgAA090R.jfif.jpg (869.1 KB)
869.1 KB JPG
seedance 2.0 beaten by a new model, likely also chinese
>>
>>
>>
File: 1768147362790899.png (1.3 MB)
1.3 MB PNG
Any good Anima 3 upscale workflows?
>>
>>
File: 1767907395674028.png (1.6 MB)
1.6 MB PNG
>>108553858
here u go broski
>>
File: file.png (292.9 KB)
292.9 KB PNG
Techlet here, I know more than your average techlet but i'm vastly out of my depth compared to even a dumb /g/ anon. I've got a 2080 Super and my gens are seizing at 50% moving at a glacial speed with a rampaging 95% gpu workload going on in task manager. Is it possible to input some sort of command to break the workload up into smaller resource dumps, or dick down my CPU a bit to try and peel some off of my GPU? I'm 99% sure it is based on my available VRAM that is causing this but I can be totally blind, retarded, and wrong.
>>
>>
>>
File: HappyHorse.png (201.3 KB)
201.3 KB PNG
>>108554249
That name... xD
>>
>>
File: Anima_01706_.png (1.1 MB)
1.1 MB PNG
>>>/vg/562917381
>>>/co/153133847
I am not a discord fudder or thread schizo and I love this model but I am disappointed and worried about anima. I compared roughly 40 images on preview 2 vs 3:
Potential for prompt adherence is higher (could do some prompts 2 struggled a lot to do), but it's still inconsistent, perhaps even more so than before.
Backgrounds are noticeably worse sometimes, I would guess on average too.
Hands are still seed lottery despite more high-res training.
Text is more or less the same, maybe slightly better but still inconsistent seed lottery.
It's struggling to do very common characters it could easily do in the past previews consistently, that might be the single most worrisome thing about it.
I never trained a large scale finetune before so I am unqualified for armchair engineering here but my amateur lora trainer impression is if I am being optimistic it's the weird halfway epoch before the lora churns into the optimal place. And if I am being pessimistic it's getting fried already.
Maybe prompting or sampler meta changed with new release? (Using cfg circa 5, euler ancestral/er sde simple, 30-40 steps, as tdrussell says, I really doubt this is the cause though. Tried different kinds of prompts too.)
I didn't notice any uptick in <100 instance artist prompt adherence despite what Tdrussell says, though I haven't tested that extensively.
Photo-realistic gens are maybe a bit better, but memes aside still feels practically unusable for this purpose without a lora of some sort.
>>
File: 1751824648004360.jpg (659.7 KB)
659.7 KB JPG
>>
File: 85686358389.jpg (1.2 MB)
1.2 MB JPG
>>
>>
>>
File: we would be so fucking back.png (957.2 KB)
957.2 KB PNG
>>108554249
>It's LTX 2.4 and will be open sourced tomorrow
just imagine
>>
>>108554249
https://xcancel.com/AngryTomtweets/status/2041640342764843097#m
>metallic sound
that alone will not make the model better than seedance 2.0 lol
>>
are you demoralized by the current state of local image generation? just finished burning through tons of credits testing recraft v4 pro and I'm speechless at the results. The model is still censored to a small degree, with some generation rejections, but holy shit.
https://files.catbox.moe/yg9m7m.png
https://files.catbox.moe/0zlujr.png
https://files.catbox.moe/avl42o.png
https://files.catbox.moe/fqgb4x.png
https://files.catbox.moe/863rm4.png
https://files.catbox.moe/aux9cj.png
https://files.catbox.moe/1yszq8.png
https://files.catbox.moe/ttbxkn.png
https://files.catbox.moe/bt73u2.png
https://files.catbox.moe/wc402a.png
https://files.catbox.moe/lmwtm4.png
https://files.catbox.moe/4wc6ug.png
https://files.catbox.moe/ptw1ny.png
https://files.catbox.moe/wbd3nl.png
https://files.catbox.moe/873kp6.png
https://files.catbox.moe/nkhia0.png
https://files.catbox.moe/wi5hl4.png
https://files.catbox.moe/pd6g9b.png
https://files.catbox.moe/h279o0.png
>>
File: it's so over.png (176.1 KB)
176.1 KB PNG
>>108554832
>how does it feel to see a set of API models that are way less cucked than the local models we currently have
it feels bad anon, what else can I say?
>>
>>108554769
Less than 0.00001% chance of that imo.
Most probably just new Veo version
Too soon and too good for Wan (won't be open source anyway)
Maybe some other dark horse Chinese model (very unlikely but a lot more likely than Israeli FOSS suddenly getting an order of magnitude better and still being released for free)
>>
>>108554806
i hope that's a new minimax hailuo or vidu q model. Fuck bytedance and their greedy cucked and nerfed seedance 2 model. I hope its not veo4 because google is bound to improve on the censorship with the latest gemini models.
>>
File: 1752320850043753.png (442.8 KB)
442.8 KB PNG
>>108554861
>I hope its not veo4 because google is bound to improve on the censorship with the latest gemini models.
google has become quite based recently, gemma 4 is so uncensored and smart it's laughable, I really have some hopes they'll also release an image model locally
>>
>>108554832
it's time to confront that fact that localkeks are here for ideology, not quality. they coped by saying saas was 'censored' but now more and more APIs are emerging with minimal restrictions. the goalposts move, the mask slips. it really is all about being poor and entitled.
local has been abandoned. it was obvious over a year ago. 'based china' was the eternal cope of 2024/2025, and now they've sold out too. but local models were shit long before everyone quit releasing them. midjourney was always more aesthetic, dall-e always had better cultural knowledge. flux was always just budget dall-e 3 with zero styles or characters, just coping with text on signs no different than what emad shilled with sd3.
local will never receive a SOTA image/video model again, it will be endless stagnation. by the time local receives a model as good as nano-banana 1, we'll already be exploring realtime coherent deep-dive virtual worlds with API.
>>
File: 1757693582715351.png (1 MB)
1 MB PNG
https://huggingface.co/jdopensource/JoyAI-Image-Edit
When are you going to implement this you fucking ComfyUi fuckers???
>>
>>108554857
we desperately need new players in the field for local open source image models. Alibaba has abandoned open sourcing image and video generation models, and black forest labs are just absolute safe cucks that continue to poison both their saas and open source models with safety bullshit and synthetic slop. 1mp image generation is a dead end and seedvr2 has its limitations.
>>
>>
>>
>>
>>
>>
>>
File: 3c323d6f1d5b2c39118335be5aafaadd.jpg (1.8 MB)
1.8 MB JPG
The more sophisticated a model gets, the less abstract I can go..
>>
>>
>>
I know this will get lost in between the deranged ramblings of Julien and the raiders, but I am sincerely disappointed in Preview 3 in a way that I wasn't with Preview 2.
My confidence in the model is shaken. I hope the final version turns out alright but with the current trajectory, we can end up with something worse than Preview 2.
And no I am not interested in Mugen/Chenckin/whatever the fuck cliptranny discordtrannies are shilling, fuck off.
I guess this is just a bad year for local.
>>
>>
>>
File: 47cd7367b57f6aec86e9591ea99793de.png (671.4 KB)
671.4 KB PNG
Damn, preview3 has improved lolis quite a lot.
>>108554981
Seed dance 2.
>>
>>
>>
>>
File: cb3187a547a6b6d73b25323d90fa8c68.jpg (518.4 KB)
518.4 KB JPG
>>108555077
Apparently hags grow beards and mustache
>>
>Can't draw Ryu's gi with correct color consistently
>Hallucinated weird brooch into Lapis's outfit in the canoe gen
>Drew her skirt too short in another (And I made dozens of gens of her in Preview 1 + 2 and neither did anything similar)
>Forgot how to do Batgirl's outfit properly
>Spider-Gwen is completely raped
Seriously it's forgetting characters. I posted evidence besides trust me bro fud but no one cares yet it seems. If I can be arsed, I will sample multiple seeds to demonstrate the problem and make a post in the hf thread.
>>
>>
>>
>>108555252
>>108555264
Make sure you include the metadata :-)
>>
>>108555268
>it's so good it can perfectly reproduce the training data
that's bound to happen, the more AI models improve, the more they can do stuff we're asking from them in the first place, which is to reproduce the training data
>>
>>
>>
File: 8563864373472.jpg (1.5 MB)
1.5 MB JPG
>>
File: 1773706478921640.jpg (702.7 KB)
702.7 KB JPG
>tfw no zitslop wife
>>
File: HDP2qjwXQAAjpWr.jpg (127.7 KB)
127.7 KB JPG
>>108555555
HOLY GET
>>
File: WERE SAVED.png (2.5 MB)
2.5 MB PNG
>>108554249
https://xcancel.com/bdsqlsz/status/2041793884146299288
https://happyhorse-ai.com/
>fully open source
>15b
OMG ITS HAPPENING!! OMGOMGOMGOMGOMGOMGOMGOMG
https://www.youtube.com/watch?v=xb2fjZa_L74
>>
File: 1771427149880488.png (1006.6 KB)
1006.6 KB PNG
>>108555676
BIG IF TRUE
>>
>>108555676
>>108555682
Do you think it's something like Z-video?
>>
File: file.png (352.5 KB)
352.5 KB PNG
>>108555676
I hope it's not a fake website anon, you got my hopes up.
>>
File: 1765500092347293.png (656.6 KB)
656.6 KB PNG
>>108555676
https://xcancel.com/bdsqlsz/status/2041809530942845107#m
confirmed open source, we'll get the model in 2 days
>>
>>108555676
>>108555707
>Inb4 100gb so that nobody can run it anyway so paypiggy for the API regardless.
>>
>>
File: ComfyUI_20201.png (3.3 MB)
3.3 MB PNG
>>108554832
Why not choose different dimensions if everything is just going to be centered?
>>108555676
Fingers crossed that it fits in 24GB of VRAM...
>>
File: I won't doubt the chinks ever again.png (565.9 KB)
565.9 KB PNG
>>108555676
I'm sorry for doubting you Alibaba, you are the real goat
>>108555769
it's a 15b model, barely bigger than Wan 2.1
>>
>>108555676
>>108554249
This is not on the same level as Seedance 2.0 (obviously) but it's way better than fucking Wan 2.7, why would Alibaba give us that when they didn't want to give us their worse video models??
https://files.catbox.moe/57lulu.mp4
>>
>>
>>108554562
Your points are valid. The things you mention are realistic. Unfortunately, this is a bad place since it is a general of little relevance and very much a shitpost central. I would put your comment on the Huggingface page or on CivitAI itself.
The intelligence of Anima reached its limit in preview 1. In preview 2 and 3 there were aesthetic improvements, but the comprehension failures continue to be the same.
It is a shame, but at the same time I understand why tdrussell chose such a small model for this.
The use cases of Anima and Illustrious are the same. In terms of scene and object-character relationships, it is still crippled as hell, like low level Photoshop when you reach a certain degree of complexity.
It is a lightweight goon model with an updated architecture. I also do not see it as necessary for it to be a more complex model and the best, since there is no realistic use for an anime model to be that intelligent, because 99 percent of its users are still making single girl cowboy shot or typical NSFW scenes.
>>
File: 1758323415928084.png (350.4 KB)
350.4 KB PNG
>>108555676
>https://happyhorse-ai.com/
>15b
This is just a finetune of daVinci-MagiHuman, right? I'm sure I'm right...
>>
>>
>>
File: 1754101408099756.png (301 KB)
301 KB PNG
>>108555838
maybe it's daVinci-MagiHuman but it can do native 720p this time
https://xcancel.com/bdsqlsz/status/2041811909884965324#m
>>
File: 1771040277091625.png (425.7 KB)
425.7 KB PNG
>>108555846
or maybe not
>>
>>
>>
>>
File: based.png (431.3 KB)
431.3 KB PNG
>>108555676
>Users: "Please Alibaba, release Wan 2.5"
>Alibaba: "How about I give you something even better"
>>
>>
>>108555676
https://files.catbox.moe/cx8cg7.mp4
The audio quality is insane, if this is what we'll get locally then we're back to levels never imagined, holy shit dude
>>
>>
>>
>>
>>
File: 1749645077228879.png (892.7 KB)
892.7 KB PNG
>>108555676
he's not wrong lol
>>
>>
File: Ikr.png (76.7 KB)
76.7 KB PNG
>>108555676
https://xcancel.com/ArtificialAnlys/status/2041591989083500933#m
More examples on this post
>>
>>
>>
>>108556059
>>108555552
>>108555126
>>108555831
Stop posting Anima, old news, if you want to keep shilling or discussing it go to dedicated Anime generals.
Fuck off.
>>
>>
>>108556066
Here is a crazy theory. It's fake, gay and corrupt like anything else in the benchmeme ecosystem?
Like if some company were to approach them and say "We are launching a new model soon. Here is $50k, put us in the top spot of your leaderboard this week to build hype. Ok, thanks bye." why the fuck would they refuse?
>>
>>
>>
Hello, we discuss upcoming tech here. Anima has already been updated, and anons are discussing it in their respective anime threads. Once Anima is updated, there’s no point in discussing it here. If you don’t know how to read the room, let me say it for you.
>>
>>108556151
there's some examples where it's talking on twitter
https://xcancel.com/Ricardojiang888/status/2041794075779854530#m
>>
File: rug pull?.png (395.7 KB)
395.7 KB PNG
>>108555676
https://xcancel.com/EtherCoins/status/2041831371895927068#m
lmaooo
>>
>>108555707
>>108556185
>https://xcancel.com/bdsqlsz/status/2041809530942845107#m
HOLY SHIT HE REMOVED THE TWEET SAYING THAT IT'LL BE OPEN SOURCED, LMAOOOOOOO
>>
File: Screenshot 2026-04-08 121346.jpg (48.4 KB)
48.4 KB JPG
>>108555676
are you fucking kidding me
>>
>>
>>
File: image.png (511.4 KB)
511.4 KB PNG
>>108556198
fuck off
>>
>>
>>
File: I wonder who's fudding Anima that hard...png (139.6 KB)
139.6 KB PNG
>>108556155
You don't get to decide what should be discussed here, go fuck yourself.
>>
>>108556191
https://xcancel.com/bdsqlsz/status/2041805114894381334#m
time to remove that one too anime man
>>
>>
File: sad.png (281.2 KB)
281.2 KB PNG
>>108555676
It's over...
>>
>>108556249
Hey, I see you’re the only anime poster left. Just so you know, there are dedicated anime diffusion threads where you can talk about anime with others who share your interests. This isn’t the right place or context to keep discussing a model that has already been updated.
>>
>>
>>
>>
>>
>>
>>
>>108556268
https://github.com/brooks376/Happy-Horse-1.0
sus
>>
>>
>>
>>108556315
>keep posting it
this, make this retard >>108556330 angrier I want to see some melties
>>
>>108555778
holy based jenner
requesting permission to repost sexo jennies to /r/Realistic Parody AI. Your jennies mog everything they're genning over there
>>
File: Anima3_00048_.png (1.3 MB)
1.3 MB PNG
Anima is great
>>
>>
>>
File: ComfyUI_19716.png (2.2 MB)
2.2 MB PNG
>>108556336
Go for it. Once they're on your computer you can do whatever you want with 'em.
>>
>>
>>108556325
> Repository status: The model weights and inference code are marked "coming soon." This README documents the announced architecture, training, and benchmark results. Star and watch this repo to be notified the moment the weights are published. In the meantime, you can try the model and read the latest updates on happyhorses.io.
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: anifart.jpg (1.7 MB)
1.7 MB JPG
>>108556388
>a general who has 2 users samfagging?
Anifart is here, but who's the 2nd user?
>>
>>
>>
>>
>>
>>108556268
>>108555676
now that it's been revealed to be fake, what model is it then? A new Kling? Veo 4?
>>
>>
>>108556545
when was it revealed to be fake?
saas models aren't worth the risk after the seedance 2.0 cease and desist.
make a godly open model and sell server compute, the end result is the same and they avoid getting shutdown by hollywood.
>>
>>
>>
>>108556580
>when was it revealed to be fake?
here, he based his open source prediction >>108556268 on this completely sus github >>108556325
that guy is also talking about DeepseekV4 (that doesn't exist yet), 99% a scam
https://github.com/brooks376/DeepSeek-V4-AI-Coding-Assistant
>>
>>
>>
>>
>>
File: AAAAAAA.png (58.4 KB)
58.4 KB PNG
>>108555780
>I'm sorry for doubting you Alibaba
>>108556268
Well... FUCK YOU CHINA
>>
>>
>>
>>
>>
>>108556860
>What is Happy Horse 1.0?
>Happy Horse 1.0 is a 15B-parameter open-source AI video generation model that jointly produces video and synchronized audio from text or image prompts.
whats next, complain about 15B params, or open source bad?
>>
>>108556291
YOU ARE MENTALLY ILL. YOU SHOULD REFER YOU ARE SELF TO A MENTAL HEALTH INPATIENT FACILITY. YOU HAVE A BAD KIND OF RETARDATION WITH RETARDS TO YOUR ARE BRAIN. aLSO THIS IS AN ANIME WEBSITE. n-WORD N-WORD N-WORD
i even think anima is plastic slop too but i want them to post it more. I want them to post chungus 'astronaut riding horse on moon holding a sign that says "text conprehenssion" [misspelt on purpose] with a rabbit in the top left, a dog in the top right and three chibi anime girls are watching also' adg slop images here because it owns you to do that
>>108556345
He has a backup supply of CHUD energy and im afraid not an insignificant amount. He'll be going for days
>>
>>
>>108556887
>believing a fake website
ngmi >>108556604
>>
>>
>>
File: FxAzX-SX0AAZ-DV.jpg (9.3 KB)
9.3 KB JPG
>https://red.anthropic.com/2026/mythos-preview/
>~1000 open source repos tested
>frontier model discovered 595 basic tier bugs and dozens of severe bugs including 0days.
>>
>>
File: ComfyUI_08619_.png (910 KB)
910 KB PNG
>>
>>
File: mogaokekked.png (39 KB)
39 KB PNG
>>108555676
Are there genuine, ironic brownoids in this thread who don't realize this is a fake pop-up site just like what happened with mogao (which turned out to be seedream)? I'll be sure to include all the "APIKEKS BTFO" posting in the screencap".
>>108555780
>>108555874
>>108555901
Local is an ABSOLUTE EMBARRASSMENT
>>
File: 260408-174433 Svi 00001(1).mp4 (3.3 MB)
3.3 MB MP4
What's a good wan lora for quick cut to sex?
>>
>>
>>
>>
File: _AnimaPreview3_00003_.jpg (409.9 KB)
409.9 KB JPG
>>
>>
>>
>>
>>
File: dammit.png (152.3 KB)
152.3 KB PNG
>>108556951
Sorry anon, the hopium dose got the best of me
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108556949
Why is this so hard to believe? The domain isn't even anything that would lend it credibility, "happyhorse-ai.com". There are about a dozen domain names they probably would have gone for before that if it was real
>>
>>
>>
>>
>>108557082
it doesn't take a genius to realize these companies are going to move towards open models to maintain plausible deniability and just sell server compute.
in the last 4 months we saw the world's richest man have his AI lobotomized, and a 500 billion dollar company brought to its knees with a cease and desist letter.
>>
>>108557158
>in the last 4 months we saw the world's richest man have his AI lobotomized, and a 500 billion dollar company brought to its knees with a cease and desist letter.
that's the thing, if they have that much power, they could destroy the life of anyone willing to release a model that's too powerful into the wild (local)
>>
>>
>>
>>
File: e67550287712557c7f92f4c4d97d5892.png (15.4 KB)
15.4 KB PNG
Is there a plugin or a node that lets you make these notes of text very minimalistic?
I plan on having a lot of them.
>>
>>108557188
Just use promptcat https://github.com/sevenreasons/promptcat or any other prompt storage tool. Or just fucking plain text file
>>
File: ComfyUI_temp_sjuir_00021_.png (1.9 MB)
1.9 MB PNG
>>108557188
can't get any more minimalistic than label node (rgthree).
>>
>>
>>108557182
>we can make a lot of money selling server time for our model
vs
>we can get sued by disney and then take our model offline
open models let companies exist in a legal grey area, it has nothing to do with "local".
>>
>>108557201
That's not userfriendly for multiple users.
>>108557206
That's almost perfect, but you can't select to copy paste very easily.
>>
File: cope-maxxing.png (177.7 KB)
177.7 KB PNG
>>108556268
Hmpff, it wasn't that good anyway!
>>
File: _AnimaPreview3_00044_.jpg (403.2 KB)
403.2 KB JPG
>>
File: 1766570287465007.jpg (720.3 KB)
720.3 KB JPG
>>108556596
1 more 4 u
>>
>>
File: _AnimaPreview3_00054_.jpg (296.4 KB)
296.4 KB JPG
>>
>>
>>
>>
>>
>>108557050
samples look really bad and overbaked. don't fall for these shit mixes anon, they are just glorified merged loras. those people don't have the resources to train a finetune; they just train a lora on a small dataset and merge it into the base model. they are scamming you anon, you can do so much better, train your own stuff
>>
>>
File: Z-image-Klein_00929_.png (828 KB)
828 KB PNG
nyaa
[spoiler]ignore the slopped paw[/spoiler]
>>
>>
>>
File: 38057800387796390787934.png (102.6 KB)
102.6 KB PNG
>>108557568
Numbers look fine.
>>
File: 1617234637622.jpg (93.3 KB)
93.3 KB JPG
>>108557533
i don't think it's wise to ignore ltx. new loras appear regularly. alibaba has no choice but to attract attention, or risk losing the local market permanently
>>
>>
>>
>>
>>108557610
that's how they are ranked
>Models are ranked using an Elo rating system derived from user votes in blind comparisons. Users compare videos generated from the same input image and choose the result they prefer. Higher Elo scores indicate a model is preferred more often.
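For anyone wondering what "Elo from blind votes" means in practice, here is a minimal sketch of the textbook Elo update applied to a list of pairwise votes. The starting rating of 1000, the K-factor of 32 and the function names are assumptions for illustration; the leaderboard does not say which constants or fitting method it actually uses.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A is preferred over B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, a_won: bool, k: float = 32.0):
    """Return the new (rating_a, rating_b) after one blind comparison."""
    delta = k * ((1.0 if a_won else 0.0) - expected_score(rating_a, rating_b))
    # Whatever A gains, B loses, so the total rating pool stays constant.
    return rating_a + delta, rating_b - delta


def rank_from_votes(votes, start: float = 1000.0, k: float = 32.0) -> dict:
    """votes is an iterable of (winner, loser) model names, processed in order."""
    ratings = {}
    for winner, loser in votes:
        r_w = ratings.get(winner, start)
        r_l = ratings.get(loser, start)
        ratings[winner], ratings[loser] = update_elo(r_w, r_l, True, k)
    return ratings


if __name__ == "__main__":
    # Hypothetical votes, not real leaderboard data.
    sample = [("model_x", "model_y"), ("model_x", "model_z"), ("model_z", "model_y")]
    for name, rating in sorted(rank_from_votes(sample).items(), key=lambda kv: -kv[1]):
        print(f"{name}: {rating:.1f}")
```

Note that sequential Elo depends on vote order and needs a lot of votes to settle, so a freshly added model can sit at the top on a small sample.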
>>108557623
you should relax, this might actually be a closed api model. you don't want to be in this thread in two days hyping up the happy horse api.
>>
>>108557640
>>108557623
he did the meme lol
https://www.youtube.com/watch?v=nsNrwHA6Big
>>
>>
>>
>>108555676
>A youtuber specialized on API models talks about that model
it's over... it definitely won't be local
https://www.youtube.com/watch?v=mmk9C6bkV_c
>>
>>
>>
>>
>>
>>108557712
good luck sending a cease and desist to an open model.
>reee someone made tom cruise!
>idk probably a lora or whatever, take it up with big_booty_genner69
defeat jews with this one simple trick
>>
File: anima3preview_.png (1.1 MB)
1.1 MB PNG
>>
File: deMS_zi_00027_.png (3.9 MB)
3.9 MB PNG
>>
File: ComfyUI_158183_.png (3.8 MB)
3.8 MB PNG
>>
File: anima3preview_.png (2.9 MB)
2.9 MB PNG
>>
File: anima3preview_.png (2.5 MB)
2.5 MB PNG
>>
>>
File: 00004-4059277889.png (2.2 MB)
2.2 MB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>108557876
>what is this shit
Happyhorse, your Seedance 2.0 """killer"""
>>108557890
>a brown person having a meltdown over a new model.
you're talking about ani seething about Anima every single day?
>>
>>
>>108557811
For standard 1024px single-image generation on a mid-to-high-end GPU, you're right — it's a modest win. The real value is in: (a) high-res workflows where decode VRAM becomes the binding constraint, (b) batch/iterative pipelines where decode overhead compounds, and (c) tightly-constrained hardware where every GB matters and you'd otherwise fall back to quality-degrading tiling. It's essentially VRAM headroom you can redeploy toward higher resolution or larger batch sizes, which is genuinely useful in production scenarios.
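A rough back-of-the-envelope for why decode memory starts to bite at high resolution: the size of a single full-resolution activation tensor grows with the pixel count. The channel count of 128 and fp16 storage below are assumptions picked for illustration, not measurements of any particular VAE, and a real decoder holds several such tensors at once.

```python
def activation_bytes(height: int, width: int, channels: int, bytes_per_value: int = 2) -> int:
    """Size of one H x W x C activation tensor (fp16 by default)."""
    return height * width * channels * bytes_per_value


def activation_mib(height: int, width: int, channels: int, bytes_per_value: int = 2) -> float:
    """Same size expressed in MiB."""
    return activation_bytes(height, width, channels, bytes_per_value) / (1024 ** 2)


if __name__ == "__main__":
    # Assumed 128 channels at full output resolution.
    for side in (1024, 2048, 4096):
        print(f"{side}x{side}: {activation_mib(side, side, 128):.0f} MiB per tensor")
```

Doubling the side length quadruples the per-tensor cost (256 MiB at 1024px, 1024 MiB at 2048px under these assumptions), which is why the gain looks modest at 1024px and matters more once you go higher.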
>>
>>
>>
>>108557890
>a brown person having a meltdown over a new model.
exhibit A -> >>108557905
>>
>>
>>
>>
>>
>>
File: 00007-3140823318.jpg (1.5 MB)
1.5 MB JPG
>>
File: anima3preview_.png (1.5 MB)
1.5 MB PNG
>>
>>
>>
Fresh when ready
>>108557992
>>108557992
>>108557992
>>108557992
Fresh when ready
>>
File: anima3preview_.png (2.1 MB)
2.1 MB PNG
>>
I am new to local llm/diffusion, I am mostly finding comfyui workflows in random places and playing with them.
I stumbled on this and was wondering if anyone had any other neat workflows.
https://weirdwonderfulai.art/comfyui-workflow/qwen-edit-2509-multiple-camera-angle-lora/
Is there a common repository everyone is using for workflows? Or where does everyone find them?
>>
>>
>>108558274
civitai has random workflows, and most custom nodes you download will have an "examples" folder in the install directory. any site that doesn't strip metadata can/will have the workflow embedded into the image, so you can drag and drop the image into comfy and it will open the workflow.
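If you want to check whether a downloaded PNG still carries its workflow before dragging it into comfy, a stdlib-only sketch like the one below works. It assumes the workflow is stored as JSON in an uncompressed PNG `tEXt` chunk under the keyword `workflow`, which is how ComfyUI's default save node writes it as far as I know. Compressed `zTXt`/`iTXt` chunks and JPG/WebP files are not handled, and the function names are made up for this example.

```python
import json
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return {keyword: text} for every uncompressed tEXt chunk in a PNG."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        pos += 12 + length  # 4 length + 4 type + body + 4 CRC
        if ctype == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            chunks[keyword.decode("latin-1")] = text.decode("latin-1")
        elif ctype == b"IEND":
            break
    return chunks


def extract_workflow(data: bytes):
    """Return the embedded workflow as a dict, or None if it was stripped."""
    raw = read_png_text_chunks(data).get("workflow")
    return json.loads(raw) if raw is not None else None


def make_chunk(ctype: bytes, body: bytes) -> bytes:
    """Build one PNG chunk; only used here to fabricate a tiny test input."""
    crc = zlib.crc32(ctype + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + ctype + body + struct.pack(">I", crc)


if __name__ == "__main__":
    import sys
    with open(sys.argv[1], "rb") as fh:
        workflow = extract_workflow(fh.read())
    print("workflow found" if workflow is not None else "no workflow metadata")
```

If it prints "no workflow metadata", the host or a re-encode stripped the chunk (or the file was never a ComfyUI PNG) and drag and drop will not restore anything.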
>>
>>
>>
File: 00009-4142179651.jpg (1.3 MB)
1.3 MB JPG
>>
>>
>>
>>108558319
Ah nice, I will look at civitai again, I didn't realize they included examples.
>>108558324
Thanks! I somehow missed this site.
>>108558356
Have you made any neat workflows that do something novel (like the changing a camera angle example)?
>>
fresh when ready
>>108558395
>>108558395
>>108558395
>>