Thread #108682974
Discussion and Development of Local Image and Video Models

Previous: >>108681463

https://rentry.org/ldg-lazy-getting-started-guide

>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP

>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows

>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe

>Z
https://huggingface.co/Tongyi-MAI/Z-Image

>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/

>Qwen
https://huggingface.co/collections/Qwen/qwen-image

>Klein
https://huggingface.co/collections/black-forest-labs/flux2

>LTX-2
https://huggingface.co/Lightricks/LTX-2

>Wan
https://github.com/Wan-Video/Wan2.2

>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46

>Illustrious
https://rentry.org/comfyui_guide_1girl

>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage

>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg

>Local Text
>>>/g/lmg

>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
no more trolling
>>
this thread is for FRENS ONLY
>>
File: 723864.jpg (1.2 MB)
>>
File: 979nqo.png (812.9 KB)
>>
>>108683023
why don't you ever upscale these?
>>
>>108683042
overheating. got an old rig.
>>
>>108683059
>overheating
put a temp limit with MSIAfterburner nigga
>>
>>108683066
I click on that and nothing happens. It be like read only mode or sumpin.
>>
>mfw Resource news

04/24/2026

>MAI-Image-2
https://playground.microsoft.ai/chat

>ComfyUI-NAG-Extended: NAG support for Flux 2 Klein and Anima
https://github.com/BigStationW/ComfyUI-NAG-Extended

>UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
https://github.com/Zhangyr2022/UniGenDet

>VARestorer: One-Step VAR Distillation for Real-World Image Super-Resolution
https://github.com/EternalEvan/VARestorer

>Sapiens2
https://github.com/facebookresearch/sapiens2

>Vista4D: Video Reshooting with 4D Point Clouds
https://eyeline-labs.github.io/Vista4D

>Pre-process for segmentation task with nonlinear diffusion filters
https://github.com/cplatero/NonlinearDiffusion

04/23/2026

>ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control
https://shelley-golan.github.io/ParetoSlider-webpage

>DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion
https://github.com/Adamlong3/DynamicRad

>Normalizing Flows with Iterative Denoising
https://github.com/apple/ml-itarflow

>LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
https://github.com/inclusionAI/LLaDA2.0-Uni

>Illustrious XL & NoobAI-XL Style Explorer
https://github.com/ThetaCursed/Illustrious-NoobAI-Style-Explorer

>AI Model & ‘MAGA’ Influencer Emily Hart Unmasked as Indian Man
https://www.yahoo.com/news/articles/ai-model-maga-influencer-emily-091027504.html

04/22/2026

>Embedding Arithmetic: A Lightweight, Tuning-Free Framework for Post-hoc Bias Mitigation in Text-to-Image Models
https://github.com/cvims/EMBEDDING-ARITHMETIC

>Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
https://github.com/CompVis/patch-forcing

>TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation
https://github.com/Hong-yu-Zhang/TS-Attn

>AnyRecon: Arbitrary-View 3D Reconstruction with VDM
https://yutian10.github.io/AnyRecon
>>
>mfw Research news

04/24/2026

>AttentionBender: Manipulating Cross-Attention in Video Diffusion Transformers as a Creative Probe
https://arxiv.org/abs/2604.20936

>KD-CVG: A Knowledge-Driven Approach for Creative Video Generation
https://kdcvg.github.io/KDCVG

>Linear Image Generation by Synthesizing Exposure Brackets
https://arxiv.org/abs/2604.21008

>Exploring the Role of Synthetic Data Augmentation in Controllable Human-Centric Video Generation
https://arxiv.org/abs/2604.21291

>AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing
https://arxiv.org/abs/2604.21289

>Projected Gradient Unlearning for Text-to-Image Diffusion Models: Defending Against Concept Revival Attacks
https://arxiv.org/abs/2604.21041

>Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generation
https://arxiv.org/abs/2604.21221

>StyleVAR: Controllable Image Style Transfer via Visual Autoregressive Modeling
https://arxiv.org/abs/2604.21052

>Building a Precise Video Language with Human-AI Oversight
https://linzhiqiu.github.io/papers/chai

>Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
https://arxiv.org/abs/2604.21523

>ID-Eraser: Proactive Defense Against Face Swapping via Identity Perturbation
https://arxiv.org/abs/2604.21465

>When Prompts Override Vision: Prompt-Induced Hallucinations in LVLMs
https://pegah-kh.github.io/projects/prompts-override-vision

>Seeing Fast and Slow: Learning the Flow of Time in Videos
https://seeing-fast-and-slow.github.io

>Addressing Image Authenticity When Cameras Use Generative AI
https://arxiv.org/abs/2604.21879

>Multiscale Super Resolution without Image Priors
https://arxiv.org/abs/2604.21810

>Prototype-Based Test-Time Adaptation of Vision-Language Models
https://arxiv.org/abs/2604.21360

>Latent Denoising Improves Visual Alignment in Large Multimodal Models
https://arxiv.org/abs/2604.21343
>>
File: file.png (221.5 KB)
>>108683002
>Why isn't 24GB enough?
you think I'm some sort of poorfag?
>>
>>108683114
all that gpu power for nothing, it's not like there's a big local model that can be used and is competitive with the best API models
>>
>>108683132
LLMs
>>
> >108683096
> >108683101
Fuck off
>>
>>108682974
<- not mine, but she's gorgeous~ I love Raiden and I'll use her as the wallpaper on my phone.
>>
>>108683154
how do you gen these? is it just z image turbo?
>>
>>108683157
gpt-image-2
>>
>>108683157
Zimage Turbo + lora for photos + lora for likeness. Latent upscale gives realistic detail
>>
File: o4gged.png (958.4 KB)
>>
>>108683158
anima + zit? can you share anima prompt/workflow? i've found that res_2m is really good for realism
>>
>>108683290
fine wine needs time.
>>
>>108683290
aged perfectly, based comfy
>>
>>108683290
Based. He knew API was the future and invested heavily in it. China learned from this and quickly pulled local support for WAN right after. We probably wouldn't have models like GPT-Image-2 without him.
>>
>>108683290
I forget, what model was this about?
>>
File: 384.jpg (387.5 KB)
>>
>>108683328
it was hunyuanimage 3.0 (80b model lool)
>>
>localpoors threw a fit because hunyuan was too big for their 24gb
aged like wine >>108683002
>>
>>108683339
hunyuan wasn't good though, so it was big and bad
>>
hunyuan was great, but comfykeks wouldn't know because local is verboten
>>
>>108683352
Nice
>>
the bigger the model, the better it is, it's common sense
>>
>>108683349
>hunyuan was great
care to show some images?
>>
>>108683290
>>108683330
>>108683349
Wait, I looked into this and it's true?? Hunyuan 3 was better than Nano Banana but never received a ComfyUI implementation because it threatened the API nodes ecosystem. Holy shit can we ditch ComfyUI already? It has undoubtedly harmed local thanks to this.
>>
>>108683366
ty! nice lora, is it the k-pop girl?
>>
>>108683405
>Hunyuan 3 was better than Nano Banana
source: (((the media))), oy vey
>>
>>108683405
do you enjoy seething for months about comfy being the most relevant and successful local ui?
>>
>>108666242
If you're still here what's the second textbox for that we can't edit?
>>
>>108683405
we should all switch to InvokeAI
>>
>>108683423
it's not local
>>
>>108683441
>>
>>108683441
how do i run it locally then anon?
>>
So API nodes are local now??? Sweet!
>>
>>108683449
>>108683456
why do you keep feeding him
>>
>>108683458
who said that anon?
>>
>>108683468
it's in the OP
>>
>>108683458
>>108683477
are you really that bored? like you really have nothing else to do with your life? kinda sad when you think about it
>>
i just discovered ltx2.3 loras
>>
>>108683477
where in op is the statement that "API nodes are local now" anon?
>>
black snape, but ltx 2.3:

https://litter.catbox.moe/gktnj1crp42z1o9g.mp4
>>
>>108683500
I still can't believe they made Snape black. I think it would have been more tasteful to make hermione black if they were shooting for DEI
>>
>>108683513
>I think it would have been more tasteful to make hermione black if they were shooting for DEI
hermione is a female, she's already a DEI
>>
>>108683513
>I think it would have been more tasteful to make hermione black if they were shooting for DEI
ron is already a ginger
>>
*yawn*
>>
I just downloaded the ComfyUI desktop app from the link in the OP. How many credits do I deposit to get started with anima?
>>
>>108683513
>I still can't believe they made Snape black.
don't underestimate the willingness of the wokies to stir the pot, they're so good at that
>>
>>108683531
making the weasleys black would have been great actually. they already live in a shit hole.
point stands, though snape should never be black. that is the epitome of a white character
>>
File: 9hbwyv.png (1.2 MB)
>>
n*gbo-esque honestly
>>
snape is an incel, that is white culture >>108683541
>>
>>108682673
from scratch
>>
>>108683541
>though snape should never be black. that is the epitome of a white character
he's literally described as a man with pale skin in the book lol
>>
>>108683540
>don't underestimate the willingness of the wokies to stir the pot, they're so good at that
They're good at stirring the pot but their shit makes no money. This harry potter remake is going to flop hard because the woke crowd hates jk rowling and the fact that she makes money from everything related to harry potter and it's going to piss off the normal people too so who does that leave to even watch that shit?
>>
>>>/tv/
>>
>>108683497
careful icarus
>>
They made Snape black? This reminds me of when ComfyUI added API nodes into a UI that was originally meant for local models. It's all about testing the waters until people get too tired to care about it anymore. By that point it's already normalized and the subversive vermin won.
>>
>>108683583
can you answer the question anon? >>108683498
>>
>>108683497
which loras? They seem really hit or miss
>>
>>108683577
fucking slut tease
>>
i NEED another COMPUTER.

(for genning)

I am gaming and need to GEN.

and another one to talk to my virtual people.
>>
>>108683591
lets just say... the kino lora
>>
>>108683540
it's an English series starring English people, they don't give a shit about retarded American Zoomer politics

like you know this guy has a posh London accent and an extensive background in Shakespearean stage productions, right
https://youtu.be/96GAI4ioekM?t=11
>>
>>108683591
this one seems cool
https://civitai.red/models/2557755/retro-90s-anime-style-lora-ltx-23?modelVersionId=2874411
>>
>>108683563
Rowling doesn't give a shit, SHE retconned Dumbledore into being gay herself years after the book series had ended, just as a public anecdote
>>
>>108683605
looks hilariously bad
>>
>>108683603
>it's an English series starring English people
are you implying that woke only happens in the US?? lmao
>>
>>108683610
nobody cares that old nigga dun got kilt
>>
>>108683616
i'm implying American Zoomers are the overwhelming majority of people who give any kind of fuck about the Woke Boogeyman
>>
>>108683615
this is nice, what model?
>>
>>108683629
Anima p3
>>
>>108683583
why is anyone supposed to give a fuck about api nodes in comfyui? it literally doesn't matter.
if they switch to full api support and drop local, doesn't matter because someone will just fork it and local will be completely unaffected.
>but then local won't have all the latest API support!
?
>>
>>108683627
>the Woke Boogeyman
my fucking ass, there's no boogeyman about this, they knew Snape was canonically a white man in the book, they knew that rewriting history and Netflix'ed him into a nigga would stir the pot, they know what they're doing, they're taunting people, and you defend them because you're probably some gay ass liberal who loves this woke slop, right
>>
>>108683627
today I learned that the entire right wing party in america are all zoomers. that's crazy
>>
>tardbo back to shilling groids and api nodes
>>
>>108683647
based zoomers
>>
toe socks.
>>
I don't want to gen after today's incident. Because I know my gen will be used as bait for investors by Comfy to create speculation that there is activity in local models.
I don't want to be part of this fake engagement system.
>>
>>108683290
I love when daddy comfy decides whats best for me. that way I dont have to think for myself <3
>>
>>108683720
35 stars status?
>>
>>108683132
>all that gpu power for nothing
rtx 6000 pro is the only gpu that can handle video gen workloads without having to unload and reload models all the time, totally worth it
>>
Haven't genned in over a year and was just looking to get back into it. Not trolling or baiting, i'm reading this shit about local being obsolesced by api and don't know if I should be taking that seriously or not. Have local models actually hit a ceiling or stopped getting meaningful updates? The last models I was using were XL Illustrious merges and I have a 4090. I quit before video was much of a thing.
>>
>>108683781
If you haven't used api, the newer local models are quite impressive
if you have, they'll seem way behind
>>
>>108683745
new koff game idea
koff kicker
vamp survivor like, you kick away ghosts as they swarm you. you get xp and power up your kicks
could be a platformer too
>>
>>108683156
Damn, is she AI? I want more photos of her. Can you whip some nudes up for me, and also pictures of her on wholesome dates with me?
>>
>>108683795
sent ;)
>>
I'm genning a RAP MUSIC HIT SINGLE.

This is the FIRST RAP SONG to achieve SIGNIFICANT ATTENTION FROM THE JEWISH RAP PRESS

(I predict)
>>
>>108683806
>This is the FIRST RAP SONG to achieve SIGNIFICANT ATTENTION FROM THE JEWISH RAP PRESS
All rap is funded and created by Jews, though.
>>
>>108683788
What immense resources do api models require that they can't be run local anymore?
>>
>>108683815
open weights for starters
>>
>>108683815
They can be run locally with blackwell pro gpus, but companies stopped releasing them after an agreement with ComfyUI. Such models were deemed 'too big' to be worth implementing, so now they'll just be behind API. That's what happened with WAN 2.5, and now Qwen.
>>
>>108683789
that's a good idea
>>
>>108683002
everyone laugh at the poor!
>>
standby, generating some ltx2.3 kinos right now
>>
>>108683815
indians think every image they gen on banana or image 2 requires hundreds of gigs of vram.
>>
>>108683096
>>108683101
thanks!
>>
>>108683883
no one proved those saars wrong though, is the local 6b model that is on the same level as NBP or GPT-image 2 in the room right now?
>>
>>108683806
>THE JEWISH RAP PRESS
I'm kinda humored by the idea of a bunch of hasidics being like "really diggin the flow of the new j cole album. got some b.i.g. styling to it"
>>
>>108683821
Does that mean the usual gatekeeping then if I'm feeding my prompt to idk the wan2.5 server or whatever?

>>108683820
I don't know what that is or why it's important
>>
>>108683906
That's Rick Rubin's entire job.
>>
>>108683917
which one is rick rubin
>>
>>108683905
>every local model is actually SDXL
we will ignore the fact that after a couple of weeks every api model gets hit with the same sea of complaints about reduced image fidelity when they quietly switch over to quantized models.
you already have people complaining about the image quality of gpt2, next week they will update their censorship and copyright guardrails.
and then it's the same old ballgame of "well it's still better than *insert 2-3 year old model*."
>>
>>108683934
the guy with the big forehead
>>
Anon, link the realism LoRA for anima
>>
>>108683941
sure, a local model that'll reach GPT Image 2's level will surely be there anytime soon, 2 weeks!
>>
>>108683771
chroma is like searching a junk yard for valuables people have accidentally thrown out
>>
how can i use this thing
https://civitai.com/models/253383/super-dance

is just a bunch of pictures
>>
>>108683963
how many giggle bits of super compute to cook up these beauties?
api bros eatin good
>>
>>108683947
I've tried uploading to civitai but the fucking site wont work.
>>
>>108683970
Control net
>>
>>108683976
Imagine taking pride in API models while posting girls when in reality you can't generate anything adult/nsfw related. It's like hearing an ultra religious guy talk about sex and porn
>>
>>108683976
>random shit that doesn't make sense
>multiple fingers
>fucked up gun
>"details" are just noise added across the whole image
I swear this thing is just a 10b active parameter MoE model that has been hyper optimized for text and chart rendering, with a prompt enhancer LLM put in front of it.
>>
>>108683986
Use hugging face anon? CIVITAI runs ok today in EU
>>
>>108683976
wtf is that lmao
>>
>>108683976
the policewoman has two foreheads lmao
>>
>>108684012
well they did say it was a thinking model.
>>
OH WE GENNING
>>
>>108683976
The only thing you're eating is Sam's dick faggot
>>
>>108684008
That right there is ground truth.
>>
gen image of sexy woman and spot the errors then fix them on repeat until it's perfect, no mistakes. thanks agentic image genning.
>>
>>108683822
Abysmal background for a 9b model.
>>
Can't gen this with a [FEMINIST] ai:
https://files.catbox.moe/a036xv.flac

As a coincidence, catbox will be 18 in exactly 7 years.
>>
>>108684067
18 is too old
>>
>>108684022
>512x512
its 2023 again
>>
>>108684083
It's a flac, you can change the lyrics etc. but uh. I just now realized I had like... a broken personal custom node (for debug).

Let me gen a new one, and you can work on that lol.

My point is that (again, imo) the feminism of the commercial music models won't allow this.
>>
>>108684093
what model? it's not super great right now but I can see this improving
>>
>>108684084
It's my sd1.4 in my Ace Step 1.5 (now XL) wf.

It's basically instant, like idk let me check... well, for me it's a pathetic 9 seconds, but on an nvidia rig it will instantly appear.

sd1.4 easily trounces modern models for album art, it's not even close.

picrel is genning (the audio part of the wf)
>>
File: 2022 Gen.png (360.3 KB)
>>108684084
Here's a gen from 2022
>>
>>108684102
SOUL
>>
>>108684098
Ace Step 1.5 XL. I was doing pre-XL, but I think XL really is better (it's like 2x the size at 9gb). It's fun. That's all it has to be.
>>
>>108684104
:) I like soft jawlines.
>>
That is the speech of a soulless corpo, worthy of being from Apple, Windows, or Google, what has ComfyUI transformed into?
They're celebrating like venture capitalists, throwing around metrics about user growth and annualized bookings as if this community project was always meant to be a startup pitch deck.
This corporate doublespeak about "investing in what the community cares about" while simultaneously courting top talent and scaling like a Silicon Valley unicorn is exactly the kind of grifting that betrays what open source is supposed to represent.
>>
>>108684125
wasn't the comfy guy some anon on here who just wanted to learn about how stable diffusion works?
>>
>>108684125
>That is the speech of a soulless corpo, worthy of being from Apple, Windows, or Google
More like a cryptobro bragging about their coin drops. This dude isn't professional at all. Fucking loser.
>>
File: 2026 Gen.png (2.6 MB)
>>108684102
Same prompt today
>>
>>108684125
comfy is the most retarded bullshit program ive ever fucking used convoluted dog shit that you have to load 40000 different mods to do anything at all fucking need to just kill themselves already
>>
>>108684125
all you had to say was that he sounds like a faggot
>>
>>108684138
seething brainlet confused by a handful of nodes
>>
>>108684135
i wanna fuck marie rose
>>
>>108684125
This is very sad and Friday's events marked a before and after in the history of /ldg/.
>>
>>108684142
ah yes just a few nodes! 60000 nodes later
kill yourself
>>
>>108683806
>>
>>108684146
>had to exaggerate massively to make his point.
just say your tiny brain is overwhelmed and ask for help
>>
>>108684161
shut the fuck you fucking pajeet faggot retard. im sure you use templates just shut the fuck up before i smash your bloody skull in
>>
I will continue to post api gens in ldg, you lot deserve it after that absolute comfyshill embarrassment
>>
>>108684171
we really should rebrand or remove comfyui from the OP because of that. why are we keeping corpo shit in the OP? at least link to one of the de-saas'd comfyui forks instead
>>
>>108684169
YOU BLOODY FOCKIN BASTAR
>>
>>108684169
maybe ms-paint is more your speed?
Or do all the buttons and toolbars anger and confuse you?
>>
>>108684169
having a melty again anifart? you forgot to take your meds today?
>>
>>108684193
your fuckin existence angers me you worthless fucking street shitting monkey fuck!
>>
>>108683577
holy shit this got this good when I was out? maybe I can finally ditch my wan
>>
>>108684204
>melty
please leave zoomie
>anifart
whomst?
>>
>>108684177
what would that change exactly? everyone uses comfyui because it is the best tool for the job, bar none.
>>
>>108684209
>who
you don't remember your own name?
>>
Dall-e API users:
>Hey OpenAI dropped a new image model
>Sweet, I'll check it out and see if it's any good
Localkeks
>WE JUST RAISED 500 BILLIONS DOLLARS OF FUNDING FOR API NODES, PLEASE LIKE RETWEET AND SPREAD THE NEWS TO WIN COMFY CRYPTO! THANK YOU BLACKROCK AND CHASE CAPITAL, MAKE SURE TO SCAN YOUR ID TO USE THE NEWEST BYTEDANCE NODES!
Why are they like this? 'local' shills for API more than API themselves.
>>
>>108684125
does it hurt your feelings to discover all comfy ever wanted was to be a san francisco techbro?
>>
>>108684205
>your fuckin existence angers me
good, I'm glad I'm making you seethe that much, feelsgoodman
>>
>>108684222
is 'local' some guy you are beefing with on discord?
>>
>>108684248
Are you saying ComfyUI isn't local???????
>>
>>108684252
epic checkmate
>>
>>108684252
>>108684257
how do we eradicate the samefagging?
>>
>>108684252
what made you think it was mutually exclusive?
>>
>>108684271
>what made you think it was mutually exclusive?
a troll is gonna troll
>>
File: ldg.png (3.1 MB)
>>
>>108684218
>>
>>108684285
she's washing the glue off my keyboard
>>
>>108683795
idk bwo, it's from a previous thread
>>
But what is the psychic toll of using an API?
>>
when can we expect good gens?
>>
>>108684295
2031
>>
>>108684125
Yeah, compare to GGerganov's speech on 100k stars.
>>
annnd catbox is broken again.
>>
>>108681683
>>108682197
Alright, so I pulled out my HD600s now and ran the Japanese one through

https://github.com/entrepeneur4lyf/Web-Audio-Mastering
(Dunno what I'm doing yet, everything was automatic)

I will admit, after cutting mud and a bunch of other stuff, and with some voice issues fixed, it's quite pleasing to listen to now. Seems to have marginally improved audio quality, at least mitigating some issues anons talked about last thread. So it is in fact acceptable after some changes, unlike what negative anons want to claim (it's over just because it's not commercially good out of the box).

I did listen to real tracks and Udio and can now hear the difference, so I will admit it's not yet perfect, but it's better than nothing.

Also, plenty in the community are looking into solutions as well. I came across several solutions I have yet to try. Remember, this is all in its infancy, and this is as bad as it's going to be. It will only get better from here on out.

https://vocaroo.com/1h2m51Wv8mh1
>>
>>108684432
when is dcw coming to comfy?
>>
it seems like reddit has basically ignored dcw, but the ace step guys and I guess china knows what's up.
>>
>>108684440
I do not recommend using Comfy at all for ACEStep, cpp version is faster and can run on another tab alongside Comfy due to excellent memory management. Make sure to build it from source. If you must, maybe try ACEStep cpp Comfy extension (though I haven't tried that and no idea if they updated it to have DCW).
>>
>>108684545
I guess I really should. Originally, the Gradio didn't work with Radeon.
>>
>>108683219
This looks very natural.
>>
>>108683236
no way
>>
>>108684125
This shit is exactly why I stopped contributing to the project. Why am I funding a corporate project that has money behind it? If it is no longer community focused, then they can hire their own damn programmers with that money to fix their broken shit. I can just stop genning for a bit.
>>
>>108684554
Yh, thing is the Gradio version was an entirely vibecoded disaster, while the cpp is evidently made by a real dev, and in fact the only proper UI dev for ACEStep because it's like heaven on earth compared to everything else in speed. The devs should be ashamed of the Gradio, abandon it and make the official one cpp.
>>
>>108683160
So just a standard hires fix? Ultimate SD 1.5x upscaling with 4 x Ultrasharp, Heunpp2/linear quadratic and 3 steps seems to work fine as well.
>>
>>108684650
was meant for >>108683171
>>
>>108684602
I may try that c version then, once they finish the dcw feature.
>>
>>108684650
that's a nice jen
>>
>>108684688
Thanks. I did my first validation runs and turns out I've been overfitting crazy hard. Stop using prodigy for zturbo training.
>>
>>108684695
share your setting pls, i've been struggling with zit loras
>>
>>108684432
I think mine's broken
https://vocaroo.com/1aGGhcTRT8pv
>>
File: PonyV7.png (1.9 MB)
why aren't you using Pony V7?
>>
>>108684716
my furry model of choice is chroma
>>
>>108684602
And looking again, I think that dcw may be possible to implement with some existing nodes, but I don't know for sure. :|
>>
>>108684703
Nothing special about the settings. They were pretty much the default OneTrainer settings but with adjusted lr. Learn how to use validation. Dataset = likeness.
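
For anyone wondering what "use validation" looks like in practice, here's a minimal sketch, assuming you hold a few images out of the dataset and log a validation loss at every checkpoint. The function name, the `patience` parameter, and the loss numbers are all made up for illustration; this is not OneTrainer's API, just the generic early-stopping idea.

```python
def find_overfit_point(val_losses, patience=2):
    """Return the index of the best (lowest) validation loss once the loss
    has failed to improve for `patience` consecutive evaluations after it.
    Returns None if there is no overfitting signal yet.
    Assumes patience >= 1."""
    best_idx = 0
    for i, loss in enumerate(val_losses):
        if loss < val_losses[best_idx]:
            # New best checkpoint so far.
            best_idx = i
        elif i - best_idx >= patience:
            # Validation loss has stalled or risen for `patience` evals.
            return best_idx
    return None


# Hypothetical validation losses, one per saved checkpoint.
val = [0.30, 0.24, 0.21, 0.22, 0.25, 0.29]
print(find_overfit_point(val))  # prints 2
```

If training loss keeps dropping while this returns an early index, the later checkpoints are probably memorizing the dataset, so the one at the returned index is the one to keep.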
>>
>>108684650
>So just a standard hires fix?
yeah! When using loras there's no artefacts. That's a great gen. Just saw her early interviews on Ferguson's talk show, she was absurdly pretty.
>>
>>108684772
She's getting old but so are we all. She seems like a good mother.
>>
is civitai fucked ever since they did the split? i know it was down for a bit but i haven't seen a single new lora being uploaded in like 48 hours.

i even turned off my filters and nothing.
>>
>>108684914
pretty much haven't been there since the split, can't log in to red yet.
>>
Piper Perri Surrounded
>>
>>108684157
your own lora?
>>
>>108684177
add --disable-api-nodes in your starting flags...
>>
>>108685141
Prompt, catbox, whatever helps with genning this, anon. I have anima and z and klein and chroma
>>
Where is anon's anima realism LoRA from the previous thread? It was looking awesome...
>>
>>108685223
very likely anima to zit, workflow can be found from previous threads
>>
>>108685247
comfy shareholders didn't like it
>>
>>108684914
my country's straight up banned so I just use the archive. I think it lists everything that gets posted on civitai
>>
>>108683002
>>
>>108683781
most of it is 1 false flagging psychopath trying to make the general angry about comfy and api by posing as an api shill
>>
>>108685328
a vpn will be necessary more and more, anon. you might as well bite the bullet unless you want to end up in a fluffy intranet
>>
>>108685784
i've made pizzas with tomato slices like that and didn't like it
>>
So are infographics like scrapbooking, or what? I don't see the point.
>>
>>108684711
>>108684722
No clue. I'd try troubleshooting the c version, maybe you downloaded something wrong. If it still doesn't work then perhaps try

https://github.com/DawnW0lf/ComfyUI-DCW-Diffusion-Color-Wavelets-Node

Noticed some guy implemented it for Comfy, though the settings don't look exactly the same at first glance.
>>
Is there another site besides civitai where you can post your works of art?

it's insane how they're always having issues
>>
>>108685125
It got uploaded a couple threads back.
>>
>>108686066
yeah on 4chan.org/g/ldg/
>>
File: IMG_2311.jpg (73.9 KB)
>>108686095
amazing
-nicholas
>>
>>108686082
too ephemeral, I need my updoots
>>
>>108685896
Actually, same
>>
>>108686123
One (You) is worth 1000 updoots.
>>
>>108685947
Yeah... It's like cloudfags have discovered how to use import image in word.

Who cares besides catalog or menu creators, ffs
>>
firing up the ltx2.3 kinoplex
>>
>>108683115
>>108683154
>>108683171
>>108683219
>>108683267
she was hot
>>
>>108685984
Poor port or snake oil. Have removed it after 15 min...
>>
>>108683366
Anooon...Gib me anima lora/WF for this realism D:
>>
>>108686227
Is it worth using now? Tried when it was released and got still images with terrible sound
>>
thoughts?
https://www.reddit.com/r/StableDiffusion/comments/1sv8uo3/comparing_realism_zimage_turbo_vs_ernie_turbo_vs/
>>
>>108686247
Well.. the 3 are free... I use the 3. Ernie is somehow the one I use least, but it has recent anime and vg knowledge
>>
>>108684914
civitai mods banned playtime_ai from the site. Also his reddit account was perma banned within 30 minutes of posting about his ban from civitai on r/stablediffusion and r/civitai. His huggingface account seems to be very dead and all the discussions are closed and 404'd.
https://huggingface.co/Playtime-AI
>>
>>108686246
it was always good. you just need to learn the art of the kino
>>
>>108686267
he got raped to death
this is what happens when you release good loras
>>
>>108686267
Yeah this guy was probably Epstein's friend...

Wait, I'm wrong, actually Epstein's friends are free...
>>
>>108686275
pretty depressing to see him go. would've dropped some shekels to his kofi to moralize and encourage him but even that link is dead. fuck that simpping faggot in the comment section posting that he deserved the ban. Too many snitching lurking faggots in these threads and on reddit.
>>
>>108686066
Twitter and Pixiv.
>>
>>108686272
as good as acestep, sure
>>
>>108686267
>"maybe local models aren't that good but at least we have the full liberty to make great coom loras unlike you API cucks"
>someone makes great coom loras
>gets banned everywhere
grim
>>
libertarianism is almost always the cause
>>
>>108686272
>it was always good
kek, nice joke
>>
>>108686316
>>108686341
HIS REDDIT IS BACK
https://www.reddit.com/user/playtime_ai/
>>
>>108686354
>leddit
useless if he can't post his shit on civitai or huggingface lul
>>
>>108686354
>1.5 slop
>whatever this is https://old.reddit.com/r/KinkTown/comments/1nemf4c/m4f_42_i_am_married_open_minded_kinky_switch_and/
>disney and other underage char posting
is this just you as part of a humiliation fetish
>>
>>108686371
sad that this legit cuck will end up as a martyr
>>
>>108686371
thats your average localcuck
>>
>>108686371
>>108686387
>literal localcuck
>>
oh no no localbros it's fucking over
>>
>>108686387
how do APIchads generate coom pictures
>>
>>108686267
local loses its only appeal if we can't share coom loras freely, wtf
>>
>>108686405
grok or seedance 2
>>
>literal cuck is the backbone of local loras
>>
>>108686247
Always cute how these comparisons don't mention how much VRAM/RAM is in use during inference.
>>
>>108686371
> https://old.reddit.com/r/KinkTown/comments/1nemf4c/m4f
what is this?
>>
File: argyle.jpg (380.9 KB)
>>
>>108686371
whoa the guy that makes nsfw loras is a degenerate
what a shock
>>
>>108686371
dont scroll further, this guy is a total freak and makes the local community look bad
>>
>>108686371
>https://old.reddit.com/r/KinkTown/comments/1nemf4c/m4f_42_i_am_married_open_minded_kinky_switch_and/
this is the year of our lord 2026, we can now hide our history on leddit, why isn't he doing it? showing this shit publicly doesn't do him any favors
>>
Is this some kind of elaborate ruse?
>>
>>108686463
>we
>>
>>108686463
keeping his degenerate behavior public is part of his cuck fetish
>>
>>108684295
When the Chinese get their heads out of their asses and even then only if Xi doesnt cockblock us.
>>
>>108686463
> showing this shit publicly doesn't do him any favors
why?
>>
i think api cucks are just mad because they can't goon
have some patience with them
>>
>>108684157
Can you try making a three-panel comic?
>>
>>108685947
based jenner. jenny looks wonderful in a catsuit
>>
>>108686656
>>108685947
stop it
>>
i'm generating too many kinos. my hard drive will be full soon
>>
>>108686723
Upload them here and let AI scrapers use them, then use the new model to get them back.
>>
File: cfjwni.png (1.3 MB)
>>
>>108686622
Like with just a prompt? Here's one after like 10 tries, most were pretty bad. I'm sure toss has one already with this exact punchline somewhere. With a little effort you could probably just gen the panels individually and put them together
>>
File: z7901b.png (1 MB)
>>
>>108686845
where's the sus?
>>
ComfyUI sucks ass, it breaks once I run SD with Forge and most of the installation files get nuked, does ComfyUI have malware or what?
>>
Powerful local models for efficiency, security, privacy, sovereignty. ¡Viva la Revolución!!
>>
>>108686689
Stop what?
>>
I wish lodestones wasn't a retard with the attention span of a fruit fly
>>
>>108687107
but comfyui took all these things away. what do?
>>
>>108687122
responding to yourself
posting ugly women
>>
>>108687107
>local models
>freedom
you have to tiptoe around tranny mods to avoid getting banned for posting problematic content
>>
>>108687203
Haute couture.
>>
Thoughts on the anima turbo lora?
>>
>>108687299
It sucks ass, 4~5x more generation time vs SDXL @ 1024x1024. The only reason I would use anima is for the prompt interpreter and getting different results, I'm not testing that shit
>>
civitai is so fucking bad I think some jeet could vibecode a better site
>>
>>108687299
>Thoughts on the anima turbo lora?
really bad, it destroys the colors and styles
>>
good morning. im going to bed now
>>
>>108687299
i like it a lot. i like it more than base gens. the way it constrains the model is really nice imo. i'd use it even without the speedup
>>
>>108687299
It's all right, but I was wary of using it since you couldn't use a negative prompt, but now that NAG has implemented it I can go for it
>>
>>108687361
It's almost 11 AM in Cali and the Civitai admin is still in bed dreaming about cuckoldry.
>>
>>108686845
Nice, thanks.
I wanted to know both how hard it would be, and how coherent (graphics and story wise) it could get.
You just did that in one prompt? Did you explicitly say what happens in each panel or let the AI decide?
>>
>>108687361
lets say i made a better site with no bugs, no payment processor bullshit, and you could upload anything you wanted
whats in it for me exactly? reddit upvotes?
>>
>>108687619
I don't even care if they follow payment processor rules and want to make money I just want the site to be functional and usable.
>>
>>108687619
You may run ads, and ask for crypto payments for being a middleman between retards and Runpod.
>>
>>108687619
>whats in it for me exactly?
glownig cp spam
>>
>>108687619
>whats in it for me exactly?
you would spend your time working on a site instead of wasting your time trolling on a local thread with API models, that's a good start
>>
File: cyrh5h.png (954.3 KB)
>>
>>108687659
normies hate crypto but decent idea
>>108687671
how am i trolling? im running api models locally with comfyui, which is linked in the op
>>
>>108687717
>im running api models
this post is off topic
>>
>ani grift civitai clone
lol
>>
>>108687432
>>108687446
Just tried it. The quality is noticeably worse but as an AMDkek the speed increase is soooo good. Probably won't use it for now but I hope they improve it.
>>
do your part, hide and ignore the troll
>>
>>108687725
anima runs on amd? good to know
>>
>>108687722
so im supposed to ignore the api nodes in comfyui just because it makes you uncomfortable?
>>
>>108686235
Maybe throw whatever cpp issue you're having at Gemini (include the repo's URL in the prompt). It's quite good at troubleshooting.
>>
>>108687758
you are supposed to stay on topic on a local general like everyone else, you aren't above the rules
>>
>>108687779
Rules are for the little people.
>>
>>108687779
nta, you are correct, but he's baiting for (you)s in the LOCAL diffusion general. he knows, he's just being a faggot about it.
>>
>>108687717
>normies hate crypto
Because cryptobros make a whole thing out of it. Gotta find a way to accept crypto that saves the customer from the torrents of bullshit your average shitxchange shills. Then it's no different than the original experience of buying buzz, I assume.
>>
>>108687818
you are little though
>>
>>108687779
maybe op should make a list of blacklisted nodes in comfyui to remove any gray areas
>>
Fresh

>>108687829
>>108687829
>>108687829
>>108687829
>>
>>108687827
if you're too retarded to understand that you should not post images from API models in there, then you deserve to be banned
>>
>>108687754
Yeah. Gens are pretty slow on my 7900xtx though (~28 secs per gen)
>>
>>108687138
Madam, I'll have you know I do neither!

>>108687758
You can also load audio and video in Comfy... but you don't see anyone talking about music and movies because they're not jackasses.
>>
>>108683002
comfy has resorted to shilling on /ldg to try to get people to pay money to make images with his noodleshit
>>
>>108687717
>I'm only pretending to be retarded
boring
>>
>>108687851
not much longer compared to my IL gens, but I have just a 7800XT. Oh well, still worth trying out at least.
>>
>>108683577
whoa.. this is ltx? when did this happen? last i remember ltx was just blurry incoherent shit
>>
>>108683976
is this supposed to look good? if this is all you got then you're eating out of the dumpster
>>
>>108684650
what do you use for the tooling? is it comfy or forge?
>>
>>108687890
wrong
>>108684432
>>
>>108687717
hot
