Thread #108609718
File: highlights_g_108604726_1776273831_1.jpg (3.2 MB)
3.2 MB JPG
Discussion and Development of Local Image and Video Models
Previous: >>108604726
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
>>
>mfw Resource news
04/15/2026
>DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
https://huggingface.co/tencent/DisCa
>Lyra 2.0: Explorable Generative 3D Worlds
https://research.nvidia.com/labs/sil/projects/lyra2
>AniGen: Unified S3 Fields for Animatable 3D Asset Generation
https://github.com/VAST-AI-Research/AniGen
>T2I-BiasBench: A Multi-Metric Framework for Auditing Demographic and Cultural Bias in Text-to-Image Models
https://gyanendrachaubey.github.io/T2I-BiasBench
>Generative Refinement Networks for Visual Synthesis
https://github.com/MGenAI/GRN
>VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization
https://videoflextok.epfl.ch
>DiffusionPrint: Learning Generative Fingerprints for Diffusion-Based Inpainting Localization
https://github.com/mever-team/diffusionprint
>Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
https://github.com/deep-optimization/CoM-PT
>Self-Adversarial One Step Generation via Condition Shifting
https://github.com/LINs-lab/APEX
>See-through WebUI
https://github.com/BeamManP/see-through-webui
>ERNIE-Image: Repackaged model files for ComfyUI
https://huggingface.co/Comfy-Org/ERNIE-Image
04/14/2026
>Nucleus-Image Released
https://huggingface.co/NucleusAI/Nucleus-Image
>ERNIE-Image: Text-to-image generation model built on a single-stream Diffusion Transformer
https://huggingface.co/baidu/ERNIE-Image
>Danbooru Dataset Filter: High-Speed Metadata Explorer for AI Training
https://github.com/ThetaCursed/Danbooru-Dataset-Filter
>ChatGPT will praise the mood and 'bedroom/DIY texture' of fart sounds pulled from YouTube
https://www.pcgamer.com/software/ai/chatgpt-will-praise-the-mood-and-bedroom-diy-texture-of-fart-sounds-pulled-from-youtube
>RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details
https://limuloo.github.io/RefineAnything
>>
>>
>mfw Research news
04/15/2026
>Ride the Wave: Precision-Allocated Sparse Attention for Smooth Video Generation
https://arxiv.org/abs/2604.12219
>StructDiff: A Structure-Preserving and Spatially Controllable Diffusion Model for Single-Image Generation
https://butter-crab.github.io/StructDiff
>Scaling Exposes the Trigger: Input-Level Backdoor Detection in Text-to-Image Diffusion Models via Cross-Attention Scaling
https://arxiv.org/abs/2604.12446
>PromptEcho: Annotation-Free Reward from Vision-Language Models for Text-to-Image Reinforcement Learning
https://arxiv.org/abs/2604.12652
>MAST: Mask-Guided Attention Mass Allocation for Training-Free Multi-Style Transfer
https://arxiv.org/abs/2604.12281
>Bridging the Micro-Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localization
https://arxiv.org/abs/2604.12341
>LottieGPT: Tokenizing Vector Animation for Autoregressive Generation
https://lottiegpt.github.io
>Combating Pattern and Content Bias: Adversarial Feature Learning for Generalized AI-Generated Image Detection
https://arxiv.org/abs/2604.12353
>Nucleus-Image: Sparse MoE for Image Generation
https://arxiv.org/abs/2604.12163
>HDR Video Generation via Latent Alignment with Logarithmic Encoding
https://HDR-LumiVid.github.io
>CoD-Lite: Real-Time Diffusion-Based Generative Image Compression
https://github.com/microsoft/GenCodec/CoD_Lite
>SubFlow: Sub-mode Conditioned Flow Matching for Diverse One-Step Generation
https://yexionglin.github.io/subflow
>OFA-Diffusion Compression: Compressing Diffusion Model in One-Shot Manner
https://arxiv.org/abs/2604.12668
>EDGE-Shield: Efficient Denoising-staGE Shield for Violative Content Filtering via Scalable Reference-Based Matching
https://arxiv.org/abs/2604.06063
>Visual Preference Optimization with Rubric Rewards
https://arxiv.org/abs/2604.13029
>On the Robustness of Watermarking for Autoregressive Image Generation
https://arxiv.org/abs/2604.11720
>>
>>
>>
>>
>>
>>
>>
>>
File: 1750838806432146.png (261.5 KB)
261.5 KB PNG
>being spawned into a consciousness of a human before the matrix is made
what a shit life, lmao
>>
File: 1758620173448287.jpg (459.2 KB)
459.2 KB JPG
>>108609697
>The Chinese always come out with really nice architectures, but they really can't into quality training data. Shame, the model had potential, but it's clearly slopped. It's very strange, due to Flux 2 VAE, some photos look very realistic, while others don't look it at all. They likely used a mixture of both slopped and real data, and it shows.
that pisses me off because I thought that the humongus hype Z-image turbo generated would be a sign to those fuckers that people your models way more if you only train on real data, fucking souless bugs get your shit together!
>>
>>108609942
I feel ya anon, I thought I was fortunate to have grown up during the golden age of video games, but people in 50 years will be eating so good, like fucking Sword Art Online shit but in real life, damn...
>>
>>
>>
>>108609990
theres always worse, but theres always much better, especially for all eternity in the matrix where 99.9999% people that ever existed will actually be compared to this shit time.
and sorry to burst your retarded libshit feminist bubble of the past that you randomly decided to add the imaginary foid suffering into this convo with, but not everyone in the past was raped. i know, shocker. being raped is only a common occurence if you are born as a dalit woman in india in the year of your brown sisters superpower of 2025.
>>
>>
>>
>>108610124
I put the comma outside the parenthesis but I don't know for sure.
I believe it makes more sense because the tag comma isn't part of what you want to emphasize semantically, but technically I don't know how it precisely works.
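For what it's worth, here is a toy sketch of why the comma placement could matter. It assumes the common `(text:weight)` emphasis syntax and is not the actual parser from ComfyUI or Forge; the function name `split_weighted` is made up, and nesting and escaped parentheses like `\(` are ignored. It only shows which characters end up carrying the weight.

```python
import re

# Toy parser for "(text:weight)" emphasis. This is an illustration only,
# NOT the real ComfyUI/Forge implementation. Nesting and escaped
# parentheses such as \( \) are deliberately not handled.
_EMPHASIS = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def split_weighted(prompt, default=1.0):
    """Return a list of (text, weight) segments in prompt order."""
    segments = []
    pos = 0
    for m in _EMPHASIS.finditer(prompt):
        if m.start() > pos:  # plain text before the emphasized span
            segments.append((prompt[pos:m.start()], default))
        segments.append((m.group(1), float(m.group(2))))
        pos = m.end()
    if pos < len(prompt):  # trailing plain text
        segments.append((prompt[pos:], default))
    return segments

# Comma outside the parentheses: only "green hair" is weighted.
outside = split_weighted("1girl, (green hair:1.2), solo")
# Comma inside the parentheses: the comma is weighted along with the tag.
inside = split_weighted("1girl, (green hair,:1.2) solo")
```

With the comma outside, the separator stays at the default weight, which matches the intuition that it isn't part of what you want to emphasize. Whether a real text encoder cares much about a weighted comma is a separate question this sketch doesn't answer.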
>>
>>
>>
File: 33206432037480.png (806.3 KB)
806.3 KB PNG
>>
File: 528980582670072.png (3.5 MB)
3.5 MB PNG
>>
>>
File: ErnieImage_Output_272627.png (3.4 MB)
3.4 MB PNG
Ernie Turbo sadly seems to respond to traditional hi-res-fix upscaling in the same weird artifacty way that Z Image Turbo does. Picrel is Ernie with 8 steps @ 896x1152 -> 1.5x upscale with 4xFaceUpSharpDAT -> 8 steps second pass denoise @ 0.5 strength. Klein Distills at their standard 4 steps don't have this problem, they can even handle 2x without it going weird like this.
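For anyone wanting to reproduce the numbers, a quick sketch of the resolution math for that second pass. The helper name `hires_size` and the snap-to-a-multiple-of-16 rule are assumptions for illustration (the post doesn't state how the UI rounds); the base size and scale factors are the ones quoted above.

```python
def hires_size(width, height, scale, multiple=16):
    """Scale a resolution and snap each side down to a multiple (assumed 16)."""
    def snap(v):
        return int(v * scale) // multiple * multiple
    return snap(width), snap(height)

# The 1.5x pass described for Ernie Turbo.
w, h = hires_size(896, 1152, 1.5)
# How many times more pixels the second pass has to denoise.
pixel_ratio = (w * h) / (896 * 1152)
# The 2x pass that Klein Distills reportedly still handle.
w2, h2 = hires_size(896, 1152, 2)
```

So the 1.5x pass works at 1344x1728, about 2.25x the pixels of the first pass, and a 2x pass would be 1792x2304 at 4x the pixels.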
>>
>>
>>
>>
File: 60359428348944.png (2.5 MB)
2.5 MB PNG
>>
>>
>>108610508
>Wai is so popular?
Jeets are too lazy to pick and use loras when needed so they want prebaked shitmixes
>ultimately the work of the anima fags should be way more respected, they're the ones making a real finetune after all
I don't disagree
>>
>>
>>108610537
>>108610541
fair enough, but the fact that Wai got recieved so well here shows that /ldg/ is browner than I expected :(
>>
>>
>>
File: 1755152940138396.png (231.1 KB)
231.1 KB PNG
https://www.reddit.com/r/StableDiffusion/comments/1smfz58/ernie_turbo_is_pretty_awesome_i_think_this_is_my/
glad that the ledditors are shitting on ernie, those chinks need to understand that Z-image turbo fucking exists, and that we wil never settle for less
>>
>>
>>
>>
>>
File: 1755907179655578.png (127.2 KB)
127.2 KB PNG
>>108610666
>More Chinkslop with no editing despite the fact they used an LLM that has a vision encoder?
there will be an edit model though, but ernie is as slopped as Klein so I really don't see the point, it'll probably be even worse
https://xcancel.com/ErnieforDevs/status/2044290766349185257#m
>>
I don't get anon's obsession with no editing. How often do you edit images? Would you dismiss local banana pro if it came without edit capability?
Mediocre quality and plastic look are bigger problems with these models.
>>
>>108610693
I agree that plastic skin is a big issue, but editing is a powerful tool, I'm actually using NBP to make multiple scenes from one image input, and I can use those frames to make funny videos (first frame + last frame) with LTX for example
>>
>>
File: 811745274722071.png (2.1 MB)
2.1 MB PNG
>>
>>108610693
most of the people who come here to seethe about local models don't know anything beyond basic free slop gens and nano banana. nano banana is the only way they can get "consistent" characters, so edit models are super important to them because it's the only way they can make their AI influencers.
>>
File: Usecase?.png (171.8 KB)
171.8 KB PNG
>>108610693
>>108610731
>usecase for a good editing model that could make anime characters and celebrities loras obsolete?
>>
>>108610709
>I'm actually using NBP to make multiple scenes from one image input, and I can use those frames to make funny videos (first frame + last frame) with LTX for example
same, but with seedance, it's so funny to see how the model can transition from first frame to last frame
https://www.youtube.com/watch?v=CpoH9TGrwaE&t=81s
>>
>>
>>108610693
>How often do you edit images?
Nobody uses edit models for the same reason people don't use 3D generation models. BECAUSE THEY ARE CURRENTLY TRASH.
When people get an edit model that doesn't lose quality when making iterative edits, that will be the default way most people will use all image gen ai.
>>
File: awooo.png (222.8 KB)
222.8 KB PNG
>>108610790
>When people get an edit model that doesn't lose quality when making iterative edits
it'll only happen without a VAE
>>
>>
>>108610800
>it'll only happen without a VAE
Correct.
>>108610693
>Mediocre quality and plastic look are bigger problems with these models.
Also, with an actually good edit model, this would actually be a solved problem then since you could simple tell it what style you want or give it a realistic photo and tell it to gen shit like that.
>>
File: 666632985707544.png (2.3 MB)
2.3 MB PNG
>>
>>108610813
>simple
simply
>>108610693
>Would you dismiss local banana pro if it came without edit capability?
No, but not because of it, but despite it. People would still use it because it would still be a good model, but to act as if the editing capabilities and reference image upload and understanding are not the main point of NBP is delusional.
>>
File: 990617329780653.png (1.4 MB)
1.4 MB PNG
>>
>>
>>
>>
>>108610892
>what people are in fact doing now, is flow models
with which models? they all still seem to average out the overvibrant colors throughout the whole image without being able to get really dark/light images
>>
>>
File: 271976308820233.png (2.8 MB)
2.8 MB PNG
>>
File: 515392093392311.png (2.6 MB)
2.6 MB PNG
>>
>>
>>108608824
>>108608861
KITAAAA
>>108610430
Preview 3 came out so recently that I'm surprised they could pop out a finetune of it so soon. Then again, Wai was releasing new versions pretty frequently at one point.
>>
>>
File: is this nigga serious?.png (479.8 KB)
479.8 KB PNG
>>108610971
>Klein has significantly better raw image quality than Z Image
Z image base maybe, Z image turbo no way
>>
File: 1079235741687862.png (2.6 MB)
2.6 MB PNG
>>
File: 1052507738036170.png (2.6 MB)
2.6 MB PNG
>>
File: 1759398051091128.png (58.9 KB)
58.9 KB PNG
>>108611007
jesus!
>>
> Holy shxt… The rules of AI image generation just completely changed. A GPT-based model currently testing in the arena under the bizarre name 'duct-tape' is turning the global AI community upside down. What exactly did they feed this thing?
> Native-level text generation with zero awkwardness. Chilling consistency maintained down to the pixel. Overwhelming illustration quality ready for immediate commercial use.
>"Is it just downloading photos from the internet?" That was every tester's first reaction. It's that unbelievable.
>A game-changer that instantly crumbles Nano Banana Pro's dominance has arrived. A lot of people might need to start packing their desks again.
>>
>>
>>
File: trust the plan.png (211.9 KB)
211.9 KB PNG
>>108611062
>or something that really happened?
it'll happen
>>
>>
File: 928930140436976.png (2.9 MB)
2.9 MB PNG
>>108611028
they're pretty cute
>>
>>
>>
>>108610970
>>108611007
nice, this is anima to zit I assume
>>
File: 1756820463828329.png (505.2 KB)
505.2 KB PNG
Don't buy that used 3090 goy.
>>
>>
>>
>>
>>
File: 983644357632647.png (2.5 MB)
2.5 MB PNG
>>108611172
>>108611196
yes indeed
>>
>>
File: 1097519631578103.png (2.6 MB)
2.6 MB PNG
>>108611298
no
>>
File: HF5WRznb0AAozlF.jfif.jpg (1.7 MB)
1.7 MB JPG
>Midjourney V8.1 is live! Our iconic aesthetics are back w native 2K HD rendering - 3x faster and 3x cheaper vs V8. Full quality V8.1 1K mode is faster than V7 draft mode. Image prompts are back. New "Describe" is live - and you'll love our new moodboards & srefs. More soon <3
>>
File: 654782018247116.png (2.1 MB)
2.1 MB PNG
>>
File: ComfyUI_temp_targv_00126_.png (2.2 MB)
2.2 MB PNG
>>108611288
Really nice, what LLM are you using to caption the photos? qwen or gemma4
>>
File: 1749310063649930.png (97.1 KB)
97.1 KB PNG
>>108611362
local models?
>>
>>
>>
>>108611362
im not even going to try it, look at their vibecoded slop site. this is just embarrassing
https://alpha.midjourney.com/
>>
>>108611362
i usually hate the unrealistic, weird saturation, weird 3d render plastic look but that one doesnt seem as sloppy as usual although the fact that ZIT still blows everything except NBP out of the water with actual candid realism is hilarious
>>
>>108611371
didnt mean it in a bad way, hes based, i just didnt see him in a while, although i havent been here in a while either
i thought it might be him given the oversaturated asian girl squatting gen and that it was interesting that he moved on from chroma to whatever that model is
>>
>>108611362
[SERIOUS DISCUSSION]
How come Midjourney is the only model capable of doing rich colors without looking completely fried? Local cannot come close to this dynamic range without cranking a lora to weight 2+
>>
>>
>>
>>
>>
>>108611429
>remember when Midjourney was the boss
no? aside from some niches they mostly got their popularity because they allowed anyone to joing a discord server and start genning right then and there in a discord channel. they had 1 click button upscales and edits, this was the main thing that allowed youtuber normgroids to hype ai image gen easily to the normgroid masses, creating a positive feedback loop.
>>
>>
File: ComfyUI_21388.png (2.5 MB)
2.5 MB PNG
>>108611362
>Image prompts are back
WAT, how did you use it before?
>>108611406
They could always just post-process the image and crank up the vibrancy. In PS you can usually push a full 100% without banding/contrast issues (I've done it several times).
>>
File: WaiAnima1_00008_compare.png (2.7 MB)
2.7 MB PNG
Preview 3 on the left, WaiAnima on the right. The refinements are appreciable.
>>
>>
>>
>>
>>
>>
>>
File: WaiAnima1_00009_compare.png (4 MB)
4 MB PNG
>>108611454
Same prompt and seed at 1280x1600. Also significantly improved over vanilla preview 3. The latter looks so different that I reran it just in case I got a setting wrong somewhere, but nope.
>>
File: 1000008193.jpg (1.1 MB)
1.1 MB JPG
Ernie will win. True seed and texture are awesome
>>
File: 25477721173077.png (2.4 MB)
2.4 MB PNG
>>108611368
Thanks, I just use danbooru tags.
>>
File: WaiAnima1_00009_compare.png (4 MB)
4 MB PNG
>imagepost "succeeds" but the post doesn't show up
>retrying says there's a duplicate file, but the post doesn't show up
JFC, how long is this breakage gonna last?
>>108611454
Same prompt and seed at 1280x1600. Also significantly improved over vanilla preview 3. The latter looks so different that I reran it just in case I had a different setting somewhere, but nope, there's no mistake.
>>
>>
>>108611545
companies dont want to invest into something as niche as this since they wont suddenly gain some big piece of the market since the model wont be production ready anyway, so they invest just enough to try something new and gain #1 FOSS spot while using actual top R&D talent and compute on getting even 0.5% extra on 1 benchmark for their LLM
>>
>>
File: the sequel to falling down starring george lucas.jpg (193.8 KB)
193.8 KB JPG
Is there ANY way to get good video generation that isn't filtered? All I want is to put images in a prompter, say what I want, and try to get it decent. Do I really have to into SD/ComfyUI shit these days because this shit gets increasingly censored?
>>
>>
>>
>>
>>
File: Z-image turbo.png (1.9 MB)
1.9 MB PNG
>>108611490
this looks like plastic garbage, are you fucking serious anon? Z-image turbo mogs that shit
>>
>>
>>
>>108611536
I like WAI more. I don’t know how it does it, but it knows how to make things with good quality. Many people say it’s “WAIslop,” but that only happens if you don’t configure the tags at all, especially with SDXL. I have high expectations for WAI Anima.
>>
>>
>>
>>
>>
File: Shifty AnimaPreview3 MK4 sample.jpg (1.5 MB)
1.5 MB JPG
Cooked this LoRA. Tested it for a whole day. I think it's working well.
https://civitai.red/models/2546093/shifty-nikke-goddess-of-victory-anima-lora?modelVersionId=2861325
>>
>>108611829
my guess is cloudflare, look at this shit: https://www.cloudflarestatus.com/
>>
>>
>>
>>
File: png too big.jpg (2.3 MB)
2.3 MB JPG
>>108611489
>>108611536
Oh, so posts just might not show up for 10+ minutes.
Interesting that poses from prior preview versions can resurface in WaiAnima.
>>
>>
>>
>>
>>
File: 00303-2660396362.png (2.6 MB)
2.6 MB PNG
>>
>>
File: 1761504815134854.jpg (71.5 KB)
71.5 KB JPG
goddamn civit grenaded my setup
all the md5s are gonna change
my API checks have gotta be redone
my spreadsheet has gotta be reworked
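If the spreadsheet has to be rebuilt anyway, a minimal sketch of regenerating the hash column locally. Everything here is an assumption for illustration: the function names, the `*.safetensors` pattern, and the CSV columns are hypothetical, and nothing in it talks to any Civitai API.

```python
import csv
import hashlib
from pathlib import Path

def file_md5(path, chunk_size=1 << 20):
    """Stream a file through MD5 so multi-GB checkpoints never sit in RAM."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def write_manifest(model_dir, out_csv, pattern="*.safetensors"):
    """Write (filename, size_bytes, md5) rows for every matching file."""
    rows = []
    for p in sorted(Path(model_dir).rglob(pattern)):
        rows.append((p.name, p.stat().st_size, file_md5(p)))
    with open(out_csv, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["filename", "size_bytes", "md5"])  # hypothetical columns
        writer.writerows(rows)
    return rows
```

Calling something like `write_manifest("models/loras", "manifest.csv")` (paths made up) would give a CSV that can be diffed against the old sheet to see which hashes changed.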
>>
>>
>>
File: deEF_zi_00012_.png (2.2 MB)
2.2 MB PNG
>>
File: 1750628155863628.png (1.9 MB)
1.9 MB PNG
really enjoying the ability to actually craft compositions in anima
it even (somewhat) understands how broken glass works
>>
File: 00011-107078608.png (2.5 MB)
2.5 MB PNG
>>108611490
ernie has that newish bright colored skin texture that's in qwen image 2.0. Not bad but clearly has the ai feel to it.
>>108612062
>>108611659
>>108611156
>>108611020
>>108610960
please anon can you share the prompt for some of these? are you using controlnet and img2img?
>>
>>
>>
>>
File: 00020-619052514.png (2.6 MB)
2.6 MB PNG
>>108612650
it's astroturfing these threads and promotes shitposting.
>>
>>
>>
>>
File: 4chan_g_animanon-troll-MO.png (115.8 KB)
115.8 KB PNG
>>108612650
It's 99% one troll who's jealous it got seed funding from ComfyUI.
>>
File: 1756664336789151.jpg (852.7 KB)
852.7 KB JPG
>>
File: 1772410481160274.jpg (701.3 KB)
701.3 KB JPG
>>
File: Flux2-Klein_00235_.png (1.9 MB)
1.9 MB PNG
>>
>>
>>
File: mfw.png (886.7 KB)
886.7 KB PNG
>>108613189
>>
>>
File: 1745550056334281.webm (3.9 MB)
3.9 MB WEBM
Did you like HappyHorse? Then you will love HappyOyster kek
https://xcancel.com/HappyOysterAI/status/2044618799089926428#m
>>
>>108613279
>Alibaba has gotten genuinely good on their craft
>And that's the exact moment they stopped going local
I hope you localkeks enjoyed being used as a free advertising tool, because remember, it's only local until it's good ;D
>>
>>
File: SAAR DO NOT REDEEM.png (1.2 MB)
1.2 MB PNG
>>108613279
>>
>>
>>108612127
>the fuck? This is beyond useless.
that was my feeling during the whole 2026 year, only Klein turned out to be decent, the rest was a bunch of nothingburgers, and now that Alibaba left open source the future is so fucking grim
>>
>>
>>
File: that's right.png (236.3 KB)
236.3 KB PNG
>>108613404
>Traffic has been dropping steadily
that's a shame, I know that trooncord is slowly killing forums and shit but I don't want to be a fucking avatarfag, fuck that site, and I'm upset that the jew that bought it chickened out when he tried to implement IDs on NSFW threads, he had the perfect occasion to kill it kek
>>
File: 8478356856382872.jpg (2.6 MB)
2.6 MB JPG
>>
>>108613283
I'm completely satisfied with Klein, and BFL will continue to provide me with new models in the future.
Since I don't know any celebrities anyway and Trump isn't suitable for gooning, I can live with Klein's downsides.
>>
>>
>>108613538
>and BFL will continue to provide me with new models in the future.
they won't, they gave us decent models only because they had to compete against Alibaba, now that Alibaba is gone they have no reason to give us better models and compete, Alibaba's death means BFL's death
>>
>>
>>108613608
no one said anything about true blacks, that's not what will get us the vivid and varied set of colors midjourney is producing, and for the moment only that model can do something like that, only they know the secret sauce
>>
>>108611362
Midjourney is like the antithesis of Z-image turbo: every seed is a completely different image, but unfortunately for them those images are often wonky. looks like the balance between quality and variety hasn't been reached yet
>>
>>108613610
>what are Klein, Chroma, any DiT model in existance...
Chinese also recently released a model that is objectively both more aesthetic and technically impressive than MJ
https://ernieimageprompt.com/
Nothing will save MJ because it's pure slop.
>>
File: I kek'ed irl, nice job.gif (3.5 MB)
3.5 MB GIF
>>108613684
>Chinese also recently released a model that is objectively both more aesthetic and technically impressive than MJ
>https://ernieimageprompt.com/
>>
File: bruh.jpg (1.1 MB)
1.1 MB JPG
>>108613684
>the model that has been finetuned with only Nano Banana Pro's output is not slop
come on dawg
>>
>>
>>
>>
>>108613684
Also
>not fried
Zoom in on the hands here
https://alpha.midjourney.com/jobs/de24932f-53fe-4fe6-8002-d90602f8f838?index=3
This is not just a quirk of the latest model. Their VAE has been stuck in 2023 for the longest time. MJ was the king of aesthetics for a while, but it fell off hard and became a grift the moment more technically capable models became both cheaper and open sourced. I should say, a lot of their "aesthetics" are also stuck in that year.
>>
File: 1765589856842655.png (35.3 KB)
35.3 KB PNG
I tried anime but my gens are all just black boxes, I use Forge Neo.
Was there anything special I had to do? I think I got the right stuff.
>>
File: 1768436056140323.png (320.1 KB)
320.1 KB PNG
>>108613753
they stopped making effort and improving the architecture because they saw the aesthetics alone was making them rich as fuck, they don't need to touch anything else actually, why would you risk ruining the aesthetics for something more solid but also more boring (if they go that path they'll have to compete with fucking NBP and GPT Image 2), at least with aesthetic slop they have no rivals to compete with
https://research.contrary.com/company/midjourney
>>
>>
>>
File: 1775384842409089.png (199.8 KB)
199.8 KB PNG
>>108613774
case in point, MJ survives because other companies focus on making solid but sterile images, and to be fair, it's not that they don't want to do it, it's that we still haven't solved styles. NBP, as good as it is at editing, cannot reproduce styles, no model actually can
>>
>>
>>108613774
>MJ, which aesthetically doesn't stand a chance against several local models released past 2025 across multiple departments, somehow stands a chance against an encyclopedic model released by Google that can create any image possible because it literally has seen almost the entire domain of images.
I think you are very confused. If MJ got into an Arena and were competing against SOTA models it would laughably be at the bottom. MJ may have made its name off of aesthetics, but these days it is far from the leader there. The vast majority of the people they attracted to their grift are not tech savvy, as they focused heavily on marketing to complete normies on social media, so their customers being blissfully unaware of better models in existence is their only advantage. It also does not help that Civit is a cesspool to this day, so when one thinks open source they immediately think of Civit and attribute those sloppers to current open source capabilities, which is far from the truth.
>>
File: 1753181502955974.jpg (471.1 KB)
471.1 KB JPG
>>108613827
I'm using anima prompts I found on civitai but some were just not working for me, this is pretty weird. I think it's okay now maybe, after turning off speedups in the .bat. My opencv denoiser extension seems broken though, giving some error. That sucks.
>>
>>108613851
>MJ, which aesthetically doesn't stand a chance against several local models released past 2025 across multiple department
are you living in the same universe as me? local models were aesthetically more varied and interesting during the SD1.5/SDXL days, nowadays it's just pure DiT era slop that can do 5 styles max, what are you even talking about?
>>
File: 1754938902861440.jpg (71.2 KB)
71.2 KB JPG
>>108613871
That's what I was wondering too. Old SD models had great aesthetics.
>>
>>
>>108613871
We now have edit models. So any style is possible on the fly with local. Aside from those, the quality of images (which is an objective aesthetic metric, unless you think bad hands are still aesthetic in this day and age), plus prompt understanding, far trumps what MJ can output.
The collage on this thread alone has more aesthetic variety than anything MJ can produce. It can't do proper amateur photography, nor multiple objects nor text coherently. MJ truly has no moat, and if you can't see that you're not any better than some brainrotted Zoomer from Tiktok who just learned about image models by scrolling through his feed.
>>
File: 1752643726893451.png (162 KB)
162 KB PNG
>>108613947
>We now have edit models. So any style is possible on the fly with local.
no edit model can reproduce styles, I'm really starting to believe you're living in another universe, how's life in there? do you call "matter" "antimatter"?
>>
>>
>>
>>108613954
May not be perfect, but they absolutely can, and they will only get better at doing so. Haven't really had trouble with Klein, but Ernie edit model will probably be even better at that as the style variety out of the box resembles what NBP can do.
>>
File: 1749473760768220.png (226.3 KB)
226.3 KB PNG
>>108613983
>May not be perfect, but they absolutely can
care to show an example? I
>>
File: 913512290657719.png (2.9 MB)
2.9 MB PNG
>>108612606
Prompts are just danbooru tags. It's anima -> zit img2img, and for zit I just remove the "masterpiece, best quality, score_7, photo \(medium\)" and keep the rest of the tags. But I did train an Anima lora on realistic images. FYI if you train on realism, do not use the "@" token, it pushes it towards illustration (I think).
>masterpiece, best quality, score_7, photo \(medium\), sunna \(zenless zone zero\), zenless zone zero, 1girl, animal cutout, animal ear fluff, animal ears, bell, black bra, black choker, black panties, black thighhighs, blunt bangs, blush, bra, breasts, cat cutout, cat ears, cat girl, cat lingerie, cat tail, choker, cleavage cutout, closed mouth, clothing cutout, earrings, fake animal ears, fang, frilled bra, frills, green eyes, green hair, hair ornament, hairclip, jewelry, jingle bell, kemonomimi mode, leaning forward, looking at viewer, medium hair, musical note earrings, nail polish, navel, neck bell, one side up, open mouth, panties, paw pose, pink nails, ribs, skindentation, small breasts, solo, striped clothes, striped thighhighs, tail, thighhighs, toeless legwear, toenail polish, toenails, toes, underwear, underwear only, indoors
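The tag-stripping step for the second pass can be sketched like this. It's a minimal illustration assuming comma-separated tags and exact-match removal; the helper name `strip_tags` and the shortened example prompt are made up, and the dropped tags are the four quoted in the post.

```python
# Tags the post says it drops for the second (img2img) pass.
QUALITY_TAGS = {"masterpiece", "best quality", "score_7", r"photo \(medium\)"}

def strip_tags(prompt, drop=QUALITY_TAGS):
    """Remove exact-match tags from a comma-separated tag prompt, keeping order."""
    tags = [t.strip() for t in prompt.split(",")]
    return ", ".join(t for t in tags if t and t not in drop)

# Shortened, hypothetical first-pass prompt in the same style as the one above.
first_pass = r"masterpiece, best quality, score_7, photo \(medium\), 1girl, cat ears, green hair, indoors"
second_pass = strip_tags(first_pass)
```

Here `second_pass` keeps only the content tags in their original order, which is what gets fed to the img2img model.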
>>
wtf anons. seedance 2.0 is way less censored now. it still has a sensitive copyright filter system but nsfw prompts seem to be able to get through the model. All of these were text2video.
https://litter.catbox.moe/iguqdbqozpr68taz.mp4
https://litter.catbox.moe/sa4ic053p8jrmci1.mp4
https://litter.catbox.moe/pawrqmmuiwrpv6yx.mp4
https://litter.catbox.moe/ygs5abg61l7ltmy1.mp4
https://litter.catbox.moe/afc8ep89li01gi0m.mp4
>>
File: I'm... I mean he's interested to know.png (150.7 KB)
150.7 KB PNG
>>108614026
>wtf anons. seedance 2.0 is way less censored now.
damn, what API site are you using? it's for a friend
>>
>>
File: kys.gif (142.4 KB)
142.4 KB GIF
>>108614026
>genning SAAS slop, downloading it and then uploading it and then posting it to a local thread
>>
>>
>>
File: ack.png (270.6 KB)
270.6 KB PNG
>>108614053
>>
>>108614035
https://budgetpixel.com/
budget pixel. Image2video is still fucked and doesn't allow human references. It seems like bytedance reach a ultimate compromise of the model for kike lawyers to back off from them. Would love to use local or seedream images for it but its a no go. There's rumors of a work round to trick the model but i don't know how it works.
>>
>>
>>108614026
>>108614073
put that here >>>/wsg/6126746 they might be interested, this is still a local thread after all
>>
>>
>>
>>108614087
>Those videos aren't from Seedance, it's from LTX!
>>108614137
>LTX videos on /vdg/ mog those videos !
So LTX > LTX?
>>
>>
File: thank you localkeks.png (66 KB)
66 KB PNG
>>108614204
>if you paid money for them in 2026
you literally paid (((Jensen))) thousands of dollars to goon on a 5 second Wan 2.2 goon slop lol
>>
>>
File: 1773223496146212.png (51.1 KB)
51.1 KB PNG
>>108614254
>to own a computer, you must pay a 2000 dollars gpu
good Nvdiakek, that's right, the more you buy, the more you save!
>>
>>
>>
>>108614204
>you must be ashamed to give anyone money to gen AI videos
>>108614307
>you're jealous because I gave daddy jensen a ton of money to gen AI videos
kek
>>
File: 558299712669949.png (2.5 MB)
2.5 MB PNG
>>
>>
>>108614337
>>108614204
>if you paid money you should feel bad.
>>
>>
>>
File: 662789903062783.png (2.2 MB)
2.2 MB PNG
>>108614026
>>108614087
I have to agree, that doesn't look like seedance 2.0
>>
File: 1749435765168363.png (40.4 KB)
40.4 KB PNG
>>108614026
>less censored now
don't fall for this trap again. they can update the filter at any time
>>
File: 154879799467901.png (2.5 MB)
2.5 MB PNG
>>
File: plastic shit.png (1.9 MB)
1.9 MB PNG
https://xcancel.com/bdsqlsz/status/2044726628920742398#m
looks like Nano Banana Pro will remain the king for a long time lool
>>
>>
File: bugs bunny ahh teeth.png (300.5 KB)
300.5 KB PNG
>>108614373
>>108614433
what's up, doc?
>>
>>
>>
File: 1766928243572563.png (253.6 KB)
253.6 KB PNG
>>108614469
>>
>>108614502
>https://huggingface.co/spaces/silveroxides/Lodestone-Tagger-UI
>Lodestone
>tagger
>ui
>tagger , tag, tag pictures
>huggingface
Lodestone released an image tagger on Hugging Face
>>
>>
>>
File: Untitled_Dxsigldq.png (2.8 MB)
2.8 MB PNG
>>108614383
anon I'm not 100% dependent on closed source saas shit but I also can't be 100% dependent on local toys. I'm just very pragmatic when it comes to this hobby. I like the diversity of model choices on both the saas and local routes.
>>
File: Ernie_00107_.png (3.2 MB)
3.2 MB PNG
heh, ernie.
fuck I'm old
>>
>>
>>
>>108614469
Isn't he still training it?
Anyway it looks schizo like most lodestone shit. Seems to know an insane amount of niche tags, but can't even determine the more common ones reliably. And it's even less useful since most of its knowledge is esoteric furry shit. Maybe if you pruned the furry tags from it it could have some use, but I think I would just continue to use WD14 if I ever needed to do lora training for a tag based model.
He also forgot to prune meta tags unrelated to image content like "English commentary" or "grandfathered content" and wasted compute and weights teaching the model gibberish.
This is also 5 gigs compared to the few hundred megabytes of a typical tagger. Worth noting for the quality/speed tradeoff when batch tagging a lot of images.
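If you want to try the pruning idea on the output side instead of retraining, here is a minimal sketch in Python, assuming the tagger hands you a plain list of tag strings. The names `prune_tags`, `normalize` and `META_TAGS` are made up for illustration and are not part of any tagger; only the two meta tags quoted above come from this post, so extend the set yourself.

```python
# Hypothetical blocklist: only these two tags are named in the post above.
META_TAGS = {"english commentary", "grandfathered content"}


def normalize(tag: str) -> str:
    """Lowercase and treat underscores as spaces so both tag styles match."""
    return tag.strip().lower().replace("_", " ")


def prune_tags(tags: list[str], blocklist: set[str] = META_TAGS) -> list[str]:
    """Return tags in their original order, dropping any blocklisted meta tag."""
    return [t for t in tags if normalize(t) not in blocklist]


if __name__ == "__main__":
    raw = ["1girl", "English_commentary", "outdoors", "grandfathered content"]
    print(prune_tags(raw))
```

This only cleans up captions after tagging. It does nothing about the model size or the compute already spent learning those tags.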
>>
>>108614540
holy shit they banned playtime ai and deleted all his ltx 2.3 loras from civitai. I sense a great purge coming soon anons.
https://civitai.red/user/playtime_ai_
here are some of his loras on civarchive
https://civarchive.com/users/playtime_ai_
>>
>>
>>
File: 1747525741064656.jpg (1.1 MB)
1.1 MB JPG
>>108614648
I was but I turned it off since it said I might want to. I had negative prompts too.
I dunno what was going on but it works now.
>>
>>
>>
>>
File: 351198179555883.png (2.4 MB)
2.4 MB PNG
>>108614433
>>108614444
yes
>>
>>
I can see that our friends on /lmg/ are also enjoying the Chinese culture kek
>>108614665
>>108614999
>>
>>108614478
I haven't used ltx for a while but it seems better, as in less jank. I know the limits of the model so I haven't tested any high speed chases or backflips, but it seems better
I'm not redoing my loras a third time though, I'll wait for ltx3
>>
>>108614995
>>108614326
are these just turning cosplay or fan art into real photos, or are they generated from scratch? because those are very character accurate
>>
File: fuck you.png (24.8 KB)
24.8 KB PNG
Is there a way to suppress these fucking error reports in Comfy? I pulled for the first time in a while, and now it's spamming this shit every time I use a bypass switch between txt2img and img2img in my workflow.
>0 errors
YEAH, NO SHIT YOU FUCKING FAGGOT
>>
File: _AnimaPreview3_00211_.jpg (311.9 KB)
311.9 KB JPG
>>
>>
>>108615180
I see, better keep some water close by, don't want anything catching on fire
>>108615218
So I should make flatties? Smh, I guess I better start to love flatties AAA cup
>>
>>108615255
I'm using Fast Groups Bypasser and it happens every time I switch. Not sure about any other switches. There's a few other retarded "errors" it reports that aren't errors at all as far as my workflows are concerned, but I did what the other anon said and blocked the element, so it's all good now.
>>
>>
File: _AnimaPreview3_00220_.jpg (443.7 KB)
443.7 KB JPG
>>
>>
>>
File: 1764055001256421.jpg (553.7 KB)
553.7 KB JPG
>>
File: _AnimaPreview3_00236_.jpg (379.1 KB)
379.1 KB JPG
>>108615327
It has cool characters and animation, but there is a certain slop look, can't deny. I wonder if it's because of the mix with 3d animation. It certainly has a different look with the 90's anime filter
>>
File: 1764828795097111.jpg (698 KB)
698 KB JPG
>>
File: 1747509564915808.jpg (1.6 MB)
1.6 MB JPG
>>
>>
>>
File: _AnimaPreview3_00260_.jpg (405.1 KB)
405.1 KB JPG
>>
>>
File: deSA_zi_00037_.png (2.8 MB)
2.8 MB PNG
>>108615503
>>
File: 996232.png (1.6 MB)
1.6 MB PNG
>>108615327
open up
>>
>>
>>
Fresh when ready
>>108615635
>>108615635
>>108615635
>>
>>108615519
>>108615305
>>108615223
Cool gens