Thread #108629083
File: highlights_g_108615635_1776506249_1.jpg (2.8 MB)
2.8 MB JPG
Discussion and Development of Local Image and Video Models
Previous: >>108615635
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
338 RepliesView Thread
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: Chroma1-HD-Flash.safetensors_00006_.png (1.7 MB)
1.7 MB PNG
>>108629377
filename, 10 steps, sa_solver_pece/beta
>>
File: 642480616883123.png (2.5 MB)
2.5 MB PNG
>>108629255
It can.
https://files.catbox.moe/04xafw.mp4
>>
>>
>>
File: 1755454056575276.png (1.7 MB)
1.7 MB PNG
>>108629567
you can do realism with Z-image turbo
>>
>>
File: 1763495017795822.jpg (1.9 MB)
1.9 MB JPG
Can someone fix this?
>>
>>
>>
>>
>>
File: Wan22_SVI_Pro_Low_Fps_00093.mp4 (2.2 MB)
2.2 MB MP4
Next challenge, getting svi2pro to work my latent upscaling. Current issue, color match nodes doesn't work with default workflow.
>>
>>
>>
>>
File: lawl.jpg (1.1 MB)
1.1 MB JPG
>>108629692
Just a coincidence, move along, nothing to see here.
>>
>>
File: 1769647643872557.png (2.9 MB)
2.9 MB PNG
>>108629752
NBP is shit on making mangas though, it's always that one style, if only Ernie managed to copy NBP's realism we would be impressed, but that didn't happen, it's as plastic as your regular Klein slop
>>
>>108629717
Ernie is impressive but without the base knowledge of NBP, especially with upcoming GPT Image 2, it feels like a retarded amputated/ripoff version of it. Even if they are similar in prompt following, it still leaves so much to be desired.
>>
File: FUCK YEAH.png (101.8 KB)
101.8 KB PNG
>>108629780
>it feels like a retarded amputated/ripoff version of it.
chinks can only copy their masters (the white engineers in america)
>>
>>
File: 726341099955451.png (1.5 MB)
1.5 MB PNG
>>
>>
File: 1093935322028249.png (2.3 MB)
2.3 MB PNG
>>
File: ComfyUI_21676.png (2.4 MB)
2.4 MB PNG
>>108629575
You got a favorite "realism" string for Z?
>A gritty 1980s VHS screen capture
I like this one because it can really cut down on the bright/flat AI look (that year is optional, but it's great if you want more classical-looking hairstyles).
>>
>>108630423
For that image I went for this
>A candid image taken using a disposable camera. The image has a vintage 90s aesthetic, grainy with minor blurring. Colors appear slightly muted or overexposed in some areas.
>>
File: 1776524661026.jpg (272 KB)
272 KB JPG
>>108630423
jebby sexo
>>
File: 531091743399603.png (570.5 KB)
570.5 KB PNG
>>
File: 263704210141818.jpg (2.9 MB)
2.9 MB JPG
>>
So what's the purpose of image posting without metadata and/or tech discussion? Is this a slop dump general now? People hise their data/workflows and just silently post their gens to a /g/ general. Literally fuck off to discord with this shit.
>>
>>
>>
>>108630729
>not letting you scrape my shit
ironic, you're using models that were trained from billions of scrapped images, without the Artist's permission, an AI bro can't moralfag on this, that's why I don't do this, and you shouldn't too
>>
>>
File: ernie-res.png (51.2 KB)
51.2 KB PNG
>>108629083
I don't think Ernie was trained outside of the recommended resolutions. In my experience, resizing the dataset to those dimensions has allowed it to train at much higher learning rates without exploding losses.
Also the ComfyUI LoRA trainer is pretty decent once you hack in weight_decay and a proper optimizer like CAME. You also get day 0 support for LoRA training and the --fast/sigattn speed-ups apply to training. The lack of eval and live loss charts is somewhat disappointing.
>>
>>
>>
>>
>>
>>108630724
>>108630741
I agree with you, anime posters shouldn’t be posting only images in this thread, because that’s what the anime generals like /adt/, /hgg/, and /edg/ are for.
If anime posters like >>108630702 are going to come here just to spam anime, I think it would be better for them to stick to their dedicated anime generals, since that’s what those threads are for.
>>
>>
>>108629377
>>108629424
Isn't it crazy that there hasn't been a realistic original model or proper finetune of 3dpd even though it has the most freely and widely available data on the internet? 3d must truly be PD.
>>
>>
File: ComfyUI_00032_.png (1.2 MB)
1.2 MB PNG
>>108629083
no matter what I do my comfy won't stop generating solid black images as outputs and shits the bed when using flux
it worked properly before I installed another software but I can't uninstall it either because I need it
What should I do?
Should I clean reinstall everything?
>>
>>
>>
>>
>>
>>
>>108631113
Run git fsck to see if your comfy install is corrupted.
Then delete venv, recreate another one. activate it and proceed to do:
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu130
pip install -r requirements.txt
>>
>>
>>
>>
File: ComfyUI_09846_.png (735.3 KB)
735.3 KB PNG
>>
>>
>>
>>
>>
File: _AnimaPreview3_00555_.jpg (504.3 KB)
504.3 KB JPG
>>108631131
euler a / simple, bump up cfg and steps if you dont get details. er_sde / beta also works, but is move sensitive to cfg and steps. I recommend using "CFG rescale" node with Anima with 0.7-1 str for pushing in details which works with text
>>
>>
>>
>>
>>
>>
File: _AnimaPreview3_00572_.jpg (464.3 KB)
464.3 KB JPG
>>
>>
>>
>>
>>
File: _AnimaPreview3_00594_.jpg (517.4 KB)
517.4 KB JPG
>>
Bros how the fuck do I crop + upscale to inpaint in SwarmUI? I thought that Mask Shrink Grow did exactly that, but if it's unable to see the rest of the image then it struggles to make sense of the prompt and to maintain a coherent artstyle. Plus I don't really notice a much higher res
>>
>>
>>108631984
I knew this post was gonna happen
Look, I'm retarded and new, I can't really help it; I'm slowly but surely trying to improve and at least I've gotten passable enough results
One day I'll face the music and deal with the spaghetti, but I'd rather have a really solid grasp of the fundamentals first
>>
>>
>>
>>
>>
>>
>>
runpod 5090, flux2_klein_9b_diffusers
did some automated benchmarking to optimize gen speeds, here's the best config I found so far:
- enable_partial_loading: true
- keep_ram_copy_of_weights: true
- max_cache_ram_gb: 40
- pytorch_cuda_alloc_conf: backend:cudaMallocAsync
cold gen after startup: 9.620s
warm gen: 3.263s
note: this is with the venv and model on the container disk, NOT network storage which will significantly degrade performance
>>
>>
>>
>>108632025
It's straightforward enough in the sense that having stuff hidden away behind drop-down menus makes it easier for me to have a "full view" of everything that's possible
I actually first started out with Cumfy but my issue with it was that, outside of importing cards, you just HAD to know which node to put where to achieve what; with Swarm I can clearly see there's a giant SAMPLING menu, that I can read up on and get a full grasp of so I can make conscious decisions about it.
Cards or premade workflows don't work for me because they either completely overwhelm me or "make me lazy": if the original author set this one value to 6.8, he probably had a good reason for it, why bother learning what it does? Why test out tweaking this one value when there's hundreds of other values I could be looking at instead?
It's simply not compatible with my small brain's way of learning things tbdesudesune
>>
>>108632090
>>108632108
>>forgot to enable checkpoints
checkpoints are how you resume
>>
>>108631591
>>108631669
Thanks. Is there a reason to pick simple over sgm_uniform? It sometimes feels like the difference between the two is completely random
>>
>>
>>
>>
>>
>>
>>108632135 (You)
nevermind im retarded
>>108632108
diffusion-pipe. i was pointing it to the incorrect dir whoopsie
>>
>>
>>
>>
File: 1756583897350100.jpg (590.5 KB)
590.5 KB JPG
>>108632381
>>
>>
>>
>>
>>
>>
File: 1756300627343632.png (900.1 KB)
900.1 KB PNG
>>
>>
>>
>>
>>
File: 00177-855147302.jpg (140.1 KB)
140.1 KB JPG
>Picrel
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: file.png (46.2 KB)
46.2 KB PNG
>>108633080
>>
File: 1768297269001083.png (914 KB)
914 KB PNG
>>
>>
>>
>>
>>
File: 1745819760910980.png (1.5 MB)
1.5 MB PNG
>>
>>
>>108633116
As a 24GB VRAMlet I wish death upon your 12VHPWR
Also i disagree, Z-image is great and contrary to some of what I've heard here base trains pretty well if you avoid a few pitfalls so i think we will get good full finetunes
>>
File: 1773227092012837.png (1013.9 KB)
1013.9 KB PNG
>>
File: 1771322180569205.png (966.2 KB)
966.2 KB PNG
>>
>>
>>
File: 1753080575619783.jpg (350.9 KB)
350.9 KB JPG
>>
>>
File: aitoolkit.png (155 KB)
155 KB PNG
anima has some kind of cucked licence?
>>
>>
File: 1745101609187048.mp4 (1.1 MB)
1.1 MB MP4
>Replace the person's clothing with a dark blue hoodie and gray sweatpants.
>EditAnything IC-LoRA - LTX-2.3
https://www.reddit.com/r/StableDiffusion/comments/1sp03jq/editanything _iclora_ltx23/
Good shit.
>>
File: ComfyUI_temp_xkebm_00006_.jpg (1.1 MB)
1.1 MB JPG
>>
>>
File: 1757873873273978.png (27.5 KB)
27.5 KB PNG
>>108633727
lmao, it's over
>>
>>
>>
File: 1752029119829760.png (57.2 KB)
57.2 KB PNG
>>108633727
>>108633840
>>
File: 84673682872.jpg (527.8 KB)
527.8 KB JPG
>>
>>
>>
>>
>>108633854
https://github.com/ostris/ai-toolkit/issues/791
>>
>>
>>
>>
>>108633727
>trainer used by no one
>>108633903
>model used by no one
>>
>>
>>108629567
main thing here asuka and rei don't have an apparent reason to be bouncing
and in >>108620170 bottom left are bouncing too evenly at the same time
>>
File: 1772855046111202.png (85.8 KB)
85.8 KB PNG
>>108633903
>Kandinsky
those ruskovs had a very uncensored local video model, that can be good
>>
>>
File: Begone vramlets.png (96.8 KB)
96.8 KB PNG
>>108633315
>>108633116
vramlet yes
>>
>>
>>
>>
>>
>>108633969
https://huggingface.co/kandinskylab/KVAE-3D-2.0-t4s8/discussions/1
>>
>>
>>
File: 1763724111407354.png (1 MB)
1 MB PNG
>>108634017
damn, video VAEs are so fucking bad compared to image VAEs
>>
>>
>>108634017
https://github.com/kandinskylab/kandinsky-5
I completly forgot that we already had an image model from them, I guess it was so ass it was brushed it off quickly lool
>>
>>
>>
>>
>>
File: ComfyUI_21832.png (2.2 MB)
2.2 MB PNG
>>108633989
>14900KS
Why? You can clearly afford better.
>>
>>
>>
File: 1776572214.png (240.5 KB)
240.5 KB PNG
Who's going to code up the new UI?
>>
File: 1756350980549467.jpg (815 KB)
815 KB JPG
>>
>>
File: ComfyUI_temp_ckxth_00011_.png (2.3 MB)
2.3 MB PNG
>>
>>108635021
and localkeks will still pretend comfyui is a local-first ui. this is why local deserves nothing, they willingly shill for saas garbage and are too retarded to see when they're being blatantly manipulated.
>>
>>
File: ComfyUI_21929.png (2.2 MB)
2.2 MB PNG
>>108635021
How do you gut a GitHub project for profit and leave someone else holding the bag?
>>
>>
File: ComfyUI_temp_ckxth_00022_.png (2.2 MB)
2.2 MB PNG
>>
>>108635103
more common than you might think.
some people spend 10+ hours a day in a general and anons notice their posting style.
anons start to make fun of them, the more they react to it the more anons make fun of them.
after awhile they usually end up having a meltdown over something and accidentally overexpose themselves, then anons tag them with whatever they let split, 35 stars, a rank in a video game, etc.
at that point they usually start to double down or samefag anytime they get made fun of.
>he hasn't posted here in ages. you guys are schizos.
naturally that just makes it worse.
i would wager half the generals on /vg/ have an "ani" and a "35 star status" trigger word.
>>
File: 874232661242573.jpg (2.6 MB)
2.6 MB JPG
>>108635040
neat
>>
File: ComfyUI_temp_ckxth_00025_.png (2.4 MB)
2.4 MB PNG
>>
File: ComfyUI_temp_ckxth_00026_.png (1.9 MB)
1.9 MB PNG
anima is great, is what chroma should've been
>>
File: ComfyUI_temp_ckxth_00028_.png (1.8 MB)
1.8 MB PNG
>>
File: ComfyUI_temp_ckxth_00030_.png (1.8 MB)
1.8 MB PNG
>>
File: ComfyUI_temp_ckxth_00031_.png (1.8 MB)
1.8 MB PNG
>>
>>108634458
kek, it works:
"give the man a top hat and monocle."
https://files.catbox.moe/z4ujre.mp4
>>
File: ComfyUI_temp_ckxth_00036_.png (2.3 MB)
2.3 MB PNG
>>
>>
File: ComfyUI_temp_ckxth_00044_.png (2.4 MB)
2.4 MB PNG
>>
File: 84237421012860.png (2.9 MB)
2.9 MB PNG
>>108635273
>Tsuki ni kawatte, oshioki yo!
>>
File: ComfyUI_temp_ckxth_00051_.png (2.2 MB)
2.2 MB PNG
>>
File: 1757982871460549.png (3.1 MB)
3.1 MB PNG
>>108635173
ty
>>
>>108635261
another test: replace the bikini of the woman with a black business suit.
now im impressed, we essentially have klein edit but for ltx video.
https://files.catbox.moe/p6txpc.mp4
>>
>>108635386
site wouldnt load it. litterbox works though.
https://litter.catbox.moe/e564dzhvwbjyr5t0.mp4
>>
>>108635390
looks ok, I think it could work better with masking, since LTX tends to ruin details on fast movement videos, I think you can even add a reference frame with LTX, at least I think I've seen makeshift workflows that do that
>>
File: 527140267727265.png (2.2 MB)
2.2 MB PNG
>>
>>
>>
>>108635468
and a bit closer to the end result: just disable the prompt enhancer shit in the workflow, not needed + wastes time for token generation.
https://litter.catbox.moe/xfh7y6xd5vqsekbq.mp4
>>
>>
>>
>>
>>
>>
>>108635520
>>108635486
neat anon, nice work
>>
>>
File: 264381399006590.png (2.2 MB)
2.2 MB PNG
>>108635495
You know I never really thought of her as Japanese, since the first time I was watching Sailor Moon I didn't even know what anime was.
>>
>>
>>
>>
File: 328286036458144.png (1.6 MB)
1.6 MB PNG
>>
>>
File: bbs-zit-2026-04-19_00014_.png (3.9 MB)
3.9 MB PNG
>>
>>
File: 305815193955334.png (2.4 MB)
2.4 MB PNG
>>
>>
>>
>>
>>108635772
https://huggingface.co/Alissonerdx/LTX-LoRAs/blob/main/ltx23_edit_anyt hing_global_rank128_v1_9000steps_ad amw.safetensors
original post: https://www.reddit.com/r/StableDiffusion/comments/1sp03jq/editanything _iclora_ltx23/
>>
>>
>>
>>108635806
replace the man in the black mask with a large panda bear.
you're a big bear.
https://litter.catbox.moe/zglx45hvwtkqbdd2.mp4
>>
File: 5ca341a08fa39321244e7f71f515ae0b.png (400.3 KB)
400.3 KB PNG
>>108635810
Nice..
>>
File: 1765150153763874.png (133.8 KB)
133.8 KB PNG
>>108635881
use 2.3 distilled and these encoders, seems fine for me:
>>
>>108635886
this one:
https://huggingface.co/QuantStack/LTX-2.3-GGUF/tree/main/LTX-2.3-disti lled-1.1
>>
>>
>>
replace the clothes of the man in the black jacket with a black tuxedo, black top hat, and black moustache.
lmao, it seems to have issues with frequent cuts, but it did work.
https://litter.catbox.moe/xn35mp1ub2p74grz.mp4
>>
>>
>>
>>
File: bbs-zit-2026-04-19_00113_.png (3.4 MB)
3.4 MB PNG
>>
>>
>>
>>108636131
it's a lora that works with ltx 2.3 that makes it act like an edit model. pretty neat, testing it out now.
https://huggingface.co/Alissonerdx/LTX-LoRAs/blob/main/ltx23_edit_anyt hing_global_rank128_v1_9000steps_ad amw.safetensors
>>
>>
>>
>>
>>
>>
>>
>>
>>108636244
>>108636264
Thank you, I'm new to this, so looking up the guides, if you have any good links, I'd really appreciate it
>>
>>
>>108636295
depends how new and tech savvy you are.
personally with my rather modest GPU I use Forge UI (easy to use, check Youtube for a tuto), with an Illustrious or Pony base model (downloaded on civitai), and Controlnet Canny (which is an add-on to Forge UI).
>>
>>
>>
>>
>>
>>108636332
try this canny (works for illustrious), also you prob want openpose and depth as they are also useful.
https://civitai.com/models/941482/illustrious-xl-canny
I use reforge for my anime gens, very fast and controlnets are a couple clicks. sample output with this model:
https://civitai.com/models/1277670/janku-trained-chenkin-and-noobai-ro uwei-illustrious-xl?modelVersionId= 2786084
canny is good for getting 1:1 from the source, depth gives more flexibility, openpose for that pose and different lineart.
>>
>>
File: 1766768293774140.png (1.5 MB)
1.5 MB PNG
>>108636415
the controlnets are fine, and that model is good for animu (that and nova anime, and wainsfw)
>>
>>
File: 1753347940995539.png (3.7 MB)
3.7 MB PNG
>>108636427
and this is a test with canny (1 to 1 lineart)
if you want more variety, use controlnet depth or openpose by itself.
>>
>>
File: 1761436901764193.png (2.2 MB)
2.2 MB PNG
>>108636438
canny Miku with racing Marin as source:
controlnets are fun. can do all kinds of neat stuff with a reference. like flux klein edit.
>>
>>
>>
>>
>>
File: 1751690175852757.jpg (628.1 KB)
628.1 KB JPG
change the season to winter.
klein edit 9b distilled is so neat.
>>
>>
>>
File: 1747226350420228.png (2 MB)
2 MB PNG
>>108636513
fall:
>>
>>
File: 1756281715425404.png (1.8 MB)
1.8 MB PNG
remove all the buildings. and all the stone.
>>108636544
whats 4bturbo? ive only used klein edit and qwen edit
>>
File: 1766133922262777.png (1.7 MB)
1.7 MB PNG
replace the stone building on the right with a mcdonalds restaurant.
>>
>>
>>
>>
>>
File: 1762894608763473.png (3.6 MB)
3.6 MB PNG
>>
>>
>>
File: 0588893fa4605c0462443fced52a07d0.jpg (614.8 KB)
614.8 KB JPG
>one new workflow after another doesn't work
>get to one that works
>its kinda shit
It's just a revamped painting node which allows for masked image editing with klein.
Since you are inpainting, the context to the overall image is lost, so you have to expand the "mask" with some extra painting, but then it edits the entire image and ruins the purpose of the inpainting to begin with.
>>
>>
>>
File: 1046488701704735.png (1.4 MB)
1.4 MB PNG
>>
File: 1752896498623999.png (105 KB)
105 KB PNG
>>108635021
>Comfy is making money, the west has fallen
>>
File: 476595389420658.png (2.4 MB)
2.4 MB PNG
>>
File: file.png (741.7 KB)
741.7 KB PNG
https://www.youtube.com/watch?v=KIBf48Ih-7I
Livestream from ADOS, an open source AI art event featuring artists/developers from the ecosystem (CTO of LTX starting soon)
>>
>>
>>
>>
File: 362537891058033.png (2.4 MB)
2.4 MB PNG
>>
File: 1776578005689873.png (3.1 MB)
3.1 MB PNG
>>
>>
>>
File: 1750794921809932.png (3.6 MB)
3.6 MB PNG
>>
File: 1746049139541335.png (3.5 MB)
3.5 MB PNG
>>
File: ComfyUI_22000.png (2.5 MB)
2.5 MB PNG
>>108637111
I refuse to watch based solely on their font selection.
>>108637558
I like this one.
>>
>>
File: 1686797549887200.jpg (80.5 KB)
80.5 KB JPG
>>108637139
>You are cucks who listen to failed devs talk?
brutal
>>
>>
>>
File: 1719550287253246.png (2.2 MB)
2.2 MB PNG
>>108637633
>>108635096
>>108634531
based jenner
>>
>>
File: Screenshot 2026-04-19 170700.png (344.2 KB)
344.2 KB PNG
I think I figured this shitty workflow out.
The creator connected shit wrong?
>>
>>
>>108637988
RuntimeError: The size of tensor a (8008) must match the size of tensor b (1037952) at non-singleton dimension 2
Same fucking error in another ltx 2.3 workflow.
Fuck these gay ass nigger jeets making these god damn fucking workflows.
>>
File: 1747876829714076.jpg (918.8 KB)
918.8 KB JPG
babe wake up, ostris has another schizo moment
>>
>>108638173
forgot the link: https://xcancel.com/ostrisai/status/2045677110413668743#m
>>
>>
>>
>>
>>
File: ComfyUI_02843_.png (1.2 MB)
1.2 MB PNG
Anyone have some tips for the Anima to Z-image img2img? I'm finding on some stuff z-image wants to keep in anime :/ I've played with the denoise a bit
>>
>>
Any tips for generating multiple "sprites" of the same character? I generated an image of a character I like and I want to set him up in multiple poses with multiple facial expressions to be used with sillytavern. I don't even know where to start, I've just used A1111 and ComfyUI with PDXL, NovaFurry, Flux and Chroma. Haven't went much deeper than that.
>>
>>
>>
File: ComfyUI_00066_.png (1.1 MB)
1.1 MB PNG
>>108638822
Is there a chroma img to img workflow? I wasn't aware of one last I looked. I find Chroma text to img to be solid
>>
>>
>>
File: file.png (1.2 MB)
1.2 MB PNG
>>108638900
different anon here, I usually use Qwen Image Edit and it's good enough for 30 seconds.
>>
>>
>>
File: file.png (535.6 KB)
535.6 KB PNG
>>108639076
the prompt I used for the anime base image is from civ yes, specifically wai-anima
>>