Thread #108558395
File: highlights_g_108553789_1775666285_1.jpg (3.4 MB)
3.4 MB JPG
Discussion and Development of Local Image and Video Models
Previous: >>108553789
https://rentry.org/ldg-lazy-getting-started-guide
>UI
ComfyUI: https://github.com/comfyanonymous/ComfyUI
SwarmUI: https://github.com/mcmonkeyprojects/SwarmUI
re/Forge/Classic/Neo: https://rentry.org/ldg-lazy-getting-started-guide#reforgeclassicneo
SD.Next: https://github.com/vladmandic/sdnext
Wan2GP: https://github.com/deepbeepmeep/Wan2GP
>Checkpoints, LoRAs, Upscalers, & Workflows
https://civitai.com
https://civitaiarchive.com/
https://openmodeldb.info
https://openart.ai/workflows
>Tuning
https://github.com/spacepxl/demystifying-sd-finetuning
https://github.com/ostris/ai-toolkit
https://github.com/Nerogar/OneTrainer
https://github.com/kohya-ss/musubi-tuner
https://github.com/tdrussell/diffusion-pipe
>Z
https://huggingface.co/Tongyi-MAI/Z-Image
https://huggingface.co/Tongyi-MAI/Z-Image-Turbo
>Anima
https://huggingface.co/circlestone-labs/Anima
https://tagexplorer.github.io/
>Qwen
https://huggingface.co/collections/Qwen/qwen-image
>Klein
https://huggingface.co/collections/black-forest-labs/flux2
>LTX-2
https://huggingface.co/Lightricks/LTX-2
>Wan
https://github.com/Wan-Video/Wan2.2
>Chroma
https://huggingface.co/lodestones/Chroma1-Base
https://rentry.org/mvu52t46
>Illustrious
https://rentry.org/comfyui_guide_1girl
>Misc
Local Model Meta: https://rentry.org/localmodelsmeta
Share Metadata: https://catbox.moe | https://litterbox.catbox.moe/
Img2Prompt: https://huggingface.co/spaces/fancyfeast/joy-caption-beta-one
Txt2Img Plugin: https://github.com/Acly/krita-ai-diffusion
Archive: https://rentry.org/sdg-link
Collage: https://rentry.org/ldgcollage
>Neighbors
>>>/aco/csdg
>>>/b/degen
>>>/r/realistic+parody
>>>/gif/vdg
>>>/d/ddg
>>>/e/edg
>>>/h/hdg
>>>/trash/slop
>>>/vt/vtai
>>>/u/udg
>Local Text
>>>/g/lmg
>Maintain Thread Quality
https://rentry.org/debo
https://rentry.org/animanon
301 RepliesView Thread
>>
File: ComfyUI_temp_iqyfq_00011_.png (1.4 MB)
1.4 MB PNG
>>
>>
File: 00010-742740945.jpg (1.7 MB)
1.7 MB JPG
>>
>>
>>
>>108558450
anifart (>>108558470) has used his proxies to nuke the previous thread
>>
File: ComfyUI_temp_iqyfq_00025_.png (830.3 KB)
830.3 KB PNG
anima is great
>>
>mfw Resource news
04/08/2026
>OrthoFuse: Training-free Riemannian Fusion of Orthogonal Style-Concept Adapters for Diffusion Models
https://github.com/ControlGenAI/OrthoFuse
>MIRAGE: Benchmarking and Aligning Multi-Instance Image Editing
https://github.com/ZiqianLiu666/MIRAGE
>Few-Shot Semantic Segmentation Meets SAM3
https://github.com/WongKinYiu/FSS-SAM3
>PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer
https://github.com/davidpicard/pom
>RS Nodes for ComfyUI: Cmprehensive custom node pack focused on LTXV audio-video generation, LoRA training and post-processing
https://github.com/richservo/rs-nodes
>FLUX.2 Small Decoder: Distilled VAE decoder for faster decoding and lower VRAM usage
https://huggingface.co/black-forest-labs/FLUX.2-small-decoder
>Nvidia snaps up AI chip packaging capacity as TSMC expands in U.S.
https://www.cnbc.com/2026/04/08/tsmc-nvidia-advanced-packaging-intel.h tml
04/07/2026
>Anima preview3 released
https://huggingface.co/circlestone-labs/Anima#preview3
>FrameFusion Image Interpolation: Compact image interpolation model for generating in-between frames
https://github.com/BurguerJohn/FrameFusion-Model
>An Inside Look at OpenAI and Anthropic’s Finances Ahead of Their IPOs
https://www.wsj.com/tech/ai/openai-anthropic-ipo-finances-04b3cfb9
>PrismML debuts energy-sipping 1-bit LLM in bid to free AI from the cloud
https://www.theregister.com/2026/04/04/prismml_1bit_llm
>ComfyUI Hires Fix Ultra - All in One
https://github.com/ThetaCursed/ComfyUI-HiresFix-Ultra-AllInOne
>ATSS: Detecting AI-Generated Videos via Anomalous Temporal Self-Similarity
https://github.com/hwang-cs-ime/ATSS
>1.x-Distill: Breaking the Diversity, Quality, and Efficiency Barrier in Distribution Matching Distillation
https://thu-accdiff.github.io/1.x-distill-page
>Your Pre-trained Diffusion Model Secretly Knows Restoration
https://sudraj2002.github.io/yptpage
>>
>mfw Research news
04/08/2026
>OmniCamera: A Unified Framework for Multi-task Video Generation with Arbitrary Camera Control
https://arxiv.org/abs/2604.06010
>Graph-PiT: Enhancing Structural Coherence in Part-Based Image Synthesis via Graph Priors
https://arxiv.org/abs/2604.06074
>Selective Aggregation of Attention Maps Improves Diffusion-Based Visual Interpretation
https://arxiv.org/abs/2604.05906
>Cross-Resolution Diffusion Models via Network Pruning
https://arxiv.org/abs/2604.05524
>Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family
https://arxiv.org/abs/2604.05971
>Evaluation of Randomization through Style Transfer for Enhanced Domain Generalization
https://arxiv.org/abs/2604.05616
>ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference
https://arxiv.org/abs/2604.05601
>Improving Controllable Generation: Faster Training and Better Performance via $x_0$-Supervision
https://arxiv.org/abs/2604.05761
>Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models
https://arxiv.org/abs/2604.05853
>3DTurboQuant: Training-Free Near-Optimal Quantization for 3D Reconstruction Models
https://arxiv.org/abs/2604.05366
>Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models
https://arxiv.org/abs/2604.05497
>Learning What Matters: Dynamic Dimension Selection and Aggregation for Interpretable Vision-Language Reward Modeling
https://arxiv.org/abs/2604.05445
>On the Robustness of Diffusion-Based Image Compression to Bit-Flip Errors
https://arxiv.org/abs/2604.05743
>HaloProbe: Bayesian Detection and Mitigation of Object Hallucinations in Vision-Language Models
https://arxiv.org/abs/2604.06165
>Efficient Inference for Large Vision-Language Models: Bottlenecks, Techniques, and Prospects
https://arxiv.org/abs/2604.05546
>Beyond Semantics: Disentangling Information Scope in Sparse Autoencoders for CLIP
https://arxiv.org/abs/2604.05724
>>
Sorry, maybe I am a bit retarded, but can someone explain to me what /ldg/, a general that never cared about aesthetics, did to get a dev that is probably the second best thing to happen to anime since SDXL lurking here?
>>
>>
File: I wonder who's fudding Anima that hard...png (139.6 KB)
139.6 KB PNG
>>108558512
it's just one schizo hating on Anima because he didn't manage to get the Comfy fund, don't mind him
>>
>>
>>
>>
>>
>>
>>108557890
>a brown person having a meltdown over a new model.
something like him? >>108558556
yeah, makes sense
>>
>>
>>
>>
File: 1748263462595126.png (755.3 KB)
755.3 KB PNG
https://xcancel.com/mark_k/status/2040877193933283364
it's insane at text holy fuck
>>
>>
File: Z-image-Klein_00981_.jpg (1.6 MB)
1.6 MB JPG
>>
>>
>>
>>
>>108558634
They are just kissing up to tdrusell so he keeps lurking here and to make him think his model is actually relevant.
But I will say it again, /ldg/ does not deserve to have tdrusell lurking here when there are so many anime generals using his model and comparing different artist tags, p2 vs p3, etc.
I still do not understand why he lurks only here.
>>
>>
File: 1ed768cc8d8237cef12a784757dbe584.png (11.8 KB)
11.8 KB PNG
How can I increase the inputs for this node?
>>
>>
>>108558565
I also saw someone complaining of not getting good dicks on anima and idk, maybe im going blind but they seem ok to me
https://files.catbox.moe/jtqua1.png
https://files.catbox.moe/gab3uk.png
https://files.catbox.moe/qtk3dp.png
>>
>>
>>
>>
>>
>>
>>
>>
>>108558731
I've had that happen on other types of nodes, but I can't find one that does it for text concat.
>>108558734
Yeah feeding a list into another list works, but it's so messy.
>>108558758
I'll try with claude later.
>>
>>
>>
>>
>>
>>
File: 743b99c19a526b68b8c45533cd6aaa11.png (25.2 KB)
25.2 KB PNG
>>108558862
Oh it actually worked, had to manually hook the input/output inside the subgraph. can even rename the inputs which is exactly what I needed too. Neat.
>>
>>108558761
Nah, my message is very clear, I am not Ani.
I am convinced that any 4chan anime diffusion general can contribute much more if tdrusell lurks there instead of lurking here. /ldg/ does not care about aesthetics, /ldg/ is more about realism, nodes, and NSFW.
>>
>>
>>
>>
>>
>>
>>
File: ComfyUI_temp_lfdjc_00034_.png (2.5 MB)
2.5 MB PNG
Preview3 is pretty good at stylized fonts.
>>
>>108558966
Just my opinion, no namefags or ulterior motives. I never said Chroma dev or gens belong in some NSFW realism thread. My point is about anime, Anima, and where its dev should focus. I stay out of what I don't know. And this will be my last message today.
>>
File: deSA_zi_00039_.png (1.7 MB)
1.7 MB PNG
>>
File: fsfsdfsdfsdf.jpg (753.7 KB)
753.7 KB JPG
Feels like preview3 is doing much more realistic stuff.
>>
>>
File: _AnimaPreview3_00006_.jpg (475.9 KB)
475.9 KB JPG
>>108559097
yeah noticed same, small difference but it's there
>>
File: 00072-2000614192.png (1.1 MB)
1.1 MB PNG
>>
File: _AnimaPreview3_00010_.jpg (428.7 KB)
428.7 KB JPG
>>
File: deSA_zi_00043_.png (1.7 MB)
1.7 MB PNG
>>108559127
action scenes have always been my holy grail but models don't really tend to 'get' it. z-image has some hit-or-miss success, but is still pretty biased towards static, centered shots
>>
File: deSA_zi_00040_.png (1.7 MB)
1.7 MB PNG
>>108559215
even this series is mostly "action about to happen" rather than "action happening
>>
>>
File: deCC_zi_00052_.png (2.6 MB)
2.6 MB PNG
>>108559257
no but thats a fun idea. I can tie together the kung fu prompt with this wuxia stuff I'd been doing and maybe get some cool sword-fighting stuff. might try that later
>>
>>
File: _AnimaPreview3_00025_.jpg (286.5 KB)
286.5 KB JPG
>>
>>
File: _AnimaPreview3_00029_.jpg (321.9 KB)
321.9 KB JPG
>>
>>108559096
>>108559253
>>108559292
stop talking to yourself schizo and go back to your /sdg/ containment board
>>
>>
File: ComfyUI_temp_iqyfq_00037_.png (1.7 MB)
1.7 MB PNG
>>
File: _AnimaPreview3_00034_.jpg (341.5 KB)
341.5 KB JPG
>>
File: ComfyUI_temp_iqyfq_00039_.png (1.5 MB)
1.5 MB PNG
>>
File: ComfyUI_temp_iqyfq_00043_.png (1.6 MB)
1.6 MB PNG
>>
>>
File: _AnimaPreview3_00040_.jpg (279.5 KB)
279.5 KB JPG
>>
>>108559382
>>108559409
get the fuck out you're not welcome here
https://rentry.org/debo
>>
File: ComfyUI_temp_iqyfq_00046_.png (1.5 MB)
1.5 MB PNG
>>
>>
>>
>>108559447
avatarfaggots like him are welcome nowhere, it's against the rules, he's lucky /sdg/ is not enforcing that, and that's the reason why this general died in the first place, you give the avatarfags an inch, they'll take the arm and will make it all about themselves, that won't be repeated on /ldg/
>>
>>
>>
>>108559494
shut the fuck up anifart
https://rentry.org/animanon
>>
>>
File: o_00010_.png (1.5 MB)
1.5 MB PNG
ldg is for everyfren
>>
>>
File: 1769432142254647.jpg (1.7 MB)
1.7 MB JPG
>>108559494
>your personal grudges
don't pretend you like debo, you hate him, it's been documented
>>
>>
File: _AnimaPreview3_00054_.jpg (388.8 KB)
388.8 KB JPG
>>
>>108559521
>why shouldn't we let avatarfags break the avatarfagging rules?
because /sdg/ died because of that, duh, that rule exists for a reason, this isn't discord faggot, if you want to have an avatar, go back where you belong
>>
File: deMS_zi_00028_.png (3.6 MB)
3.6 MB PNG
>>108559513
based
and fun gen
>>
>>
>>
>>
>>
>>
File: o_00013_.png (1.7 MB)
1.7 MB PNG
slipped on a banana
>>
File: _AnimaPreview3_00064_.jpg (309.5 KB)
309.5 KB JPG
>>
File: _AnimaPreview3_00056_.jpg (320.6 KB)
320.6 KB JPG
I got banned for sharing my opinion
>>
File: _AnimaPreview3_00070_.jpg (442.4 KB)
442.4 KB JPG
>>
>>
File: _AnimaPreview3_00076_.jpg (354.9 KB)
354.9 KB JPG
>>
>>
File: _AnimaPreview3_00091_.jpg (411.1 KB)
411.1 KB JPG
>>
File: o_00020_.png (1.7 MB)
1.7 MB PNG
nuh uh
>>
File: 1745444981367775.jpg (639.9 KB)
639.9 KB JPG
>>
>>
File: _AnimaPreview3_00128_.jpg (454.8 KB)
454.8 KB JPG
>>
File: _AnimaPreview3_00143_.jpg (453.3 KB)
453.3 KB JPG
>>
File: deMS_zi_00030_.png (3.7 MB)
3.7 MB PNG
gobin
>>
>>
File: o_00026_.png (1.6 MB)
1.6 MB PNG
>>
File: 1767584649637238.jpg (648.6 KB)
648.6 KB JPG
>>
>>
>>
>>
File: o_00029_.png (1.1 MB)
1.1 MB PNG
>>
File: Screenshot 2026-04-09 at 00.15.30.png (743.6 KB)
743.6 KB PNG
>>108558395
The Anima character lora turned out a lot better than i expected, especially at only 500 steps :D
https://files.catbox.moe/4c1j9b.png
https://files.catbox.moe/etbd6t.png
>>
>>
>>108560750
>i am using ai to convert them into drawings of 2d women.
Can you do that to my MILF gen here?
>>108560722
>>
>>
>>
>>
>>
>>
>>
File: ComfyUI_20261.png (2.6 MB)
2.6 MB PNG
>>108560789
That's what she's crying about!
>>
>>
>>
>>
>>
File: o_00039_.png (1.2 MB)
1.2 MB PNG
>>
https://www.youtube.com/watch?v=Y-i52Dgb8vU&list=RDY-i52Dgb8vU
>>
>>
>>
>>
>>
>>
>>
>>
File: comfyui.png (68.5 KB)
68.5 KB PNG
Reposting from another board.
>Install comfyui
>Install some nodes
>Everything works fast and quick.
>Do a clean OS install and do a clean comfyui install
>UI changes no one asked for were added.
>Some nodes stop working.
>The UI update broke 3rd party nodes.
>Manage to find workarounds
>3 months later
>Disk where comfyui is dies, have to reinstall comfyui in a new partition
>There is another new update
>MORE UI CHANGES NO ONE ASKED FOR
>Half custom nodes i had as backup now don't work.
>Custom node devs seem to be tired of this bullshit UI upgrade breaking everything and stopped maintaining their nodes.
>Decide to go back a few versions
>Newest custom node manager demands newest comfy to work
This is some windows 11 pajeet-tier coding nonsense.
Comfyui has become a joke.
Which UI you use to generate images anons?
I moved away from Forge/A1111 forks because they are bloated and load/unload/move checkpoints in EVERY image generation, slowing my PC. Comfyui kept checkpoints loaded, with fast generations but it has become unusable.
>>
>>
>>
>>108561184
don't bother posting it here. you'll get swarmed by resident comfyorg shills calling you "ani", schizo and bunch of other bs
you literally can't post criticism of comfy or anima here without causing almost immediate wave of angry replies
>>
>>
>>
>>
>>
>>
>>
>>
>>108561184
forge shouldn't unload your model after a generation, config error. I am stuck on comfy, switched away from reforge ages ago and I cannot go back. haven't updated in 2 months or so. and yeah comfyui is a hot mess, comical incompetence. the guy coding the UI related stuff is either immensely good at sucking comfys cock or payed by a future competitor to nuke comfys reputation. I outsourced everything (entire models folder, output, wildcards, etc), could basically nuke the comfy install and not loose anything. there are ways to roll back shit tho, this guy here talks about it https://www.reddit.com/r/comfyui/comments/1q3k5q6/how_to_solve_everyth ing_forever_broken/
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: o_00043_.png (1.6 MB)
1.6 MB PNG
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
File: 1752719746089035.jpg (1.1 MB)
1.1 MB JPG
>>
File: 1764092777583517.jpg (1.3 MB)
1.3 MB JPG
>>
>>
>>
File: o_00046_.png (1.7 MB)
1.7 MB PNG
>>
>>
>>108561184
https://github.com/Haoming02/sd-webui-forge-classic this solves your problems
>>
File: 1771702664488821.png (3.2 MB)
3.2 MB PNG
>>
>>
>>
>>
File: o_00048_.png (1.4 MB)
1.4 MB PNG
>>
What methods are you guys using to generate more than 1 character in a pic?
I wanna generate some fighting scenes but whenever I use two loras for two different chars simultaneously, everything looks smudged and fucked up.
>>
>>
>>
>>
>>108561469
>>108561651
You must have a fulfilling personal life
>>
>>
>>
>>
>>
>>
>>
>>
>>108561911
You can tell it's good just by putting it side by side with any SDXL model using the comparison workflow on the huggingface page. Anima wins on like 99% of prompts. Literally the only reason to use SDXL at this point is if you have some special snowflake shitmerge / artist mix / lora mix that gives you aesthetics that Anima can't quite do out of the box.
>>
>>108558395
>>108555778
Is the jenny lora available anywhere?
>>
>>
File: ComfyUI_20327.png (1.5 MB)
1.5 MB PNG
>>108561184
There's really only Comfy if you want to do anything off the beaten path or experiment.
I just recently recovered from catastrophic update too. I had to nuke everything and reinstall a bunch of stuff and re-add them to PATH to get it back. I still can't interrupt the queue or change the prompt without manually clearing models/cache though, because the next gen will 100% cause Comfy to not clear any memory and start taking all other available memory, but at least it actually works again. Of course there are no error messages or anything showing this behavior, it just something it decided to do several updates ago, I guess.
I thought it was the OC'd memory on my 4090 at first, but even at stock the behavior was the same. Plus, that OC has never caused problems anywhere else before... my GPU has top-binned components designed for overclocking and a beefy cooler (Asus ROG Strix OC).
>>
>>
>>
>>108561912
>>108561579
What model? If it's sdxl then regional/masked prompting is your only option. On newer models you can just prompt it, but you still have to pray. Or gen both characters separately and use edit model to compose the result.
>>
File: 18793.png (741.5 KB)
741.5 KB PNG
>>108561713
anima with meme lora i made https://litter.catbox.moe/eu47xhmgqkae1505.safetensors
masterpiece, best quality, 2boys, yaoi, kissing
>>
File: 72575725725752.jpg (2.3 MB)
2.3 MB JPG
>>
File: 454564512121.png (117.5 KB)
117.5 KB PNG
>>108558395
>Nobody here talks about ACEStep 1.5 XL which just dropped
https://ace-step.github.io/ace-step-v1.5.github.io/#XLDemos
It's a different class of model bros, I'm not hearing any slop...
>>
>>
File: 671643621.jpg (1.8 MB)
1.8 MB JPG
>>
>>
>>108562337
>>108558395
>Discussion and Development of Local Image and Video Models
>>
>>
>>
>>
>>
>>
>>108562355
>childrens song is terribly synthetic
To be fair, I think it's a very good sign of diverse audio quality in dataset. Top music models also vary this depending on the nature of your prompt, E.G. the top model here
https://artificialanalysis.ai/music/leaderboard/instrumental
>>
>>
>>108562355
>https://x.com/ostrisai/status/2041926198599807079
dunno bro, loras seem like they will be hot
>>
>>
>>
File: 1755030729274033.png (1.5 MB)
1.5 MB PNG
>>108562304
neat
>>
>>108560916
>>108562034
based jenner
>>
>>108562504
>loras seem like they will be hot
I'm listing to sample songs bro, it's SOTA cloudshit level....
>Indie folk ballad with astronaut transmission aesthetics, gentle fingerpicked acoustic guitar and soft banjo arpeggios at 78 BPM in 3/4 waltz time, harmonica plays lonesome melodic phrases with a Neil Young-style reedy nasal tone, male vocal in a tired warm baritone with slight vocal fry and crackin...
from the demo sounds like a radio song. This is what the raw model can do without a LoRA... Local is back
>>
>>108562543
Of course, since original ACEStep was already really good with LoRAs (but still quite lackluster in musicality with some complex genres), this should be able to fully shut the gap now. Udio/Suno have not as much moat anymore. Can't wait to play around with this model.
>>
>>108562337
I don't really notice that much improvement over the last one. That doesn't mean it's bad by any means though I've enjoyed a lot of the songs it's made for me.
https://voca.ro/18LjOD4lMxxA
[Verse]
In a basement lit by a flickering screen
The saddest little losers that you’ve ever seen
Bluvoll is plotting a digital crime
Spending eight grand on a waste of time!
They cry "memory loss!" and "it's untrainable junk!',
While their own broken model is totally sunk
They brag about Civitai, acting so proud
Of a fake reputation in a pathetic crowd.
[Bridge]
Eight thousand dollars... what a joke.
[Chorus]
They’re clinging to wreckage, to a dying flame
While tdrussel is winning the digital game!
Backed by Comfy Org, he’s breaking the mold
While they’re stuck in the dirt with their Mugen of old!
Yeah, they’re chasing shadows, lost in the fray
While the real Chad is driving his Lambo away!
>>
>>108561921
I'll try that, thanks.
>>108562206
Illustrious models. I'm using Krita AI by the way, which has regional prompting. I tried it but whenever I load a different Lora for each region (representing each character) the results still look bad, both characters look completely different or even deformed.
My workaround so far has been inpainting, guess pure txt2img isn't quite there.
>>
>>
>>
>>108562580
The last one really wasn't as musical. It sounded more synthetic, less capable, and unable to properly mimic a bunch of music styles you prompted for without a LoRA (which was very challenging to train prior to later SideStep versions). Now this one is more diverse, able to mimic every music style much better, and the voices sound slop-free compared to that last one.
>>
File: ComfyUI_00429_.png (3.3 MB)
3.3 MB PNG
>>
>>108562613
The instruments are also much better aligned with the lyrics. Previously, you could achieve improvements with a LoRA, though it was still quite limited to the music within the LoRA, and now it's good out of the box. A better base model always leads to better LoRAs.
>>
>>108562593
there is the official comfy lora scheduler shit, https://blog.comfy.org/p/masking-and-scheduling-lora-and-model-weights
you could also try https://github.com/yaoliliu/FreeFuse
i fucked around with it a bit and it's pretty decent for an automated setup.
>>
>>
>>108562337
I was going to say "wow these are good" then I got to
>An instrumental orchestral piece built on a foundation of a powerful, sustained string section and a precise, grand orchestral percussion rhythm. Layered brass swells create a vast, cinematic backdrop, while a lyrical solo violin carries the main melodic theme. The arrangement evolves with the intro...
What the fuck this is awful. I guess orchestral slop is not something it was trained on.
>>
>>
>>108562337
>>108562678
I'm so confused. That opera sample, the singing sounds SO GOOD. Why do the instruments sound so bad? Did they train only on acappella opera or something? The instruments sound straight up MIDI while the singer is like a real opera singer. It's bizarre.
>>
>>
>>108562718
>Why do the instruments sound so bad?
I don't think it's bad at all. The instrumentation has been improved substantially so that it's much more diverse, but the actual sound quality could use some mastering or LoRA. But the songs themselves are coherent and well structured.
>>
>>
>>
File: Anima3_00102_.png (1.6 MB)
1.6 MB PNG
>>
File: SDG_News_00083_.png (2.6 MB)
2.6 MB PNG
>>108562820
yay fireworks :D
>>
>>
>>
>>
>>
>>
>>
>>108562939
preview2 was alright i guess, not tried 3
>>108562943
>filtered by civijeet
oof
>>
>>108562307
>>108562346
boxes please?
>>
>>108562820
>>108562835
wtf, now youre talking
>>
>>
I have been completely mindraped by the Anna Khachiyan tweet about the true white-brown distinction being pink or brown nipples. I can't gen clothed Asian women anymore, I need to see the nipples and they need to be pink.
>>
>>
>>
File: _AnimaPreview3_00058_.jpg (297.6 KB)
297.6 KB JPG
>>108562939
>what's anima like when it comes to realistic nsfw stuff?
need loras for realism
>>
File: 1751990492581686.png (198.3 KB)
198.3 KB PNG
can anima do gore?
>>
File: 1701270638937.png (1.2 MB)
1.2 MB PNG
I dunno where else to ask, but what are the best open source options for voice AI? Elevenlabs is good but I can't afford it
>>
>>
>>
>>
>>
File: ComfyUI_00457_.png (3.1 MB)
3.1 MB PNG
>>108563095
just do a normal upscale pass
>>
>>
>>
>>108563051
Vibe voice. It's quite good and can do emotion but you have to sort of prime it like give it some lines to guide it towards a certain mood like anger or whatever. It's trained on podcasts so sometimes you get gens with background music and shit. It can be a little flat but it's by far the best open source one that doesn't either require training loras or doing your own voice acting.
https://voca.ro/1iZ3b4arXaUS
There's background music in this one because my source clip has it.
>>
>>
>>
>>
>>
>>
>>
File: 1756387836871708.jpg (731.7 KB)
731.7 KB JPG
>>
File: Klein-9b_00001_.png (1.1 MB)
1.1 MB PNG
>>108562991
>Anna Khachiyan
>>
>>
>>
File: 1775556594418913.jpg (608.2 KB)
608.2 KB JPG
>>
File: 1767990539084180.jpg (716.1 KB)
716.1 KB JPG
>>
>>
>>
File: 1753840467791026.jpg (673.3 KB)
673.3 KB JPG
>>
>>
File: 1756298982597284.jpg (613.1 KB)
613.1 KB JPG
tfw no elf gf
>>
File: 1759093990729118.jpg (715.2 KB)
715.2 KB JPG
and to end it, my zitslop wife
>>
>>
>>
>>
File: 1760577742516602.png (3.6 MB)
3.6 MB PNG
>>108563435
ty anon https://litter.catbox.moe/r0gmx2u102vy5erl.png
>>
File: 883683683.jpg (2.1 MB)
2.1 MB JPG
>>108563391
please don't impersonate me...
>>
>>
Fresh when ready
>>108563476
>>108563476
>>108563476