
Birchlabs's illustration roundup


ML Engineer at Anlatan (@novelaiofficial). co-author of HDiT (Hourglass Diffusion Transformers). works on diffusion models and LLMs. studying Japanese.
birchlabs.co.uk

Following: 191 Followers: 4507

ah, that was the parent commit. here's the real one (with ceiling).
unscale each latent channel (÷0.18215), center on mean. if any channel has a max exceeding 42: threshold at the 99.95th percentile.
so only sigmas 8.56, 4.80, 2.55 get thresholded. some clipping remains.
https://t.co/oiSBUldN7j
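A rough sketch of the recipe described above, in dependency-free Python on flat lists rather than tensors. This is not the code behind the link. The constants 0.18215, 42 and 99.95% come from the post; everything else is an assumption made for illustration: that "max" means the largest absolute centred value, that "threshold" means clamping to the percentile of absolute values, that every channel is clamped once any one of them trips the ceiling, and that the mean and scale are restored afterwards. The names `quantile` and `dynamic_threshold` are hypothetical.

```python
import random

SCALE = 0.18215       # latent scale factor, from the post
CEILING = 42.0        # per-channel max that triggers thresholding, from the post
PERCENTILE = 0.9995   # 99.95%, from the post

def quantile(values, q):
    """Linearly interpolated quantile of an iterable, with q in [0, 1]."""
    s = sorted(values)
    pos = q * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)

def dynamic_threshold(latents):
    """latents: list of channels, each a flat list of floats in scaled latent space."""
    # 1. unscale each channel and centre it on its own mean
    means, centred = [], []
    for ch in latents:
        unscaled = [v / SCALE for v in ch]
        mean = sum(unscaled) / len(unscaled)
        means.append(mean)
        centred.append([v - mean for v in unscaled])
    # 2. leave everything alone unless some channel exceeds the ceiling
    #    (assumption: measured on absolute centred values)
    if max(abs(v) for ch in centred for v in ch) <= CEILING:
        return latents
    # 3. clamp each channel to its own 99.95th percentile (assumption: clamp only,
    #    no rescale), then restore the mean and the latent scale
    out = []
    for ch, mean in zip(centred, means):
        s = quantile((abs(v) for v in ch), PERCENTILE)
        out.append([(max(-s, min(s, v)) + mean) * SCALE for v in ch])
    return out

# made-up data: four well-behaved channels, then a copy with one extreme value
random.seed(0)
calm = [[random.gauss(0, 5) * SCALE for _ in range(4096)] for _ in range(4)]
hot = [list(ch) for ch in calm]
hot[0][0] = 100 * SCALE
fixed = dynamic_threshold(hot)
```

With the ceiling check in place, `calm` passes through untouched and only `hot` is clamped, which matches the post's point that only a few sigmas end up thresholded.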


CFG20, told to get its 99.95%ile down to the same as CFG7.5's.
code here (see CFGDynTheshDenoiser):
https://t.co/1Zn1aefIFL
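One way to read "get its 99.95%ile down to the same as CFG7.5's", sketched in plain Python. This is not the linked CFGDynTheshDenoiser; it is a guess at the idea. Assumptions: the guided prediction is treated as a flat list of values, the percentile is taken over absolute deviations from the mean, and the CFG 20 result is rescaled about its mean (rather than clamped) until its percentile matches the CFG 7.5 result. The names `cfg`, `centred_quantile` and `cfg_matched` are hypothetical.

```python
import random

def cfg(uncond, cond, scale):
    """Classifier-free guidance: push the prediction away from uncond, towards cond."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

def centred_quantile(values, q):
    """Quantile (linear interpolation) of absolute deviations from the mean."""
    mean = sum(values) / len(values)
    s = sorted(abs(v - mean) for v in values)
    pos = q * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)

def cfg_matched(uncond, cond, scale=20.0, mimic_scale=7.5, q=0.9995):
    """Guide at `scale`, then shrink the dynamic range to what `mimic_scale` gives."""
    strong = cfg(uncond, cond, scale)
    target_q = centred_quantile(cfg(uncond, cond, mimic_scale), q)
    strong_q = centred_quantile(strong, q)
    if strong_q <= target_q:
        return strong                     # already within range, nothing to repair
    mean = sum(strong) / len(strong)
    ratio = target_q / strong_q
    # assumption: rescale about the mean so the percentiles line up
    return [mean + (v - mean) * ratio for v in strong]

# made-up predictions: cond differs slightly from uncond
random.seed(0)
uncond = [random.gauss(0, 1) for _ in range(4096)]
cond = [u + random.gauss(0, 0.1) for u in uncond]
out = cfg_matched(uncond, cond)
```

The direction of the CFG 20 prediction is kept; only its spread is pulled back to the range a CFG 7.5 prediction would have had.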


high CFG scale doesn't always give you stylish pop art. dynamic thresholding can hugely repair an image.


think I got dynamic thresholding working.
CFG scale 20. left is what you'd usually get.
salvaged the dynamic range by thresholding the latents on every sampling step.


fumo embedding works in waifu-diffusion too (no retraining required). WD forgot everything about photos or plushies, but it still understands "make it chibi, give it little T-rex arms and make the sleeves too big".
prompt coherence is way better too (I trained it on booru labels)


I tried to fix the hips but she turns into megaman.
sometimes you have to work within the constraints of what noise you get in the high sigmas of your seed! the Unet predicts that sigma 3.8167 denoises to a wide cloud of legs, so that's what you're up against.


this is with waifu-diffusion 1.3, final epoch.
if you get too many legs: try specifying what legwear was worn


trying out the new DPM-Solver samplers from k-diffusion 0.0.2+ (~1 week ago). outperforms Heun.
dpm_adaptive is great for producing final masters of images; you don't need to guess how many steps to use.
dpm_fast just gave me noise, so sadly can't try it out at low step counts.
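To illustrate why an adaptive sampler removes the step-count guess, here is a toy sketch. It is not DPM-Solver and not k-diffusion's implementation: it swaps in a basic Euler/Heun embedded pair with a simple step-size controller, on a made-up 1-D problem whose "denoiser" is known in closed form (data assumed to be Gaussian with std `S_DATA`). All names and tolerances here are hypothetical.

```python
import math

S_DATA = 0.5  # std of a toy 1-D Gaussian "dataset" (made up for this sketch)

def denoise(x, sigma):
    """Ideal denoiser when clean data ~ N(0, S_DATA^2)."""
    return x * S_DATA**2 / (S_DATA**2 + sigma**2)

def dx_dsigma(x, sigma):
    """Probability-flow ODE, written as dx/dsigma = (x - D(x, sigma)) / sigma."""
    return (x - denoise(x, sigma)) / sigma

def exact(x0, sigma_max, sigma_min):
    """Closed-form solution of the toy ODE, used only to measure error."""
    return x0 * math.sqrt((S_DATA**2 + sigma_min**2) / (S_DATA**2 + sigma_max**2))

def euler_fixed(x, sigma_max, sigma_min, n_steps):
    """Fixed step count: n_steps is chosen up front (log-spaced sigmas)."""
    sigmas = [sigma_max * (sigma_min / sigma_max) ** (i / n_steps)
              for i in range(n_steps + 1)]
    for s, s_next in zip(sigmas, sigmas[1:]):
        x += dx_dsigma(x, s) * (s_next - s)
    return x

def heun_adaptive(x, sigma_max, sigma_min, rtol=1e-3, atol=1e-4):
    """Adaptive steps: the gap between an Euler and a Heun step estimates the
    error, and the step size grows or shrinks to keep it under tolerance."""
    sigma = sigma_max
    h = (sigma_min - sigma_max) / 4      # negative: integrate towards low sigma
    accepted = 0
    while sigma > sigma_min:
        h = max(h, sigma_min - sigma)    # do not step past sigma_min
        k1 = dx_dsigma(x, sigma)
        x_euler = x + h * k1
        k2 = dx_dsigma(x_euler, sigma + h)
        x_heun = x + h * (k1 + k2) / 2
        err = abs(x_heun - x_euler) / (atol + rtol * max(abs(x), abs(x_heun)))
        if err <= 1.0:                   # accept the step, otherwise retry smaller
            x, sigma, accepted = x_heun, sigma + h, accepted + 1
        # shrink after a bad step, grow after a good one (factor kept in [0.2, 2])
        h *= 2.0 if err == 0 else min(2.0, max(0.2, 0.9 * err ** -0.5))
    return x, accepted

x0, s_max, s_min = 10.0, 10.0, 0.1
truth = exact(x0, s_max, s_min)
err_8 = abs(euler_fixed(x0, s_max, s_min, 8) - truth)
err_30 = abs(euler_fixed(x0, s_max, s_min, 30) - truth)
x_adaptive, n_accepted = heun_adaptive(x0, s_max, s_min)
err_adaptive = abs(x_adaptive - truth)
print(err_8, err_30, err_adaptive, n_accepted)
```

The fixed-step runs need a step count picked in advance; the adaptive run takes tolerances instead and decides its own step sizes. The comparison is not like-for-like (Heun is second-order, Euler first-order), so treat the error numbers as illustrative only.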


IIRC: left vaporwave girl was 8 sampler steps, right was 30. left has a plasticky, specular look which is interesting in its own way. I definitely recommend exploring low step counts. they're not as "true" as a converged result, but they're frequently interesting!


I also made one for epoch 5 but released epoch 6 faster than I could finish the animation
