File size: 479 Bytes
14865a6 c75f322 14865a6 5650ee6 c75f322 5650ee6 c75f322 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 |
---
license: apache-2.0
---
# UNet with Sliding Window Attention
- 8ch latent by moving modules from WF-VAE to [NoobAI XL VAE](https://huggingface.co/Laxhar/noobai-XL-Vpred-1.0/tree/main/vae)
- supports recent long context CLIPs
- variable num_head in MHA across the layers
- both the UNet and the Autoencoder are written in vanilla PyTorch
The result is similar to what [Mitsua](https://huggingface.co/Mitsua/mitsua-likes) accomplished back then.
## References
- 2411.17459 |