File size: 479 Bytes
14865a6
 
 
 
 
 
c75f322
14865a6
 
 
5650ee6
c75f322
 
5650ee6
 
c75f322
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
license: apache-2.0
---

# UNet with Sliding Window Attention

- 8ch latent by moving modules from WF-VAE to [NoobAI XL VAE](https://huggingface.co/Laxhar/noobai-XL-Vpred-1.0/tree/main/vae)
- supports recent long context CLIPs
- variable num_head in MHA across the layers
- both the UNet and the Autoencoder are written in vanilla PyTorch

The result is similar to what [Mitsua](https://huggingface.co/Mitsua/mitsua-likes) accomplished back then.

## References

- 2411.17459