File size: 490 Bytes

e2241c6

# LLAMIAFlux - Pretrained Model

This model was pretrained on the coyo-hd-11m-llavanext dataset to predict CLIP embeddings from text descriptions.

- Number of image generation heads: 4
- Training parameters:
  - Batch size: 32
  - Learning rate: 3e-05
  - Weight decay: 0.01
  - Epochs: 2
  - Model base: ssingh22/LLAMIAFlux-7b-unprojector-inverted
  - Trained on 9438926 image-caption pairs from coyo-hd-11m-llavanext
  - Trained on 9438926 image-caption pairs from coyo-hd-11m-llavanext