ssingh22
/

LLAMIAFlux-7b-unprojector-pretrain-t2i-0_5

Model card Files Files and versions

LLAMIAFlux-7b-unprojector-pretrain-t2i-0_5 / README.md

ssingh22's picture

Upload README.md with huggingface_hub

e2241c6 verified 7 months ago

|

history blame contribute delete

490 Bytes

LLAMIAFlux - Pretrained Model

This model was pretrained on the coyo-hd-11m-llavanext dataset to predict CLIP embeddings from text descriptions.

Number of image generation heads: 4
Training parameters:
- Batch size: 32
- Learning rate: 3e-05
- Weight decay: 0.01
- Epochs: 2
- Model base: ssingh22/LLAMIAFlux-7b-unprojector-inverted
- Trained on 9438926 image-caption pairs from coyo-hd-11m-llavanext
- Trained on 9438926 image-caption pairs from coyo-hd-11m-llavanext