raghavbali
/

gemma-3b-1b-unsloth-dpo

Generated from Trainer

Model card Files Files and versions

gemma-3b-1b-unsloth-dpo / tokenizer_config.json

raghavbali's picture

dpo training completed

4a616bd verified 4 months ago

history contribute delete

1.16 MB

File too large to display, you can check the raw version instead.