NoManDeRY
/

DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0 / tokenizer.json

NoManDeRY's picture

Upload folder using huggingface_hub

3f33393 verified 9 months ago

history contribute delete

7.03 MB

File too large to display, you can check the raw version instead.