Code for training
It's impressive how well the model responds after training on such a small amount of data. Could you share the code for training?
Hello @cybertruck32489 ,
You can use the notebook for the larger Qwen 3 8B model here; just change the model_name to Liontix/Qwen3-4B-Thinking-2507-Gemini-2.5-Pro-Distill.
I will upload the safetensors repo for that model shortly.
No, that's not what I mean. I'm talking about the code that allows you to achieve such changes in the model's responses with so few examples.
I used the same notebook, just with a standard Qwen3 safetensors base model. That Unsloth notebook handles the supervised fine-tuning and selects and loads the parameters that get updated during training. For a good fine-tuned model, I aim for at least 30 epochs on the chosen dataset. In my opinion, the dataset really makes the difference here, since it is very diverse considering its size.
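To put rough numbers on what a 30-epoch run means, here is a small sketch. The dataset size, batch size, and gradient-accumulation values below are illustrative assumptions, not the actual notebook settings:

```python
import math

def optimizer_steps(num_examples: int, epochs: int,
                    per_device_batch: int, grad_accum: int) -> int:
    """Total optimizer updates for a fine-tuning run."""
    effective_batch = per_device_batch * grad_accum
    steps_per_epoch = math.ceil(num_examples / effective_batch)
    return steps_per_epoch * epochs

# Illustrative numbers only: a small distillation set of 500 examples,
# per-device batch of 2 with 4-step gradient accumulation (effective batch 8).
print(optimizer_steps(500, 30, 2, 4))  # 63 steps/epoch * 30 epochs = 1890
```

With a dataset this small, 30 epochs is still only a couple of thousand optimizer steps, which is why the run stays cheap.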
Wow, I would suggest lowering the learning rate, to at least 2e-6. That way you don't overfit like that notebook does. If your training loss reaches 0.01, the model is overfit; it's better to stop around 0.3. What you're looking for in training is the point where the loss begins to flatline, here perhaps at 0.2.
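One simple way to spot that flatline programmatically is to check whether the loss has stopped moving over the last few logged values. This is a hypothetical helper for illustration, not something from the notebook, and the loss history below is made up:

```python
def has_plateaued(losses: list[float], window: int = 5, tol: float = 0.01) -> bool:
    """True once training loss has flatlined: the spread of the last
    `window` logged values is smaller than `tol`."""
    if len(losses) < window:
        return False
    recent = losses[-window:]
    return max(recent) - min(recent) < tol

# Made-up loss curve: drops quickly, then levels off near 0.2.
history = [1.1, 0.7, 0.45, 0.31, 0.24, 0.205, 0.202, 0.201, 0.2005, 0.200]
print(has_plateaued(history))  # True: the last 5 values span less than 0.01
```

Stopping at that plateau (rather than driving the loss toward 0.01) is the point being made above.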