I messed up on the previous model. This is a fixed version.

A tiny 1B model that refuses basically anything you ask it! Trained on the refusal dataset. Prompt format is ChatML.

Training results:

Training Loss	Epoch	Step	Validation Loss
2.4352	0.0580	1	2.4462
1.5741	0.5217	9	1.4304
1.5204	1.0435	18	1.3701
1.0794	1.5217	27	1.3505
1.1275	2.0435	36	1.3344
0.6652	2.5217	45	1.4360
0.6248	3.0435	54	1.4313
0.6142	3.5072	63	1.4934

Training hyperparemeters:

The following hyperparameters were used during training:

Model tree for mrfakename/refusal

Base model

Finetuned

(99)

this model

Quantizations

mrfakename
/

refusal