Discussion
Collection
13 items
โข
Updated
This model is a fine-tuned version of microsoft/phi-4 on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.6764 | 0.2235 | 10 | 2.4496 |
| 2.1053 | 0.4469 | 20 | 1.9257 |
| 1.222 | 0.6704 | 30 | 1.0594 |
| 0.1878 | 0.8939 | 40 | 0.1615 |
| 0.1642 | 1.1117 | 50 | 0.1395 |
| 0.1127 | 1.3352 | 60 | 0.1343 |
| 0.1483 | 1.5587 | 70 | 0.1332 |
| 0.1342 | 1.7821 | 80 | 0.1338 |
| 0.1529 | 2.0 | 90 | 0.1323 |
| 0.1327 | 2.2235 | 100 | 0.1289 |
| 0.095 | 2.4469 | 110 | 0.1286 |
| 0.1446 | 2.6704 | 120 | 0.1304 |
| 0.1631 | 2.8939 | 130 | 0.1265 |
Base model
microsoft/phi-4