Update README.md
Browse files
README.md
CHANGED
|
@@ -66,10 +66,10 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
|
| 66 |
## 📊 Training
|
| 67 |
- **Base:** GPT-OSS-20B (`unsloth/gpt-oss-20b`)
|
| 68 |
- **Dataset:** [AceReason-Math](https://huggingface.co/datasets/nvidia/AceReason-Math)
|
| 69 |
-
|
| 70 |
-
|
| 71 |
-
|
| 72 |
-
|
| 73 |
- **Method:** Parameter-Efficient Fine-Tuning (LoRA)
|
| 74 |
- **LoRA Config:** r=8, alpha=16, dropout=0
|
| 75 |
- **Merge:** `save_pretrained_merged` from Unsloth
|
|
|
|
| 66 |
## 📊 Training
|
| 67 |
- **Base:** GPT-OSS-20B (`unsloth/gpt-oss-20b`)
|
| 68 |
- **Dataset:** [AceReason-Math](https://huggingface.co/datasets/nvidia/AceReason-Math)
|
| 69 |
+
**Splits:**
|
| 70 |
+
**Train:** 40,163
|
| 71 |
+
**Validation:** 4,463
|
| 72 |
+
**Test:** 4,963
|
| 73 |
- **Method:** Parameter-Efficient Fine-Tuning (LoRA)
|
| 74 |
- **LoRA Config:** r=8, alpha=16, dropout=0
|
| 75 |
- **Merge:** `save_pretrained_merged` from Unsloth
|