Include Evals information
Browse files
README.md
CHANGED
|
@@ -25,6 +25,9 @@ All core training and inference code lives in this repository (see `train.py`, `
|
|
| 25 |
- **Objective:** Next-token prediction on web text (causal language modeling).
|
| 26 |
- **Use cases:** General text generation, experimentation, and as a base for future instruction-tuned models.
|
| 27 |
- **Status:** Undertrained research checkpoint – expect rough edges and occasional incoherence. I didn't stop training so more checkpoints will be published in the future.
|
|
|
|
|
|
|
|
|
|
| 28 |
|
| 29 |
I plan to continue training and to release instruction-tuned variants based on this model in the future.
|
| 30 |
|
|
|
|
| 25 |
- **Objective:** Next-token prediction on web text (causal language modeling).
|
| 26 |
- **Use cases:** General text generation, experimentation, and as a base for future instruction-tuned models.
|
| 27 |
- **Status:** Undertrained research checkpoint – expect rough edges and occasional incoherence. I didn't stop training so more checkpoints will be published in the future.
|
| 28 |
+
- **Evals:** 29.03% on MMLU
|
| 29 |
+
|
| 30 |
+
More detailed in EVALS.md
|
| 31 |
|
| 32 |
I plan to continue training and to release instruction-tuned variants based on this model in the future.
|
| 33 |
|