k050506koch committed on
Commit 0668470 · verified · 1 Parent(s): 9b5b518

Include Evals information

Files changed (1)
  1. README.md +3 -0
README.md CHANGED
@@ -25,6 +25,9 @@ All core training and inference code lives in this repository (see `train.py`, `
  - **Objective:** Next-token prediction on web text (causal language modeling).
  - **Use cases:** General text generation, experimentation, and as a base for future instruction-tuned models.
  - **Status:** Undertrained research checkpoint – expect rough edges and occasional incoherence. I didn't stop training so more checkpoints will be published in the future.
+ - **Evals:** 29.03% on MMLU
+
+ More detailed in EVALS.md
 
  I plan to continue training and to release instruction-tuned variants based on this model in the future.
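
Reviewer note on the added **Evals** line: MMLU is a four-option multiple-choice benchmark, so uniform random guessing scores about 25%, and the reported 29.03% is best read against that baseline. The sketch below shows how an accuracy percentage of this kind is typically computed. It is an illustration only: the function names and toy data are hypothetical, and the evaluation code actually used for this commit is not shown in the diff.

```python
# Hypothetical illustration: these names and the toy data are assumptions,
# not code or results from the repository.
MMLU_NUM_CHOICES = 4  # MMLU questions offer four answer options (A-D)


def accuracy_percent(predictions, answers):
    """Return the share of predictions matching the reference answers, in percent."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("need at least one question")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return 100.0 * correct / len(answers)


def chance_baseline_percent(num_choices=MMLU_NUM_CHOICES):
    """Expected accuracy of uniform random guessing, in percent."""
    return 100.0 / num_choices


# Toy data, made up for the example: two of the four predictions are correct.
toy_predictions = ["A", "C", "B", "D"]
toy_answers = ["A", "B", "B", "C"]
toy_score = accuracy_percent(toy_predictions, toy_answers)

# Margin of the score reported in the README over the random-guess baseline.
reported_mmlu = 29.03
margin = reported_mmlu - chance_baseline_percent()
```

Under these assumptions, `margin` comes out to roughly four percentage points above chance, which is consistent with the README's description of the model as an undertrained research checkpoint.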