Osaurus-AI commited on
Commit
325c336
·
verified ·
1 Parent(s): 9933416

Update Step JANG speed proof

Browse files
Files changed (1) hide show
  1. README.md +13 -2
README.md CHANGED
@@ -42,8 +42,19 @@ Text proof:
42
  "prompt_tokens": 26,
43
  "generated_tokens": 58,
44
  "prefill_s": 9.161997079849243,
45
- "total_s": 15.426342725753784,
46
- "decode_tok_s": 9.25874836391233
 
 
 
 
 
 
 
 
 
 
 
47
  }
48
  ```
49
 
 
42
  "prompt_tokens": 26,
43
  "generated_tokens": 58,
44
  "prefill_s": 9.161997079849243,
45
+ "contains_final_4": true
46
+ }
47
+ ```
48
+
49
+ Speed note: short cold measurements include MLX graph/kernel compile and are not representative of steady decode. A no-wrapper warmed decode run over 32 measured tokens produced:
50
+
51
+ ```json
52
+ {
53
+ "prefill_s": 9.369971990585327,
54
+ "warm_tokens": 4,
55
+ "measured_tokens": 32,
56
+ "decode_s": 0.7534263134002686,
57
+ "tok_s": 42.47263392697507
58
  }
59
  ```
60