Update README.md
Browse files
README.md
CHANGED
|
@@ -7,4 +7,5 @@ This is a MXFP4_MOE quantization of the model Qwen3-Coder-REAP-246B-A35B
|
|
| 7 |
|
| 8 |
Original model: https://huggingface.co/cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
|
| 9 |
|
| 10 |
-
The model was originally in FP8, which limits its precision.
|
|
|
|
|
|
| 7 |
|
| 8 |
Original model: https://huggingface.co/cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
|
| 9 |
|
| 10 |
+
The model was originally in FP8, which limits its precision.
|
| 11 |
+
I attempted to apply MXFP4 quantization after converting the model to FP32, but the quality degradation from the initial FP8 quantization cannot be fully reversed.
|