ShuaiBai623 commited on
Commit
e083d31
·
verified ·
1 Parent(s): 155c4c4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ license: apache-2.0
7
 
8
  # Qwen3-VL-235B-A22B-Thinking-FP8
9
 
10
- > This repository contains an FP8 quantized version of the [Qwen3-VL-235B-A22B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking) model. The quantization method is fine-grained `fp8` quantization with block size of 128, and its performance metrics are nearly identical to those of the original BF16 model. Enjoy!
11
 
12
 
13
  Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.
 
7
 
8
  # Qwen3-VL-235B-A22B-Thinking-FP8
9
 
10
+ > This repository contains an FP8 quantized version of the [Qwen3-VL-235B-A22B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking) model. The quantization method is fine-grained fp8 quantization with block size of 128, and its performance metrics are nearly identical to those of the original BF16 model. Enjoy!
11
 
12
 
13
  Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.