Qwen
/

Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text

Model card Files Files and versions

ShuaiBai623 commited on Oct 4

Commit

e083d31

·

verified ·

1 Parent(s): 155c4c4

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ license: apache-2.0
 # Qwen3-VL-235B-A22B-Thinking-FP8
-> This repository contains an FP8 quantized version of the [Qwen3-VL-235B-A22B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking) model. The quantization method is fine-grained `fp8` quantization with block size of 128, and its performance metrics are nearly identical to those of the original BF16 model. Enjoy!
 Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.

 # Qwen3-VL-235B-A22B-Thinking-FP8
+> This repository contains an FP8 quantized version of the [Qwen3-VL-235B-A22B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-235B-A22B-Thinking) model. The quantization method is fine-grained fp8 quantization with block size of 128, and its performance metrics are nearly identical to those of the original BF16 model. Enjoy!
 Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.