Update README.md
README.md CHANGED

@@ -8,7 +8,7 @@ pipeline_tag: image-text-to-text
# Qwen3-VL-4B-Thinking-FP8

- > This repository contains an FP8 quantized version of the [Qwen3-VL-4B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-4B-Thinking
+ > This repository contains an FP8 quantized version of the [Qwen3-VL-4B-Thinking](https://huggingface.co/Qwen/Qwen3-VL-4B-Thinking) model. The quantization method is fine-grained FP8 quantization with a block size of 128, and its performance metrics are nearly identical to those of the original BF16 model. Enjoy!

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.
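
The added blockquote describes fine-grained FP8 quantization with a block size of 128. Below is a minimal PyTorch sketch of what block-wise FP8 quantization looks like in general; it assumes a 2-D weight whose dimensions divide evenly by the block size and uses `torch.float8_e4m3fn` as the target format. The function name and the exact recipe/storage layout are illustrative assumptions, not the method actually used to produce this checkpoint.

```python
import torch

def quantize_fp8_blockwise(weight: torch.Tensor, block_size: int = 128):
    """Illustrative block-wise FP8 (e4m3) quantization: one scale per
    block_size x block_size tile. Conceptual sketch only; the recipe used
    for this checkpoint may differ."""
    fp8_max = torch.finfo(torch.float8_e4m3fn).max  # 448.0 for e4m3fn
    rows, cols = weight.shape  # assumed divisible by block_size for brevity
    q = torch.empty(rows, cols, dtype=torch.float8_e4m3fn)
    scales = torch.empty(rows // block_size, cols // block_size)
    for i in range(0, rows, block_size):
        for j in range(0, cols, block_size):
            block = weight[i:i + block_size, j:j + block_size]
            # One scale per tile keeps quantization error local ("fine-grained").
            scale = block.abs().amax().clamp(min=1e-12) / fp8_max
            q[i:i + block_size, j:j + block_size] = (block / scale).to(torch.float8_e4m3fn)
            scales[i // block_size, j // block_size] = scale
    # Dequantize a tile later with: q_tile.to(torch.float32) * scale
    return q, scales
```

Per-tile scales prevent a few large outliers from inflating the quantization error of the whole matrix, which is why fine-grained FP8 schemes can track the BF16 baseline closely.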