This is an MXFP4_MOE quantization of the model Qwen3-VL-30B-A3B-Instruct.
Original model: https://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Instruct
This GGUF quant was made possible by the excellent work of [yairpatch](https://huggingface.co/yairpatch) and [Thireus](https://huggingface.co/Thireus), and anyone else I forgot to mention.
As of 2025-10-22 this is still experimental and should be treated as such.
In order to run it you must download a custom version of llama.cpp from here:
https://github.com/Thireus/llama.cpp/releases/tag/tr-qwen3-vl-6-b7106-495c611
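Once the custom build is unpacked, a run might look like the sketch below. It only assembles a command line and does not execute anything. It assumes the custom release ships the `llama-mtmd-cli` binary with the same `-m`, `--mmproj`, `--image`, and `-p` options as upstream llama.cpp, and that a separate mmproj (vision projector) GGUF is available for this model. All file names are hypothetical placeholders, not the actual names in this repository.

```python
import shlex


def build_command(model, mmproj, image, prompt, binary="llama-mtmd-cli"):
    """Return the argument list for a single image + prompt run.

    Assumption: the custom llama.cpp build provides `llama-mtmd-cli`
    with the upstream multimodal options used here.
    """
    return [
        binary,
        "-m", model,          # path to the MXFP4_MOE GGUF (hypothetical name below)
        "--mmproj", mmproj,   # path to the vision projector GGUF (hypothetical name below)
        "--image", image,     # input image
        "-p", prompt,         # text prompt
    ]


if __name__ == "__main__":
    cmd = build_command(
        "Qwen3-VL-30B-A3B-Instruct-MXFP4_MOE.gguf",  # placeholder file name
        "mmproj.gguf",                               # placeholder file name
        "example.jpg",                               # placeholder file name
        "Describe this image.",
    )
    # Print a shell-safe command string to copy into a terminal.
    print(shlex.join(cmd))
```

Replace the placeholder paths with the files you actually downloaded, and run the printed command from the directory containing the custom binaries.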