This is an MXFP4_MOE quantization of the model Qwen3-VL-30B-A3B-Instruct.

Original model: https://huggingface.co/Qwen/Qwen3-VL-30B-A3B-Instruct

This GGUF quant was made possible by the excellent work of [yairpatch](https://huggingface.co/yairpatch) and [Thireus](https://huggingface.co/Thireus), and anyone else I forgot to mention.

As of 2025-10-22 this is still experimental and should be treated as such.
To run it, you need a custom build of llama.cpp, available here:
https://github.com/Thireus/llama.cpp/releases/tag/tr-qwen3-vl-6-b7106-495c611
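
If you prefer building that fork from source rather than using the release binaries, here is a minimal sketch, assuming the fork keeps mainline llama.cpp's CMake setup and its `llama-mtmd-cli` multimodal tool; the GGUF and mmproj filenames below are placeholders for the actual files in this repo:

```bash
# Build the custom fork (assumes the standard llama.cpp CMake workflow)
git clone https://github.com/Thireus/llama.cpp
cd llama.cpp
git checkout tr-qwen3-vl-6-b7106-495c611   # tag of the linked release
cmake -B build
cmake --build build --config Release

# Run the vision model with the multimodal CLI; filenames are placeholders
./build/bin/llama-mtmd-cli \
  -m Qwen3-VL-30B-A3B-Instruct-MXFP4_MOE.gguf \
  --mmproj mmproj-Qwen3-VL-30B-A3B-Instruct.gguf \
  --image input.png \
  -p "Describe this image."
```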

Model details: GGUF format, 31B params, `qwen3vlmoe` architecture, 4-bit (MXFP4) quantization.
