Qwen/Qwen3-Next-80B-A3B-Instruct

Issues with Fine Tuning

#37 opened about 7 hours ago by

rirv938

Has anybody got MTP working on VLLM? ('GPUModelRunner' object has no attribute 'drafter')

#36 opened 6 days ago by

stev236

Generates nonsense when run with latest VLLM with Flashinfer 0.4

#35 opened 7 days ago by

stev236

Bug: Running the example gives nonsensical response on 8xH100

#33 opened 24 days ago by

kz919

return null

#32 opened 28 days ago by

sakuramiko35

How much Vram needed for the full context length?

5

#31 opened 29 days ago by

Aly87

求大神解读一下这行代码的含义

#30 opened 30 days ago by

bluelueSea

Int4 quantization broken

3

#28 opened about 1 month ago by

TheBigBlockPC

Could you release a 20B‑scale MoE version? Thank you very much.

🔥 1

1

#27 opened about 1 month ago by

houxiaowei

Awesome! Please be sure to train a 80B A3B next version coder model!

🔥 6

#26 opened about 1 month ago by

wukongai

Bug report with running with transformers

#25 opened about 1 month ago by

qsstcl

Only 2k max-tokens in lm-studio?

#24 opened about 1 month ago by

jkkit

qwen

#23 opened about 1 month ago by

Dumpy13

Test

#22 opened about 1 month ago by

vhm8356

VRAM requirement for maximum token length?

🚀 5

#21 opened about 1 month ago by

Donhuay

guide for runing this at 12gbvram and 180gb ram with dual cpu in vllm 0.5 to 0.6t/sec in vllm

🔥 👍 4

2

#20 opened about 1 month ago by

gopi87

Fix broken qwen3-next blog link

#19 opened about 1 month ago by

Smorty100

FP8 please

👀 ➕ 16

8

#18 opened about 1 month ago by

aliquis-pe

model_use

#17 opened about 1 month ago by

mohanpichikala

Will smaller Qwen3-Next models be released in the future?

➕ 👀 7

1

#15 opened about 1 month ago by

ZAID041

abhai

#14 opened about 1 month ago by

Abhai121

🚀 Best Practices for Evaluating the Qwen3-Next Model

🚀 👍 8

#13 opened about 1 month ago by

Yunxz

Is it possible to finetune with ms-swift?

🚀 1

3

#12 opened about 1 month ago by

phosira

reduced multi language quality

👍 1

3

#11 opened about 2 months ago by

rastegar

遥遥领先了

2

#10 opened about 2 months ago by

OrlandoHugBot

用readme的代码测试，返回乱码

5

#9 opened about 2 months ago by

tarjintor

Plan for AWQ?

➕ 24

3

#8 opened about 2 months ago by

hyunw55

How much GPU memory is needed for local deployment?

13

#7 opened about 2 months ago by

XuehangCang

fix the blog link

1

#6 opened about 2 months ago by

ryan-u

Will there be dedicated technical report for Qwen3-Next?

👍 6

#5 opened about 2 months ago by

Gmc2

The model is wholesome

🔥 1

2

#4 opened about 2 months ago by deleted

Local Installation Video and Testing On CPU - Step by Step

🤗 3

#3 opened about 2 months ago by

fahdmirzac

No base model

👍 14

8

#2 opened about 2 months ago by

ricardo-rei

GGUF when? 8 bit quant when?

➕ ❤️ 13

14

#1 opened about 2 months ago by

ouchiewouchie