Qwen3-4B Claude Sonnet Reasoning Distill

This model was trained on a Claude Sonnet 4 (non-reasoning) dataset and a Claude Sonnet 3.7 (reasoning) dataset.

  • 🧬 Datasets:

    • Liontix/claude-sonnet-4-100x
    • reedmayhew/claude-3.7-sonnet-reasoning
  • 🏗 Base Model:

    • unsloth/Qwen3-4B-unsloth-bnb-4bit

If you want to fine-tune this model:

  • Start from: Liontix/Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Safetensor
  • Change dataset as needed in your training script or notebook

Prompt format uses Claude-style <|im_start|> / <|im_end|> markers with role tags.

Downloads last month
134
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Liontix/Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Safetensor

Base model

Qwen/Qwen3-4B-Base
Finetuned
Qwen/Qwen3-4B
Finetuned
(160)
this model
Merges
1 model
Quantizations
2 models

Datasets used to train Liontix/Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Safetensor