qwen3-4b-argentum LoRA

This repository contains a PEFT LoRA adapter for Qwen3-4B-Instruct-2507. It is intended for Spanish instruction following.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base = "Qwen/Qwen3-4B-Instruct-2507"
tok = AutoTokenizer.from_pretrained(base, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(base, trust_remote_code=True, device_map="auto")
model = PeftModel.from_pretrained(model, "frizynn/qwen3-4b-argentum")
prompt = tok.apply_chat_template([{"role":"user","content":"hola"}], tokenize=False, add_generation_prompt=True)
ids = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**ids, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))

Training details

Describe data, steps, hyperparameters, and safety considerations here.

Downloads last month

-

Downloads are not tracked for this model. How to track
Safetensors
Model size
2B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for frizynn/qwen3-4b-argentum

Adapter
(68)
this model