Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Menlo
/
ReZero-v0.1-llama-3.2-3b-it-grpo-250404-gguf
like
4
Follow
Menlo Research
642
Transformers
GGUF
English
llama
text-generation-inference
unsloth
conversational
arxiv:
2504.11001
License:
llama3.2
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
ReZero-v0.1-llama-3.2-3b-it-grpo-250404-gguf
Commit History
Update README.md
52c2705
verified
thinhlpg
commited on
Apr 17
Update README.md
4edb8be
verified
jan-hq
commited on
Apr 17
(Trained with Unsloth)
4999c08
verified
thinhlpg
commited on
Apr 7
(Trained with Unsloth)
d386667
verified
thinhlpg
commited on
Apr 7
Upload README.md with huggingface_hub
8a02800
verified
thinhlpg
commited on
Apr 7
initial commit
10ce393
verified
thinhlpg
commited on
Apr 7