Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
MasterControlAIML
/
DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora-gguf
like
0
Follow
MasterControl
28
Transformers
GGUF
English
qwen2
text-generation-inference
unsloth
trl
grpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
85f4060
DeepSeek-R1-Qwen2.5-3b-LLM-Judge-Reward-JSON-Unstructured-To-Structured-Lora-gguf
/
README.md
Commit History
Update README.md
85f4060
verified
bhaviktheslider
commited on
Jun 18
Update README.md
f50b7e6
verified
bhaviktheslider
commited on
Jun 17
Update README.md
ed0c793
verified
bhaviktheslider
commited on
Jun 17
Update README.md
e0b66cd
verified
bhaviktheslider
commited on
Jun 17
Upload README.md with huggingface_hub
b85630b
verified
bhaviktheslider
commited on
Apr 26