| library_name: transformers | |
| pipeline_tag: text-generation | |
| base_model: | |
| - meta-llama/Llama-3.2-3B | |
| ## UFT | |
| This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984). | |
| Code: https://github.com/liumy2010/UFT | |
| ## References | |
| * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984) | |