---
library_name: transformers
pipeline_tag: text-generation
base_model:
  - Qwen/Qwen2.5-1.5B
---

# UFT

This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984).

Code: https://github.com/liumy2010/UFT
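
The metadata lists `transformers` as the library and `text-generation` as the pipeline, so the checkpoint can presumably be loaded like any other causal language model. The snippet below is a minimal sketch under that assumption, not the authors' evaluation setup: `MODEL_ID` is a placeholder to replace with this repository's Hub ID, and the helper name `generate_text` and its default of 128 new tokens are illustrative choices.

```python
# Hypothetical placeholder: replace with this repository's actual Hub ID.
MODEL_ID = "<this-repo-id>"


def generate_text(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 128) -> str:
    """Load a causal LM from the Hub and return only the generated continuation."""
    # Imported lazily so that defining this helper does not require transformers/torch.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors and generate a continuation.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Once `MODEL_ID` is set, calling `generate_text("What is 12 * 7?")` downloads the weights on first use and returns the completion as a string. The prompt format used during training is not stated here, so check the code repository before relying on a particular template.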

## References

* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)