liumy2010
/

Llama-3.2-3B-countdown-RFT

Text Generation

text-generation-inference

Model card Files Files and versions

Llama-3.2-3B-countdown-RFT / README.md

liumy2010's picture

Upload README.md with huggingface_hub

abf6979 verified 6 months ago

|

history blame contribute delete

425 Bytes

	---
	library_name: transformers
	pipeline_tag: text-generation
	base_model:
	- meta-llama/Llama-3.2-3B
	---

	## UFT

	This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).

	Code: https://github.com/liumy2010/UFT

	## References

	* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)