Text Generation
Safetensors
English
llama

Add model card

#1
by nielsr HF Staff - opened

This PR adds a model card that links the model to the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling, and adds the relevant metadata (license, pipeline tag, library name) to improve discoverability.
Project page: https://huggingface.co/OctoThinker
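
For readers unfamiliar with model card metadata: it sits in a YAML front-matter block at the top of README.md. The sketch below shows roughly how the three fields named above could be rendered. It is not taken from this PR's diff. The helper `render_front_matter` is hypothetical, and every value (the license placeholder, `text-generation`, `transformers`) is an assumption for illustration only.

```python
def render_front_matter(metadata: dict) -> str:
    """Render a flat metadata dict as a YAML front-matter block.

    Hypothetical helper for illustration; handles only strings and
    lists of strings, which is enough for the fields shown here.
    """
    lines = ["---"]
    for key, value in metadata.items():
        if isinstance(value, list):
            # YAML block sequence: key on its own line, one "- item" per entry.
            lines.append(f"{key}:")
            lines.extend(f"- {item}" for item in value)
        else:
            lines.append(f"{key}: {value}")
    lines.append("---")
    return "\n".join(lines) + "\n"


# Assumed values, NOT read from the PR: replace each with the real one.
metadata = {
    "license": "LICENSE-ID-HERE",       # placeholder, actual license not stated here
    "pipeline_tag": "text-generation",  # assumption
    "library_name": "transformers",     # assumption
}

front_matter = render_front_matter(metadata)
print(front_matter)
```

The rendered block would go at the very top of README.md, followed by the prose of the model card.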

Cannot merge
This branch has merge conflicts in the following files:
  • README.md
