Add model card

by nielsr HF Staff - opened Jun 29



This PR adds a model card that links the model to the paper "OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling" and adds the relevant metadata (license, pipeline tag, library name) to improve discoverability.
Project page: https://huggingface.co/OctoThinker


Cannot merge

This branch has merge conflicts in the following files:
