Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

6

Base only

Active filters: bgpo

THU-KEG/LLaDA-8B-BGPO-math

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 6 • 1

THU-KEG/LLaDA-8B-BGPO-code

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 3 • 1

THU-KEG/LLaDA-8B-BGPO-countdown

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 4 • 1

THU-KEG/LLaDA-8B-BGPO-sudoku

Reinforcement Learning • 8B • Updated Oct 14, 2025 • 5 • 1

TheHierophant/LLaDA-8B-BGPO-math-Q5_K_M-GGUF

Reinforcement Learning • 8B • Updated about 23 hours ago

TheHierophant/LLaDA-8B-BGPO-math-Q6_K-GGUF

Reinforcement Learning • 8B • Updated about 23 hours ago