Active filters: RL
bartowski/nvidia_Nemotron-Cascade-2-30B-A3B-GGUF
Text Generation
• 32B • Updated • 7.8k
• 39
Jackrong/Qwopus3.5-4B-Coder-MTP-GGUF
Image-Text-to-Text
• Updated • 30.4k
• 53
Jackrong/Qwopus3.5-4B-Coder-GGUF
Image-Text-to-Text
• 4B • Updated • 24.1k
• 26
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 12
• 20
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 49.2k
• 505
trohrbaugh/Nemotron-Cascade-2-30B-A3B-heretic-uncensored
Text Generation
• 32B • Updated • 9
• 4
erreursyntax/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 20
• 1
mradermacher/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF
Reinforcement Learning
• 8B • Updated • 784
• 1
mradermacher/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-i1-GGUF
Reinforcement Learning
• 8B • Updated • 2.33k
• 1
stanfordnlp/SteamSHP-flan-t5-xl
Updated • 11
• 43
stanfordnlp/SteamSHP-flan-t5-large
Updated • 311
• 33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
• 2B • Updated • 10
• 5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B • Updated • 110
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
• 71B • Updated • 34
• 3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
• 71B • Updated • 18
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
• 71B • Updated • 158
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
• Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
• Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated • 45
• 24
Reinforcement Learning
• Updated • 2
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated • 32
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated • 224
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B • Updated • 48
Text Generation
• 684B • Updated • 33
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B • Updated • 36
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
• 0.5B • Updated • 102
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
0.5B • Updated • 46
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
• 2B • Updated • 60
• 1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
2B • Updated • 29
• 1
mradermacher/Zireal-0-GGUF