GRPO/PPO Finetunes for Creative Writing
DV
AI & ML interests
Post training @ https://dphn.ai
Recent Activity
updated a dataset about 17 hours ago: NewEden/RL-Seed-Mix-Iter-1
published a dataset about 17 hours ago: NewEden/RL-Seed-Mix-Iter-1
published a model 7 days ago: NewEden/Apertus-SFT-Stage-1
Organizations
Austral
Got bored - did a weird tune on Harbinger and now there's 10K of these. Models meant for Adventure/RP, Creative and smartz.
Delta-Vector/Austral-70B-Winton
Text Generation • 71B • Updated • 7 • 6
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 7 • 8
Delta-Vector/MS3.2-Austral-Winton
Text Generation • 24B • Updated • 33 • 12
Delta-Vector/Austral-24B-Winton
Text Generation • 24B • Updated • 44 • 15
Nanuq-R1
GRPO/PPO Finetunes for Creative Writing
models 112
Delta-Vector/Rei-24B-KTO
Text Generation • 24B • Updated • 193 • 16
Delta-Vector/Dr-House-Evals
Updated
Delta-Vector/Qwen-ckpt-100
Text Generation • Updated • 2
Delta-Vector/Austral-4.5B-Winton
Text Generation • 5B • Updated • 8 • 11
Delta-Vector/Nanuq-R1-9B
Text Generation • 11B • Updated • 5 • 4
Delta-Vector/Nanuq-R1-14B
Text Generation • 14B • Updated • 7 • 2
Delta-Vector/Austral-AFM-SFT
5B • Updated • 3
Delta-Vector/Elenchus
545k • Updated • 2
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 7 • 8
Delta-Vector/Austral-GLM4-SFT
33B • Updated • 2
datasets 123
Delta-Vector/CAI-critic-revision-8k-cleaned-sharegpt
Viewer • Updated • 8.1k • 17
Delta-Vector/Ursa-Armored-Core-6-Lore
Viewer • Updated • 166 • 34
Delta-Vector/wordlist
Viewer • Updated • 253 • 15
Delta-Vector/Tauri-RL-Styles
Viewer • Updated • 32 • 59
Delta-Vector/Hydrus-Olmo-3-sft-dedup-ngram-filter-r1
Viewer • Updated • 1.67M • 7
Delta-Vector/Ursa-Armored-Core-Lore-Kimi
Viewer • Updated • 286 • 8
Delta-Vector/Hydrus-Hardcode-Dphn
Viewer • Updated • 220 • 18
Delta-Vector/Hydrus-Smoltalk-3-Subset-Demarkdownified
Viewer • Updated • 92.1k • 9
Delta-Vector/Hydrus-Next-Coder-Single-turn
Viewer • Updated • 17.3k • 46
Delta-Vector/Tauri-Complex-JSON-Formatting
Viewer • Updated • 8.05k • 36 • 1