HF Spaces for demoing chat completion models—no ZeroGPU, WebGPU, or BYOK included. Thank you so much to these devs!
Urro
urroxyz
AI & ML interests
i like research on empowering small LMs to do better 😮
i DISLIKE video & image generation (esp. ai "art") 🤢
Recent Activity
updated
a collection
1 day ago
✨ free demo spaces updated
a collection
1 day ago
HUMAN-WRITTEN & LEGALLY-SOURCED* updated
a collection
1 day ago
✨ free demo spaces Organizations
TINY MODELS WITH BIG INTELLIGENCE
Tiny (<30B) models that tend to outperform their same-parameter counterparts.
-
Qwen/Qwen3.5-27B
Image-Text-to-Text • 28B • Updated • 218k • • 457 -
cerebras/GLM-4.7-Flash-REAP-23B-A3B
Text Generation • 23B • Updated • 8.34k • 65 -
janhq/Jan-v3-4B-base-instruct
Text Generation • 4B • Updated • 3.46k • 53 -
ServiceNow-AI/Apriel-1.6-15b-Thinker
Image-Text-to-Text • Updated • 5.61k • • 287
HUMAN-WRITTEN & LEGALLY-SOURCED*
Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly.
WTF GENIUS PAPERS
Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.
-
Diffusion Language Models Know the Answer Before Decoding
Paper • 2508.19982 • Published • 27 -
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
Paper • 2512.13586 • Published • 93 -
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following
Paper • 2601.06431 • Published • 12 -
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
Paper • 2601.09088 • Published • 63
ETHICALLY-DECENT & LEGALLY-ADJACENT
Depending on your definitions, these models may not be strictly "ethical" or "legal", yet they are 100% more ethical and legal than GPT or Claude.
ATTENTIVE ASR MODELS FOR ONNX
ONNX conversions of ASR models with attentions enabled for output. Especially useful for word-level timestamp extraction.
✨ free demo spaces
HF Spaces for demoing chat completion models—no ZeroGPU, WebGPU, or BYOK included. Thank you so much to these devs!
- RunningFeatured38
Step-3.5-Flash Chatbot
🚀38Run interactive Streamlit apps directly in your browser
- Running
MiniMax M2.5 Chat
👀Chat with MiniMax M2.5 — 230B MoE model (10B active)
- Running5
Ling Space
🦉5Chat, code, and write with AI‑powered multilingual assistant
- Running on CPU UpgradeFeatured334
GPT-OSS-120B on AMD MI300X
💻334gpt-oss-120b on AMD MI300X GPUs
WTF GENIUS PAPERS
Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.
-
Diffusion Language Models Know the Answer Before Decoding
Paper • 2508.19982 • Published • 27 -
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
Paper • 2512.13586 • Published • 93 -
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following
Paper • 2601.06431 • Published • 12 -
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
Paper • 2601.09088 • Published • 63
TINY MODELS WITH BIG INTELLIGENCE
Tiny (<30B) models that tend to outperform their same-parameter counterparts.
-
Qwen/Qwen3.5-27B
Image-Text-to-Text • 28B • Updated • 218k • • 457 -
cerebras/GLM-4.7-Flash-REAP-23B-A3B
Text Generation • 23B • Updated • 8.34k • 65 -
janhq/Jan-v3-4B-base-instruct
Text Generation • 4B • Updated • 3.46k • 53 -
ServiceNow-AI/Apriel-1.6-15b-Thinker
Image-Text-to-Text • Updated • 5.61k • • 287
ETHICALLY-DECENT & LEGALLY-ADJACENT
Depending on your definitions, these models may not be strictly "ethical" or "legal", yet they are 100% more ethical and legal than GPT or Claude.
HUMAN-WRITTEN & LEGALLY-SOURCED*
Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly.
ATTENTIVE ASR MODELS FOR ONNX
ONNX conversions of ASR models with attentions enabled for output. Especially useful for word-level timestamp extraction.