LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 12 days ago • 138
Running on Zero MCP 1.55k Wan2.2 14B Fast Preview 🐌 1.55k generate a video from an image with a text prompt
Running on Zero MCP Featured 193 Wan2.2 14B Fast Preview 🐌 193 generate a video from an image with a text prompt
Running on CPU Upgrade Featured 392 ML Intern 🤖 392 Chat with an AI assistant for machine learning help
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 39
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 1 day ago • 209
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Image-Text-to-Text • 28B • Updated Apr 6 • 147k • • 2.87k
Running Featured 130 Voxtral Realtime WebGPU 💬 130 Real-time speech transcription, entirely in your browser.
Running on Zero Agents Featured 139 Qwen3-ASR Demo 🎙 139 Transcribe audio to text with timestamps and visualization