Submitted by osanseviero 52 Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming · 2 authors 3.42k 6
Submitted by twinsken 39 VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters · 6 authors 2