KaVa: Latent Reasoning via Compressed KV-Cache Distillation • Paper • 2510.02312 • Published about 1 month ago • 1
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers • Paper • 2307.02321 • Published Jul 5, 2023 • 7