MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings Paper • 2405.19504 • Published May 29, 2024 • 3
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19, 2024 • 140
A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression Paper • 2406.11430 • Published Jun 17, 2024 • 25