Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 18 days ago • 27
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published 27 days ago • 19
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published 26 days ago • 509