Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence Paper • 2511.07384 • Published Nov 10, 2025 • 19
smcleish/Recurrent-Llama-3.2-train-recurrence-32 Text Generation • 1B • Updated Nov 11, 2025 • 30 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-32 Text Generation • 0.8B • Updated Nov 11, 2025 • 3 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-16 Text Generation • 0.8B • Updated Nov 11, 2025 • 3 • 1
smcleish/Recurrent-OLMo-2-0425-train-recurrence-32 Text Generation • 1B • Updated Nov 11, 2025 • 12 • 2
smcleish/Recurrent-OLMo-2-0425-train-recurrence-4 Text Generation • 1B • Updated Nov 11, 2025 • 1 • 1
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-single-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 2
smcleish/Recurrent-TinyLlama-3T-train-recurrence-4-two-phase Text Generation • 0.8B • Updated Nov 11, 2025 • 1