view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 23 days ago • 30
Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging Paper • 2412.19512 • Published Dec 27, 2024 • 9