ToolRL ToolRL: Reward is All Tool Learning Needs emrecanacikgoz/Qwen2.5-7B-Instruct-ToolRL-grpo-cold Updated Apr 22, 2025 • 234 • 3 emrecanacikgoz/ToolRL Viewer • Updated Apr 22, 2025 • 4k • 20 • 2 ToolRL: Reward is All Tool Learning Needs Paper • 2504.13958 • Published Apr 16, 2025 • 48
Hippocrates emrecanacikgoz/hippollama Text Generation • Updated Apr 26, 2024 • 14 • 5 emrecanacikgoz/hippomistral Text Generation • Updated Apr 26, 2024 • 13 • 8
SMART Collection of SMART models fine-tuned for improved self-awareness to reduce unnecessary tool use. emrecanacikgoz/SMARTAgent-Llama-3.1-8B Updated Feb 17, 2025 • 7 • 1 emrecanacikgoz/SMARTAgent-Llama-3.1-70B Updated Feb 17, 2025 • 5 emrecanacikgoz/SMARTAgent-Mistral-7B-Instruct-v0.3 Updated Feb 17, 2025 • 8 • 1 emrecanacikgoz/SMARTAgent-Mistral-Nemo-Instruct-2407 Updated Feb 17, 2025 • 9 • 1
Turkish-LLMs Official model collections for Bridging the Bosphorus paper including pre-trained from scracth and fine-tuned models. emrecanacikgoz/hamza-small Text Generation • Updated May 2, 2024 • 14 emrecanacikgoz/hamza-large Text Generation • Updated May 2, 2024 • 60 • 2 emrecanacikgoz/hamza-xl Text Generation • Updated May 2, 2024 • 14 • 3 emrecanacikgoz/hamza-mistral Text Generation • Updated May 9, 2024 • 21 • 1
ToolRL ToolRL: Reward is All Tool Learning Needs emrecanacikgoz/Qwen2.5-7B-Instruct-ToolRL-grpo-cold Updated Apr 22, 2025 • 234 • 3 emrecanacikgoz/ToolRL Viewer • Updated Apr 22, 2025 • 4k • 20 • 2 ToolRL: Reward is All Tool Learning Needs Paper • 2504.13958 • Published Apr 16, 2025 • 48
SMART Collection of SMART models fine-tuned for improved self-awareness to reduce unnecessary tool use. emrecanacikgoz/SMARTAgent-Llama-3.1-8B Updated Feb 17, 2025 • 7 • 1 emrecanacikgoz/SMARTAgent-Llama-3.1-70B Updated Feb 17, 2025 • 5 emrecanacikgoz/SMARTAgent-Mistral-7B-Instruct-v0.3 Updated Feb 17, 2025 • 8 • 1 emrecanacikgoz/SMARTAgent-Mistral-Nemo-Instruct-2407 Updated Feb 17, 2025 • 9 • 1
Hippocrates emrecanacikgoz/hippollama Text Generation • Updated Apr 26, 2024 • 14 • 5 emrecanacikgoz/hippomistral Text Generation • Updated Apr 26, 2024 • 13 • 8
Turkish-LLMs Official model collections for Bridging the Bosphorus paper including pre-trained from scracth and fine-tuned models. emrecanacikgoz/hamza-small Text Generation • Updated May 2, 2024 • 14 emrecanacikgoz/hamza-large Text Generation • Updated May 2, 2024 • 60 • 2 emrecanacikgoz/hamza-xl Text Generation • Updated May 2, 2024 • 14 • 3 emrecanacikgoz/hamza-mistral Text Generation • Updated May 9, 2024 • 21 • 1