Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
6
Farid Bagirov
kraalfar
Follow
lippytm's profile picture
21world's profile picture
waleko's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation
upvoted
a
paper
1 day ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation
upvoted
a
paper
1 day ago
Diff-XYZ: A Benchmark for Evaluating Diff Understanding
View all activity
Organizations
kraalfar
's models
1
Sort: Recently updated
kraalfar/Qwen2.5-Coder-7B-GRPO
Text Generation
•
8B
•
Updated
Aug 12
•
4