Dmitri Babaev
dllllb
AI & ML interests
PLP, RL, sequential data
Recent Activity
upvoted a paper 7 days ago
Multimodal Evaluation of Russian-language Architectures
upvoted an article about 2 months ago
From GRPO to DAPO and GSPO: What, Why, and How
authored a paper 2 months ago
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks