Rostislav Golubev

mika5883
AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago
mika5883/t5_dpo_rulec_beam_comet_mismatch_v1
published a model about 1 month ago
mika5883/t5_dpo_rulec_beam_comet_mismatch_v1
updated a model 3 months ago
mika5883/qwen3-14b_lorugec

Organizations

None yet

Collections 1

interesting
  • DPO Meets PPO: Reinforced Token Optimization for RLHF

    Paper • 2404.18922 • Published Apr 29, 2024 • 1

Models 55

mika5883/t5_dpo_rulec_beam_comet_mismatch_v1

0.2B • Updated Dec 4, 2025 • 2

mika5883/qwen3-14b_lorugec

Updated Oct 3, 2025

mika5883/qwen3-14b_rugec_v2

Updated Jun 24, 2025

mika5883/qwen3-14b_rugec

Updated Jun 1, 2025

mika5883/qwen3-4b_rugec

Updated May 27, 2025

mika5883/gec_t5_dpo_A_v2

0.2B • Updated May 27, 2025

mika5883/rugec_A_comet_v3

0.2B • Updated May 25, 2025 • 3

mika5883/gec_t5_dpo_A_v1

0.2B • Updated May 24, 2025 • 3

mika5883/gec_t5_dpo

0.2B • Updated May 23, 2025

mika5883/gec_Ae_yanArt

0.2B • Updated May 19, 2025

Datasets 0

None public yet