Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhuokai Zhao's picture
2 8 1

Zhuokai Zhao

zhuokai
StarGazerrr's profile picture EchoRaven's profile picture
·
https://zhuokai-zhao.com/
  • zhuokaiz
  • zhuokaizhao

AI & ML interests

Data-Efficient Learning, LLM Reasoning and Safety, Active Learning, Recommender System

Recent Activity

authored a paper 3 days ago
From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding
authored a paper 3 days ago
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
authored a paper 3 days ago
Transfer between Modalities with MetaQueries
View all activity

Organizations

MJ-Bench-Team's profile picture Project of MoE reward model's profile picture

zhuokai 's models 8

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated Aug 26

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 26

zhuokai/dapo_baseline_without_dynamic_sampling_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated Aug 26

zhuokai/as_negexp_explore_1.2_stable_0.1_decay_freq_25_warmup_period_10_negexp_Qwen2.5-Math-1.5B_zzk

Updated Aug 26

zhuokai/gpg_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 25

zhuokai/initial_grpo_baseline_temperature_0.6_Qwen2.5-Math-1.5B_zzk

Updated Aug 25

zhuokai/initial_grpo_baseline_temperature_1.0_Qwen2.5-Math-1.5B_zzk

Updated Aug 25

zhuokai/initial_grpo_baseline_temperature_1.2_Qwen2.5-Math-1.5B_zzk

Updated Aug 25
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs