Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Gandharv Patil's picture
1

Gandharv Patil

gp02-mcgill
sasha's profile picture
·
  • gp1702

AI & ML interests

Reinforcement Learning, Stochastic Optimisation, Probabilistic Inference

Organizations

Mila – Quebec Artificial Intelligence Institute's profile picture MOMA-models's profile picture

Papers 1

arxiv:2506.16507

models 1

gp02-mcgill/zephyr-7b-dpo-qlora

Updated Jan 8

datasets 3

gp02-mcgill/ultrafeedback_binarised_all_max

Viewer • Updated Jan 31 • 176k • 6

gp02-mcgill/ultrafeedback_binarised_rnd_max

Viewer • Updated Jan 31 • 60.9k • 4

gp02-mcgill/ultrafeedback_binarised_min_max

Viewer • Updated Jan 31 • 60.9k • 17
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs