Roda De's picture

6

Roda De

rodade9168

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization

upvoted a paper 8 months ago

Dr.LLM: Dynamic Layer Routing in LLMs

upvoted a paper about 1 year ago

PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization

View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet