wongyukim
wongyukim
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies upvoted a paper about 5 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards upvoted a paper about 5 hours ago
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning