韩千怡

noahga

AI & ML interests

None yet

Recent Activity

liked a dataset 26 days ago

liuxu030724/QuRating-GPT3.5-Judgments-Test

Organizations

None yet

upvoted a paper 7 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

Paper • 2606.15345 • Published 15 days ago • 16

upvoted a paper 26 days ago

ESPO: Early-Stopping Proximal Policy Optimization

Paper • 2605.29860 • Published about 1 month ago • 20

upvoted 3 papers about 1 month ago

SAM 3D Animal: Promptable Animal 3D Reconstruction from Images in the Wild

Paper • 2605.07604 • Published May 8 • 4

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published May 25 • 103

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 116

upvoted a paper about 2 months ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

upvoted a paper 2 months ago

GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents

Paper • 2604.07429 • Published Apr 8 • 123

upvoted 3 papers 3 months ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373

upvoted 2 papers 4 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published Feb 26 • 150