1 19 1

Nick Yang

RadioBlue

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

upvoted a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a paper about 2 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

View all activity

Organizations

upvoted a paper 29 days ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published 29 days ago • 18

upvoted a paper about 1 month ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 110

upvoted 4 papers about 2 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9 • 32

upvoted a paper 2 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 94

authored a paper 3 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

upvoted a collection 3 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Collection

This is the official versions of OpenCUA models and AgentNet datasets. Website: https://opencua.xlang.ai/ • 8 items • Updated 12 days ago • 20

upvoted a paper 3 months ago

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published Aug 12 • 31

upvoted an article 5 months ago

Article

GRPO for GUI Grounding Done Right

•

Jun 11

• 34

upvoted a paper 5 months ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 52

New activity in xlangai/Jedi-3B-1080p 5 months ago

unable to run demo.py, plz help

#2 opened 5 months ago by

rdhoundiyal

upvoted a paper 5 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 130

authored a paper 5 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 45

upvoted a paper 5 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 45

upvoted a paper 6 months ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published Apr 15 • 62

upvoted a paper 7 months ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published Apr 11 • 27

upvoted a paper 8 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

liked a model 11 months ago

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 2.1M • • 1.24k

Nick Yang

AI & ML interests

Recent Activity

Organizations

RadioBlue's activity

GRPO for GUI Grounding Done Right

unable to run demo.py, plz help