Tian Wang's picture

3 1

Tian Wang

L-I-M-I-T

L-I-M-I-T

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models

upvoted a collection about 2 months ago

upvoted a paper 2 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

View all activity

Organizations

None yet

upvoted a paper 17 days ago

Detecting Data Contamination from Reinforcement Learning Post-training for Large Language Models

Paper • 2510.09259 • Published 22 days ago • 2

upvoted a collection about 2 months ago

RLPR

Extrapolating RLVR to General Domains without Verifiers • 6 items • Updated Aug 7 • 4

upvoted a paper 2 months ago

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

Paper • 2508.14029 • Published Aug 19 • 118