Tian Wang
L-I-M-I-T
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Detecting Data Contamination from Reinforcement Learning Post-training
for Large Language Models
upvoted
a
collection
3 months ago
RLPR
upvoted
a
paper
3 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVR
Organizations
None yet