
YutaoXie

AndreasX1206

Andreas1206

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

upvoted a paper 5 days ago

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

updated a model 7 months ago

AndreasX1206/test


Organizations

upvoted 2 papers 5 days ago

TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

Paper • 2603.22293 • Published Mar 11 • 1

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

Paper • 2603.12151 • Published Mar 12 • 2

updated a model 7 months ago

AndreasX1206/test

8B • Updated Oct 3, 2025

published 2 models 7 months ago

AndreasX1206/deepseek_esft_translation_lr4e-4_ste

Updated Sep 16, 2025

AndreasX1206/deepseek_esft_translation_lr5e-4_ste

Updated Sep 16, 2025

published a dataset 8 months ago

AndreasX1206/ckpt

Updated Aug 29, 2025 • 2

published a model 9 months ago

AndreasX1206/test

8B • Updated Oct 3, 2025

New activity in LLM360/guru-RL-92k 10 months ago

Update README.md

#12 opened 10 months ago by

AndreasX1206

Update README.md

#11 opened 10 months ago by

AndreasX1206

updated a dataset 10 months ago

ucsd-wang-lab-lm/bird_execution_correct_data

Updated Jul 9, 2025 • 3

New activity in LLM360/guru-RL-92k-extra-info-compressed 10 months ago

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#7 opened 10 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k 10 months ago

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#9 opened 10 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k-extra-info-compressed 10 months ago

Delete online_eval/math__olympiad_bench_675.parquet

#6 opened 10 months ago by

AndreasX1206

Delete online_eval/codegen__leetcode2k_386.parquet

#5 opened 10 months ago by

AndreasX1206

Rename online_eval/simulation__arcagi1_200.parquet to online_eval/logic__arcagi1_200.parquet

#4 opened 10 months ago by

AndreasX1206

Delete online_eval/math__minerva_272.parquet

#3 opened 10 months ago by

AndreasX1206

New activity in LLM360/guru-RL-92k 10 months ago

Delete online_eval/logic__graph_logical_dataset_77.parquet

#8 opened 10 months ago by

AndreasX1206

Delete online_eval/codegen__leetcode2k_386.parquet

#7 opened 10 months ago by

AndreasX1206

Delete online_eval/math__minerva_272.parquet

#6 opened 10 months ago by

AndreasX1206

Rename online_eval/table__hitab_300.parquet to online_eval/table__hitab_200.parquet

#5 opened 10 months ago by

AndreasX1206
