Xinyu Zhu

TianHongZXY

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 54

authored a paper about 1 month ago

RAST: Reasoning Activation in LLMs via Small-model Transfer

Paper • 2506.15710 • Published May 30

updated a dataset 2 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4 • 2.16k • 2.81k

published a dataset 2 months ago

TianHongZXY/similar_problems_with_three_in_context_problems

Viewer • Updated Sep 4 • 2.16k • 2.81k

upvoted a paper 2 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25 • 342

updated a dataset 2 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28 • 2.16k • 4.23k

published a dataset 2 months ago

TianHongZXY/Top_5_similar_question-NVIDIA-OpenScienceReasoning-2

Viewer • Updated Aug 28 • 2.16k • 4.23k

liked a dataset 2 months ago

cais/hle

Viewer • Updated Sep 10 • 2.5k • 11.8k • 499

liked a dataset 3 months ago

nvidia/OpenScienceReasoning-2

Viewer • Updated Jul 31 • 803k • 721 • 46

liked a model 3 months ago

Qwen/Qwen3-235B-A22B-Thinking-2507

Text Generation • 235B • Updated Aug 17 • 33.7k • 375

liked a dataset 3 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25 • 25.7M • 10.6k • 158

upvoted a collection 3 months ago

RLVR-Decomposed

Collection

The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" • 9 items • Updated Jun 1 • 2

updated a model 3 months ago

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28 • 8

updated a model 4 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12

published a model 4 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-Math-7B-RoPE-40K-GRPO-use_guide-clip_ratio_upper_0.28

Updated Jul 12

updated 2 models 4 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-gpt-4o-summary_wo_think-clip_0.28

Updated Jul 8

published 2 models 4 months ago

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-gpt-4o-summary_wo_think-clip_0.28

Updated Jul 8

upvoted a collection 5 months ago

AdaDecode

Collection

[ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism. • 18 items • Updated Jun 4 • 3
