Zhicheng YANG

yangzhch6

https://yangzhch6.github.io/

AI & ML interests

reasoning with LLMs

Organizations

None yet

Recent Activity

liked a model 1 day ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 4 days ago • 1.18M • 2.18k

updated a collection 1 day ago

DeepInformal

Collection

2 items • Updated 1 day ago

updated a dataset 1 day ago

yangzhch6/DeepInformal-Putnam-1995-2024

Viewer • Updated 1 day ago • 356 • 4

published a dataset 1 day ago

yangzhch6/DeepInformal-Putnam-1995-2024

Viewer • Updated 1 day ago • 356 • 4

updated a collection 1 day ago

DeepInformal

Collection

2 items • Updated 1 day ago

updated a dataset 1 day ago

yangzhch6/DeepInformal-DeepTheorem-84k

Viewer • Updated 1 day ago • 84.1k • 4

updated a dataset 2 days ago

yangzhch6/Putnam-Informal-1995-2024

Viewer • Updated 2 days ago • 360 • 22

published 2 datasets 2 days ago

yangzhch6/Putnam-Informal-1995-2024

Viewer • Updated 2 days ago • 360 • 22

yangzhch6/DeepInformal-DeepTheorem-84k

Viewer • Updated 1 day ago • 84.1k • 4

upvoted a paper 5 days ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published 26 days ago • 45

liked a dataset 10 days ago

Jiahao004/DeepTheorem

Viewer • Updated Jul 3 • 121k • 472 • 25

updated a model 16 days ago

yangzhch6/cuda-12.8-tar

Updated 16 days ago

published a model 16 days ago

yangzhch6/cuda-12.8-tar

Updated 16 days ago

published a dataset 16 days ago

yangzhch6/cuda-12.8-tar

Updated 16 days ago • 8

updated a model 16 days ago

yangzhch6/cuda-12.8

Updated 16 days ago

published a model 16 days ago

yangzhch6/cuda-12.8

Updated 16 days ago

upvoted a paper 18 days ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 113

upvoted 2 papers 19 days ago

Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration

Paper • 2508.13755 • Published Aug 19 • 14

Reinforcing Diffusion Models by Direct Group Preference Optimization

Paper • 2510.08425 • Published 20 days ago • 10

updated a dataset 25 days ago

yangzhch6/tmp

Viewer • Updated 25 days ago • 8.03k • 95
