Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a dataset
about 10 hours ago
yangzhch6/Putnam-Informal-1995-2024
published
a dataset
about 10 hours ago
yangzhch6/Putnam-Informal-1995-2024
updated
a dataset
about 11 hours ago
yangzhch6/DeepInformal
Organizations
None yet
models
27
yangzhch6/cuda-12.8-tar
Updated
yangzhch6/cuda-12.8
Updated
yangzhch6/Mirror-Verifier-1.5B
2B
•
Updated
•
6
yangzhch6/Mirror-Verifier-7B
8B
•
Updated
•
14
yangzhch6/Zero-Solver-Qwen2.5-Math-7B-L
8B
•
Updated
•
12
yangzhch6/Zero-Solver-Qwen2.5-Math-1.5B-L
2B
•
Updated
•
12
yangzhch6/Qwen2.5-Math-7B-L
Text Generation
•
8B
•
Updated
•
5
yangzhch6/Qwen2.5-7B-openr1-nothink-3k-f3
Updated
yangzhch6/Qwen2.5-1.5B-openr1-nothink-3k-f3
Updated
yangzhch6/mix-rlvr-verify
Updated
datasets
9
yangzhch6/Putnam-Informal-1995-2024
Viewer
•
Updated
•
360
yangzhch6/DeepInformal
Viewer
•
Updated
•
10.8k
yangzhch6/cuda-12.8-tar
Updated
•
7
yangzhch6/tmp
Viewer
•
Updated
•
8.03k
•
94
yangzhch6/Mirror-Critique
Viewer
•
Updated
•
62.7k
•
86
yangzhch6/Qwen2.5-Math-7B-L-openr1-nothink-3k-f3-step500
Viewer
•
Updated
•
504
•
23
yangzhch6/Qwen2.5-Math-1.5B-L-openr1-nothink-3k-f3-step500
Viewer
•
Updated
•
504
•
34
yangzhch6/DARS-Dataset
Viewer
•
Updated
•
1.56k
•
33
yangzhch6/cuda12.4
Updated
•
6