dev-store (dev-store)

zhangchenxu

authored a paper 3 months ago

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1 • 25

zhangchenxu

authored 2 papers 7 months ago

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Paper • 2505.23977 • Published May 29 • 10

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Paper • 2505.14625 • Published May 20 • 13

fqjiang

updated a model 8 months ago

dev-store/grpo_sc_rl_v2_1.5B_ep6

2B • Updated Apr 28

fqjiang

published a model 8 months ago

dev-store/grpo_sc_rl_v2_1.5B_ep6

2B • Updated Apr 28

fqjiang

updated a model 8 months ago

dev-store/grpo_sc_rl_v2_1.5B_ep4

2B • Updated Apr 27

fqjiang

published a model 8 months ago

dev-store/grpo_sc_rl_v2_1.5B_ep4

2B • Updated Apr 27

fqjiang

updated a model 9 months ago

dev-store/7b_grpo_gsm8k-blurchain-v1_ep4

8B • Updated Apr 13 • 8

fqjiang

published a model 9 months ago

dev-store/7b_grpo_gsm8k-blurchain-v1_ep4

8B • Updated Apr 13 • 8

fqjiang

updated a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_ep8

2B • Updated Apr 10 • 6

fqjiang

published a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_ep8

2B • Updated Apr 10 • 6

fqjiang

updated a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_grpo_gsm8k-blurchain-v1_blur-1.5b_202504100040_step116_202504100844

2B • Updated Apr 10 • 6

fqjiang

published a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_grpo_gsm8k-blurchain-v1_blur-1.5b_202504100040_step116_202504100844

2B • Updated Apr 10 • 6

fqjiang

updated a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_blur-1.5b_202504100040_step116

2B • Updated Apr 10

fqjiang

published a model 9 months ago

dev-store/grpo_gsm8k-blurchain-v1_blur-1.5b_202504100040_step116

2B • Updated Apr 10

fqjiang

updated a dataset 9 months ago

dev-store/math-diff-data

Viewer • Updated Apr 9 • 16.8k • 6

fqjiang

published a dataset 9 months ago

dev-store/math-diff-data

Viewer • Updated Apr 9 • 16.8k • 6

fqjiang

updated a model 9 months ago

dev-store/blur-1.5b

Text Generation • 2B • Updated Apr 4

fqjiang

published a model 9 months ago

dev-store/blur-1.5b

Text Generation • 2B • Updated Apr 4

fqjiang

updated a model 9 months ago

dev-store/blur-7b

Text Generation • 8B • Updated Apr 3

Team members 3
