yaoyifan

yyf12

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Paper • 2601.21937 • Published Jan 29 • 19

upvoted 2 papers 4 months ago

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published Dec 3, 2025 • 28

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models

Paper • 2511.05459 • Published Nov 7, 2025 • 4

upvoted 2 papers 5 months ago

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229

AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?

Paper • 2509.03312 • Published Sep 3, 2025 • 5

upvoted a paper 6 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 47

updated a dataset 8 months ago

m-a-p/FineLeanCorpus

Viewer • Updated Jul 28, 2025 • 509k • 364 • 10

New activity in m-a-p/FineLeanCorpus 8 months ago

Update README.md

#4 opened 8 months ago by yyf12

updated 3 models 9 months ago

published 3 models 9 months ago

m-a-p/CriticLeanGPT-Qwen2.5-7B-RL

15B • Updated Jul 12, 2025 • 8 • 1

m-a-p/CriticLeanGPT-Qwen2.5-14B-RL

15B • Updated Jul 12, 2025 • 4 • 1

m-a-p/CriticLeanGPT-Qwen2.5-32B-RL

33B • Updated Jul 12, 2025 • 4

updated 2 models 9 months ago

m-a-p/CriticLeanGPT-Qwen2.5-7B-Instruct-SFT-RL

8B • Updated Jul 11, 2025 • 5 • 1

m-a-p/CriticLeanGPT-Qwen2.5-32B-Instruct-SFT-RL

33B • Updated Jul 11, 2025 • 8

published a model 9 months ago

m-a-p/CriticLeanGPT-Qwen2.5-7B-Instruct-SFT-RL

8B • Updated Jul 11, 2025 • 5 • 1

updated a model 9 months ago

m-a-p/CriticLeanGPT-Qwen2.5-32B-Instruct-SFT

Text Generation • Updated Jul 11, 2025 • 6