39 196 49

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

liked a dataset about 12 hours ago

XXHStudyHard/EnvScaler-SFT-Traj-9K

upvoted a paper 1 day ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

upvoted a paper 1 day ago

ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition

View all activity

Organizations

liked a dataset about 12 hours ago

XXHStudyHard/EnvScaler-SFT-Traj-9K

Viewer • Updated about 9 hours ago • 9.02k • 8 • 1

liked a model 12 days ago

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 20 days ago • 13 • 1

liked a model 20 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 20 days ago • 21 • 2

liked 3 datasets 2 months ago

liked a model 4 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 19.9k • 517

liked a dataset 4 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13, 2025 • 247 • 24

liked 2 datasets 5 months ago

We-Math/We-Math2.0-Pro

Viewer • Updated Aug 19, 2025 • 4.55k • 265 • 21

We-Math/We-Math2.0-Standard

Viewer • Updated 2 days ago • 5.84k • 356 • 23

liked 2 models 5 months ago

Kwai-Klear/Klear-Reasoner-8B

8B • Updated Sep 27, 2025 • 25 • 19

dongguanting/RAG-Critic-3B

Text Generation • 3B • Updated Jun 28, 2025 • 48 • 4

liked 3 datasets 6 months ago

dongguanting/ARPO-SFT-54K

Viewer • Updated Oct 17, 2025 • 54.6k • 122 • 14

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17, 2025 • 1.07k • 68 • 6

dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated Oct 17, 2025 • 10k • 134 • 4

liked 5 models 6 months ago

dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12, 2025 • 11 • 1

dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12, 2025 • 13 • 5

dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19, 2025 • 34 • 2

dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29, 2025 • 15 • 2

dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12, 2025 • 3 • 3

KABI

AI & ML interests

Recent Activity

Organizations

dongguanting's activity