Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
39
196
49
KABI
dongguanting
Follow
Altoculumus's profile picture
taicheng's profile picture
AndroidGuy's profile picture
59 followers
·
97 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
liked
a dataset
about 12 hours ago
XXHStudyHard/EnvScaler-SFT-Traj-9K
upvoted
a
paper
1 day ago
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting
upvoted
a
paper
1 day ago
ROI-Reasoning: Rational Optimization for Inference via Pre-Computation Meta-Cognition
View all activity
Organizations
dongguanting
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
about 12 hours ago
XXHStudyHard/EnvScaler-SFT-Traj-9K
Viewer
•
Updated
about 9 hours ago
•
9.02k
•
8
•
1
liked
a model
12 days ago
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
•
33B
•
Updated
20 days ago
•
13
•
1
liked
a model
20 days ago
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
•
8B
•
Updated
20 days ago
•
21
•
2
liked
3 datasets
2 months ago
We-Math/VTBench
Viewer
•
Updated
Nov 26, 2025
•
500
•
99
•
7
We-Math/V-Perception-40K
Viewer
•
Updated
Nov 7, 2025
•
36.7k
•
106
•
7
We-Math/V-Interaction-400K
Viewer
•
Updated
Nov 7, 2025
•
253k
•
829
•
14
liked
a model
4 months ago
meituan-longcat/LongCat-Flash-Chat
Text Generation
•
562B
•
Updated
Sep 24, 2025
•
19.9k
•
517
liked
a dataset
4 months ago
inclusionAI/ASearcher-train-data
Preview
•
Updated
Aug 13, 2025
•
247
•
24
liked
2 datasets
5 months ago
We-Math/We-Math2.0-Pro
Viewer
•
Updated
Aug 19, 2025
•
4.55k
•
265
•
21
We-Math/We-Math2.0-Standard
Viewer
•
Updated
2 days ago
•
5.84k
•
356
•
23
liked
2 models
5 months ago
Kwai-Klear/Klear-Reasoner-8B
8B
•
Updated
Sep 27, 2025
•
25
•
19
dongguanting/RAG-Critic-3B
Text Generation
•
3B
•
Updated
Jun 28, 2025
•
48
•
4
liked
3 datasets
6 months ago
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Oct 17, 2025
•
54.6k
•
122
•
14
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Oct 17, 2025
•
1.07k
•
68
•
6
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Oct 17, 2025
•
10k
•
134
•
4
liked
5 models
6 months ago
dongguanting/Llama3.1-8B-ARPO
Text Generation
•
8B
•
Updated
Aug 12, 2025
•
11
•
1
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
•
15B
•
Updated
Aug 12, 2025
•
13
•
5
dongguanting/Qwen2.5-7B-ARPO
Text Generation
•
8B
•
Updated
Aug 19, 2025
•
34
•
2
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B
•
Updated
Jul 29, 2025
•
15
•
2
dongguanting/Qwen2.5-3B-ARPO
Text Generation
•
3B
•
Updated
Aug 12, 2025
•
3
•
3
Load more