Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
WildEval
non-profit
wild_eval
WildEval
Activity Feed
Request to join this org
Follow
14
AI & ML interests
None defined yet.
Recent Activity
yuntian-deng
authored
a paper
15 days ago
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
DongfuJiang
authored
a paper
24 days ago
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning
DongfuJiang
authored
a paper
24 days ago
VideoScore2: Think before You Score in Generative Video Evaluation
View all activity
Team members
9
WildEval
's datasets
9
Sort: Recently updated
WildEval/ZebraLogic
Viewer
•
Updated
Feb 4
•
4.26k
•
1.17k
•
12
WildEval/G-PlanET
Viewer
•
Updated
Aug 1, 2024
•
1.42k
•
11
•
1
WildEval/ZeroEval
Viewer
•
Updated
Jul 23, 2024
•
4.61k
•
3.24k
WildEval/WildBench-V2
Viewer
•
Updated
May 22, 2024
•
2.05k
•
15
WildEval/WildBench-Results-v2-internal
Viewer
•
Updated
May 21, 2024
•
30k
•
99
WildEval/WildBench-Results-V2
Viewer
•
Updated
May 20, 2024
•
10.2k
•
21
WildEval/WildBench-v2-dev
Viewer
•
Updated
Apr 19, 2024
•
5.99k
•
4
WildEval/WildBench-dev
Viewer
•
Updated
Apr 19, 2024
•
14.1k
•
8
•
1
WildEval/NaturalChats
Viewer
•
Updated
Apr 18, 2024
•
641k
•
4