Add evaluation results (GPQA, MMLU-Pro, SWE-bench Verified, HLE)

#6
by SaylorTwift HF Staff - opened
Ready to merge
This branch is ready to get merged automatically.
