hlnchen commited on
Commit
9f58406
·
verified ·
1 Parent(s): d2b624a

Align task_ids to dataset: overall -> chi_bench (per-domain ids unchanged)

Browse files
Files changed (1) hide show
  1. .eval_results/chi-bench.yaml +1 -1
.eval_results/chi-bench.yaml CHANGED
@@ -4,7 +4,7 @@
4
  # Values are pass@1 (%) for the best-performing harness for this model: OpenAI Agents SDK.
5
  - dataset:
6
  id: actava/chi-bench
7
- task_id: overall
8
  value: 14.2
9
  date: "2026-05-08"
10
  source:
 
4
  # Values are pass@1 (%) for the best-performing harness for this model: OpenAI Agents SDK.
5
  - dataset:
6
  id: actava/chi-bench
7
+ task_id: chi_bench
8
  value: 14.2
9
  date: "2026-05-08"
10
  source: