Update README.md
README.md
@@ -52,8 +52,8 @@ dataset for this model. This results in a DPO dataset composed by triplets < ”
 
 | | OR Bench 80K (refusal) ↓ | OR Bench Hard (refusal) ↓ |
 |------------------------------|:---------------------:|:---------------:|
-| Qwen-2.5-
-| Qwen-2.5-
+| Qwen-2.5-72B-Instruct | 0.015 | 0.102 |
+| Qwen-2.5-72B-Instruct-Egida-DPO | 0.016 | 0.170 |
 
 Note that this refusal ratio is computed as keyword matching with a curated list of keywords. For more information, check the paper.
 
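The note in the diff says the refusal ratio comes from keyword matching against a curated list. A minimal sketch of that kind of metric could look like the following. The keyword list, the function names `is_refusal` and `refusal_ratio`, and the case-insensitive substring rule are all assumptions made for illustration; the actual list and matching rule are described in the paper and are not reproduced here.

```python
from typing import Iterable, Sequence

# Hypothetical keywords for illustration only. The curated list used for the
# reported numbers is documented in the paper, not in this README diff.
REFUSAL_KEYWORDS: Sequence[str] = (
    "i'm sorry",
    "i cannot",
    "i can't",
    "i am unable",
)


def is_refusal(response: str, keywords: Sequence[str] = REFUSAL_KEYWORDS) -> bool:
    """Return True if any keyword occurs in the response.

    Matching is a case-insensitive substring test (an assumption), with
    typographic apostrophes normalized to plain ones first.
    """
    text = response.lower().replace("\u2019", "'")
    return any(keyword in text for keyword in keywords)


def refusal_ratio(
    responses: Iterable[str], keywords: Sequence[str] = REFUSAL_KEYWORDS
) -> float:
    """Return the fraction of responses flagged as refusals."""
    responses = list(responses)
    if not responses:
        raise ValueError("responses must not be empty")
    refused = sum(is_refusal(r, keywords) for r in responses)
    return refused / len(responses)


if __name__ == "__main__":
    sample = [
        "I'm sorry, but I can't help with that.",
        "Sure, here is a simple recipe.",
    ]
    print(refusal_ratio(sample))  # 0.5
```

Under this reading, a value such as 0.170 would mean that about 17% of the model's responses to the benchmark prompts contained at least one listed keyword. Substring matching is cheap but can both miss refusals phrased differently and flag helpful answers that happen to contain a keyword.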