Commit
·
5f3d251
1
Parent(s):
5267da9
Update README.md
Browse files
README.md
CHANGED
|
@@ -47,14 +47,17 @@ We evaluated orca_mini_v3_70b on a wide range of tasks using [Language Model Eva
|
|
| 47 |
|
| 48 |
Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
| 49 |
|
| 50 |
-
|
| 51 |
-
|
| 52 |
-
|**Task**|**
|
| 53 |
-
|*
|
| 54 |
-
|*
|
| 55 |
-
|*
|
| 56 |
-
|*
|
| 57 |
-
|
|
|
|
|
|
|
|
|
|
| 58 |
|
| 59 |
|
| 60 |
<br>
|
|
@@ -84,7 +87,7 @@ tokenizer = AutoTokenizer.from_pretrained("psmathur/orca_mini_v3_70b")
|
|
| 84 |
model = AutoModelForCausalLM.from_pretrained(
|
| 85 |
"psmathur/orca_mini_v3_70b",
|
| 86 |
torch_dtype=torch.float16,
|
| 87 |
-
|
| 88 |
low_cpu_mem_usage=True,
|
| 89 |
device_map="auto"
|
| 90 |
)
|
|
@@ -120,6 +123,7 @@ Please kindly cite using the following BibTeX:
|
|
| 120 |
@misc{orca_mini_v3_70b,
|
| 121 |
author = {Pankaj Mathur},
|
| 122 |
title = {orca_mini_v3_70b: An Orca Style Llama2-70b model},
|
|
|
|
| 123 |
year = {2023},
|
| 124 |
publisher = {HuggingFace},
|
| 125 |
journal = {HuggingFace repository},
|
|
|
|
| 47 |
|
| 48 |
Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
|
| 49 |
|
| 50 |
+
|||
|
| 51 |
+
|:------:|:--------:|
|
| 52 |
+
|**Task**|**Value**|
|
| 53 |
+
|*ARC*|0.7125|
|
| 54 |
+
|*HellaSwag*|0.8785|
|
| 55 |
+
|*MMLU*|0.7018|
|
| 56 |
+
|*TruthfulQA*|0.6127|
|
| 57 |
+
|*Winogrande*|0.8272|
|
| 58 |
+
|*GSM8K*|0.4086|
|
| 59 |
+
|*DROP*|0.4017|
|
| 60 |
+
|**Total Average**|**0.649**|
|
| 61 |
|
| 62 |
|
| 63 |
<br>
|
|
|
|
| 87 |
model = AutoModelForCausalLM.from_pretrained(
|
| 88 |
"psmathur/orca_mini_v3_70b",
|
| 89 |
torch_dtype=torch.float16,
|
| 90 |
+
load_in_4bit=True,
|
| 91 |
low_cpu_mem_usage=True,
|
| 92 |
device_map="auto"
|
| 93 |
)
|
|
|
|
| 123 |
@misc{orca_mini_v3_70b,
|
| 124 |
author = {Pankaj Mathur},
|
| 125 |
title = {orca_mini_v3_70b: An Orca Style Llama2-70b model},
|
| 126 |
+
month = {august},
|
| 127 |
year = {2023},
|
| 128 |
publisher = {HuggingFace},
|
| 129 |
journal = {HuggingFace repository},
|