Update README.md
README.md
CHANGED
@@ -197,7 +197,7 @@ extra_gated_button_content: Submit
 
 Using llama.cpp commit [268c566](https://github.com/ggerganov/llama.cpp/commit/268c5660062270a2c19a36fc655168aa287aaec2) for quantization, featuring llama 3.1 rope scaling factors. This fixes low-quality issues when using 8-128k context lengths.
 
-Original model:
+Original model: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct
 
 All quants were made using the imatrix option and Bartowski's [calibration file](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8).
 