Update README.md
Browse files
README.md
CHANGED
|
@@ -19,12 +19,12 @@ Run with [mistral.rs](https://github.com/EricLBuehler/mistral.rs). Documentation
|
|
| 19 |
3) **Customizable** 🛠️: Make and publish your own UQFF files in minutes.
|
| 20 |
## Files
|
| 21 |
|
| 22 |
-
|
|
| 23 |
-
|
| 24 |
-
|
|
| 25 |
-
|
|
| 26 |
-
|
|
| 27 |
-
|
|
| 28 |
-
|
|
| 29 |
-
|
|
| 30 |
-
|
|
|
|
|
| 19 |
3) **Customizable** 🛠️: Make and publish your own UQFF files in minutes.
|
| 20 |
## Files
|
| 21 |
|
| 22 |
+
|Quantization type(s)|Example|
|
| 23 |
+
|--|--|
|
| 24 |
+
|FP8|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-f8e4m3.uqff`|
|
| 25 |
+
|HQQ4|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-hqq4.uqff`|
|
| 26 |
+
|HQQ8|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-hqq8.uqff`|
|
| 27 |
+
|Q3K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-q3k.uqff`|
|
| 28 |
+
|Q4K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-q4k.uqff`|
|
| 29 |
+
|Q5K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-q5k.uqff`|
|
| 30 |
+
|Q8_0|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff gemma1.1-2b-instruct-q8_0.uqff`|
|