Update README.md
README.md
CHANGED
@@ -58,17 +58,6 @@ library_name: transformers
 --ctx-size 128000
 ```
 
-- Run as LlamaEdge command app
-
-```bash
-wasmedge --dir .:. \
---nn-preload default:GGML:AUTO:Qwen2-VL-7B-Instruct-Q5_K_M.gguf \
-llama-chat.wasm \
---prompt-template qwen2-vision \
---llava-mmproj Qwen2-VL-7B-Instruct-vision-encoder.gguf
---ctx-size 128000
-```
-
 ## Quantized GGUF Models
 
 | Name | Quant method | Bits | Size | Use case |