Update README.md
Browse files
README.md
CHANGED
|
@@ -15,6 +15,37 @@ license_link: https://mistral.ai/licenses/MRL-0.1.md
|
|
| 15 |
This model was converted to GGUF format from [`allura-org/MS-Meadowlark-22B`](https://huggingface.co/allura-org/MS-Meadowlark-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 16 |
Refer to the [original model card](https://huggingface.co/allura-org/MS-Meadowlark-22B) for more details on the model.
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
## Use with llama.cpp
|
| 19 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 20 |
|
|
|
|
| 15 |
This model was converted to GGUF format from [`allura-org/MS-Meadowlark-22B`](https://huggingface.co/allura-org/MS-Meadowlark-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 16 |
Refer to the [original model card](https://huggingface.co/allura-org/MS-Meadowlark-22B) for more details on the model.
|
| 17 |
|
| 18 |
+
---
|
| 19 |
+
Model details:
|
| 20 |
+
-
|
| 21 |
+
A roleplay and storywriting model based on Mistral Small 22B.
|
| 22 |
+
|
| 23 |
+
GGUF models: https://huggingface.co/mradermacher/MS-Meadowlark-22B-GGUF/
|
| 24 |
+
|
| 25 |
+
EXL2 models: https://huggingface.co/CalamitousFelicitousness/MS-Meadowlark-22B-exl2
|
| 26 |
+
|
| 27 |
+
Datasets used in this model:
|
| 28 |
+
|
| 29 |
+
Dampfinchen/Creative_Writing_Multiturn at 16k
|
| 30 |
+
Fizzarolli/rosier-dataset + Alfitaria/body-inflation-org at 16k
|
| 31 |
+
ToastyPigeon/SpringDragon at 8k
|
| 32 |
+
|
| 33 |
+
Each dataset was trained separately onto Mistral Small Instruct, and then the component models were merged along with nbeerbower/Mistral-Small-Gutenberg-Doppel-22B to create Meadowlark.
|
| 34 |
+
|
| 35 |
+
I tried different blends of the component models, and this one seems to be the most stable while retaining creativity and unpredictability added by the trained data.
|
| 36 |
+
Instruct Format
|
| 37 |
+
|
| 38 |
+
Rosier/bodyinf and SpringDragon were trained in completion format. This model should work with Kobold Lite in Adventure Mode and Story Mode.
|
| 39 |
+
|
| 40 |
+
Creative_Writing_Multiturn and Gutenberg-Doppel were trained using the official instruct format of Mistral Small Instruct:
|
| 41 |
+
|
| 42 |
+
<s>[INST] {User message}[/INST] {Assistant response}</s>
|
| 43 |
+
|
| 44 |
+
This is the Mistral Small V2&V3 preset in SillyTavern and Kobold Lite.
|
| 45 |
+
|
| 46 |
+
For SillyTavern in particular I've had better luck getting good output from Mistral Small using a custom instruct template that formats the assembled context as a single user turn. This prevents SillyTavern from confusing the model by assembling user/assistant turns in a nonstandard way. Note: This preset is not compatible with Stepped Thinking, use the Mistral V2&V3 preset for that.
|
| 47 |
+
|
| 48 |
+
---
|
| 49 |
## Use with llama.cpp
|
| 50 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 51 |
|