Commit 7edc277 (verified) by Wwayu
1 Parent(s): 6d9b2ac

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ - zh
+ tags:
+ - unsloth
+ - QiMing
+ - vllm
+ - sales
+ - b2b
+ - Strategist
+ - saas
+ - fine-tuned
+ - instruction-following
+ - role-playing
+ - cognitive-simulator
+ - mlx
+ - mlx-my-repo
+ pipeline_tag: text-generation
+ model_name: QiMing-Strategist-20B
+ library_name: transformers
+ base_model: aifeifei798/QiMing-Strategist-20B-MXFP4
+ ---
+
+ # Wwayu/QiMing-Strategist-20B-MXFP4-mlx-4Bit
+
+ The model [Wwayu/QiMing-Strategist-20B-MXFP4-mlx-4Bit](https://huggingface.co/Wwayu/QiMing-Strategist-20B-MXFP4-mlx-4Bit) was converted to MLX format from [aifeifei798/QiMing-Strategist-20B-MXFP4](https://huggingface.co/aifeifei798/QiMing-Strategist-20B-MXFP4) using mlx-lm version **0.26.4**.
+
+ ## Use with mlx
+
+ ```bash
+ pip install mlx-lm
+ ```
+
+ ```python
+ from mlx_lm import load, generate
+
+ # Download (if needed) and load the converted weights and tokenizer.
+ model, tokenizer = load("Wwayu/QiMing-Strategist-20B-MXFP4-mlx-4Bit")
+
+ prompt = "hello"
+
+ # Wrap the prompt in the model's chat template when one is defined.
+ if hasattr(tokenizer, "apply_chat_template") and tokenizer.chat_template is not None:
+     messages = [{"role": "user", "content": prompt}]
+     prompt = tokenizer.apply_chat_template(
+         messages, tokenize=False, add_generation_prompt=True
+     )
+
+ # verbose=True prints the generated text as it is produced.
+ response = generate(model, tokenizer, prompt=prompt, verbose=True)
+ ```
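
The chat-template check in the README snippet above can be factored into a small helper. The following is a minimal sketch, not part of the committed README: `build_prompt` and `StubTokenizer` are hypothetical names introduced here for illustration, and the optional system message assumes the model's chat template accepts a `system` role, which the README does not confirm. The stub stands in for a real tokenizer so the helper can be exercised without downloading the model.

```python
from typing import Optional


def build_prompt(tokenizer, user_text: str, system_text: Optional[str] = None) -> str:
    """Return a prompt string, using the tokenizer's chat template when it has one."""
    has_template = (
        hasattr(tokenizer, "apply_chat_template")
        and getattr(tokenizer, "chat_template", None) is not None
    )
    if not has_template:
        # No chat template available: fall back to plain text.
        return user_text if system_text is None else f"{system_text}\n\n{user_text}"

    messages = []
    if system_text is not None:
        # Assumption: the chat template supports a "system" role.
        messages.append({"role": "system", "content": system_text})
    messages.append({"role": "user", "content": user_text})
    return tokenizer.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True
    )


class StubTokenizer:
    """Hypothetical stand-in for a real tokenizer, used only to exercise build_prompt."""

    chat_template = "stub"

    def apply_chat_template(self, messages, tokenize=False, add_generation_prompt=True):
        # Toy rendering: tag each message with its role, then open the assistant turn.
        text = "".join(f"<{m['role']}>{m['content']}" for m in messages)
        return text + ("<assistant>" if add_generation_prompt else "")


if __name__ == "__main__":
    print(build_prompt(StubTokenizer(), "hello", system_text="You are a B2B sales strategist."))
```

With the real model, the tokenizer returned by `load(...)` would take the place of the stub, and the resulting string would be passed to `generate(model, tokenizer, prompt=prompt, verbose=True)` as in the README.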