lbourdois commited on
Commit
6b7cf82
·
verified ·
1 Parent(s): 41be8eb

Improve language tag

Browse files

Hi! As the model is multilingual, this is a PR to add other languages than English to the language tag to improve the referencing. Note that 29 languages are announced in the README, but only 13 are explicitly listed. I was therefore only able to add these 13 languages.

Files changed (1) hide show
  1. README.md +57 -43
README.md CHANGED
@@ -1,44 +1,58 @@
1
- ---
2
- base_model:
3
- - Qwen/QwQ-32B-Preview
4
- - Qwen/Qwen2.5-32B-Instruct
5
- library_name: transformers
6
- tags:
7
- - mergekit
8
- - merge
9
- inference: true
10
- license: apache-2.0
11
- ---
12
- # merge
13
-
14
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
15
-
16
- ## Merge Details
17
- ### Merge Method
18
-
19
- This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview) as a base.
20
-
21
- ### Models Merged
22
-
23
- The following models were included in the merge:
24
- * [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct)
25
-
26
- ### Configuration
27
-
28
- The following YAML configuration was used to produce this model:
29
-
30
- ```yaml
31
- models:
32
- - model: Qwen/Qwen2.5-32B-Instruct
33
- parameters:
34
- weight: 1
35
- density: 1
36
- merge_method: ties
37
- base_model: Qwen/QwQ-32B-Preview
38
- parameters:
39
- weight: 1
40
- density: 1
41
- normalize: true
42
- int8_mask: true
43
- dtype: bfloat16
 
 
 
 
 
 
 
 
 
 
 
 
 
 
44
  ```
 
1
+ ---
2
+ base_model:
3
+ - Qwen/QwQ-32B-Preview
4
+ - Qwen/Qwen2.5-32B-Instruct
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
+ inference: true
10
+ license: apache-2.0
11
+ language:
12
+ - zho
13
+ - eng
14
+ - fra
15
+ - spa
16
+ - por
17
+ - deu
18
+ - ita
19
+ - rus
20
+ - jpn
21
+ - kor
22
+ - vie
23
+ - tha
24
+ - ara
25
+ ---
26
+ # merge
27
+
28
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
29
+
30
+ ## Merge Details
31
+ ### Merge Method
32
+
33
+ This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [Qwen/QwQ-32B-Preview](https://huggingface.co/Qwen/QwQ-32B-Preview) as a base.
34
+
35
+ ### Models Merged
36
+
37
+ The following models were included in the merge:
38
+ * [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct)
39
+
40
+ ### Configuration
41
+
42
+ The following YAML configuration was used to produce this model:
43
+
44
+ ```yaml
45
+ models:
46
+ - model: Qwen/Qwen2.5-32B-Instruct
47
+ parameters:
48
+ weight: 1
49
+ density: 1
50
+ merge_method: ties
51
+ base_model: Qwen/QwQ-32B-Preview
52
+ parameters:
53
+ weight: 1
54
+ density: 1
55
+ normalize: true
56
+ int8_mask: true
57
+ dtype: bfloat16
58
  ```