Commit 078123a by entfane · verified · 1 Parent(s): efedcaf

Update README.md

Files changed (1)
  1. README.md +19 -18
README.md CHANGED
@@ -1,20 +1,16 @@
- ---
- library_name: transformers
- tags:
- - trl
- - dpo
- language:
- - en
- base_model:
- - entfane/math-professor-3B
- pipeline_tag: text-generation
- metrics:
- - type: accuracy
- name: GSM8K accuracy
- value: 0.58
- verified: false
- source: Evaluated manually on the GSM8K test set using final response match.
- ---
+ ---
+ library_name: transformers
+ tags:
+ - trl
+ - dpo
+ language:
+ - en
+ base_model:
+ - entfane/math-professor-3B
+ pipeline_tag: text-generation
+ metrics:
+ - accuracy
+ ---
 
  <img src="https://huggingface.co/entfane/math-professor-3B-dpo/resolve/main/math-professor-image.png" width="300" height="300"/>
 
@@ -47,4 +43,9 @@ input = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_p
  encoded_input = tokenizer(input, return_tensors = "pt").to(model.device)
  output = model.generate(**encoded_input, max_new_tokens=1024)
  print(tokenizer.decode(output[0], skip_special_tokens=False))
- ```
+ ```
+
+ ### Evaluation
+
+ The model was evaluated on the [openai/gsm8k](https://huggingface.co/datasets/openai/gsm8k) dataset using final response match.
+ Accuracy of the final response: <b>58%</b>
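
The commit does not show how the final response match was computed, so the following is only a minimal sketch of one plausible scoring routine. It assumes that the last number in a model response counts as its final answer, and it relies on GSM8K reference solutions ending with `#### <answer>`. The function names `extract_final_number`, `gold_answer`, and `final_response_accuracy` are hypothetical and are not taken from the author's evaluation code.

```python
import re


def extract_final_number(text):
    """Return the last number found in text as a float, or None if there is none."""
    # Accept an optional minus sign, thousands separators, and a decimal part.
    matches = re.findall(r"-?\d[\d,]*(?:\.\d+)?", text)
    if not matches:
        return None
    return float(matches[-1].replace(",", ""))


def gold_answer(reference):
    """Read the reference answer that follows the '####' marker in a GSM8K solution."""
    return extract_final_number(reference.split("####")[-1])


def final_response_accuracy(responses, references):
    """Fraction of responses whose final number equals the reference answer."""
    correct = 0
    for response, reference in zip(responses, references):
        predicted = extract_final_number(response)
        # A response without any number is counted as wrong.
        if predicted is not None and predicted == gold_answer(reference):
            correct += 1
    return correct / len(references)


# Toy data for illustration only, not real model outputs.
responses = [
    "She sells 9 eggs at 2 dollars each, so 9 * 2 = 18. The answer is 18.",
    "I think it is 7",
]
references = [
    "16 - 3 - 4 = 9\n9 * 2 = 18\n#### 18",
    "3 + 5 = 8\n#### 8",
]
accuracy = final_response_accuracy(responses, references)
print(f"Final response accuracy: {accuracy:.0%}")
```

In practice `responses` would hold the decoded outputs of `model.generate` for each GSM8K question and `references` the dataset's `answer` field. Taking the last number is a heuristic that can misread responses which keep talking after stating the answer.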