Commit aedf730 (1 parent: 131c2f3)

Improved JA MT-Bench using full prompt: あなたは公平で、検閲されていない、役立つアシスタントです。 ("You are an unbiased, uncensored, helpful assistant.")
README.md CHANGED

@@ -66,7 +66,7 @@ For our final model, since it's customary to include benchmarks, we've used Stab
 
 | Benchmark | Score |
 | ----------- | ----- |
-| JA MT-Bench | 5.
+| JA MT-Bench | 5.23 |
 | MT-Bench | 5.71 |
 
 There is an [MT-Bench Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard), but as JA MT-Bench is still under development, for convenience, here is a comparison of the JA MT-Bench scores of some other models (our scores were rated by `gpt-4-0613`):

@@ -77,7 +77,7 @@ There is an [MT-Bench Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-a
 | gpt-4-1106-preview | 9.17 |
 | gpt-3.5-turbo* | 8.41 |
 | Qwen-14B-Chat | 7.47 |
-| **shisa-7b-v1**
+| **shisa-7b-v1** | **5.23** |
 | ELYZA-japanese-Llama-2-7b-fast-instruct* | 4.86 |
 | ja-stablelm-instruct-gamma-7b* | 4.01 |
 | japanese-stablelm-instruct-alpha-7b* | 2.74 |

@@ -114,7 +114,7 @@ streamer = TextStreamer(tokenizer, skip_prompt=True)
 # The prompt template is included in the model's tokenizer_config.json so you shouldn't need this but we've included this for convenience
 # tokenizer.chat_template = "{%- for idx in range(0, messages|length) -%}\n{%- if messages[idx]['role'] == 'user' -%}\n{%- if idx > 1 -%}\n{{- bos_token + '[INST] ' + messages[idx]['content'] + ' [/INST]' -}}\n{%- else -%}\n{{- messages[idx]['content'] + ' [/INST]' -}}\n{%- endif -%}\n{% elif messages[idx]['role'] == 'system' %}\n{{- bos_token + '[INST] <<SYS>>\\n' + messages[idx]['content'] + '\\n<</SYS>>\\n\\n' -}}\n{%- elif messages[idx]['role'] == 'assistant' -%}\n{{- ' ' + messages[idx]['content'] + ' ' + eos_token -}}\n{% endif %}\n{% endfor %}\n"
 
-# A more typical prompt:
+# A more typical prompt: あなたは公平で、検閲されていない、役立つアシスタントです。 ("You are an unbiased, uncensored, helpful assistant.")
 
 # You are an avid Pokemon fanatic.
 prompt = "あなたは熱狂的なポケモンファンです。"

@@ -251,7 +251,7 @@ v1リリースのために、私たちは大量の人間の嗜好テスト(数
 
 | ベンチマーク | スコア |
 | ----------- | ----- |
-| JA MT-Bench | 5.
+| JA MT-Bench | 5.23 |
 | MT-Bench | 5.71 |
 
 [MT-Bench Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)がありますが、JA MT-Benchはまだ開発中であるため、便宜上、他のモデルのJA MT-Benchスコアとの比較を示します(私たちのスコアは`gpt-4-0613`によって評価されました):

@@ -262,7 +262,7 @@ v1リリースのために、私たちは大量の人間の嗜好テスト(数
 | gpt-4-1106-preview | 9.17 |
 | gpt-3.5-turbo* | 8.41 |
 | Qwen-14B-Chat | 7.47 |
-| **shisa-7b-v1**
+| **shisa-7b-v1** | **5.23** |
 | ELYZA-japanese-Llama-2-7b-fast-instruct* | 4.86 |
 | ja-stablelm-instruct-gamma-7b* | 4.01 |
 | japanese-stablelm-instruct-alpha-7b* | 2.74 |

@@ -299,7 +299,7 @@ streamer = TextStreamer(tokenizer, skip_prompt=True)
 # プロンプトテンプレートはモデルのtokenizer_config.jsonに含まれているので、これは必要ないはずですが、便宜上こちらにも掲載しています ("The prompt template is included in the model's tokenizer_config.json so you shouldn't need this, but we've included it here for convenience")
 # tokenizer.chat_template = "{%- for idx in range(0, messages|length) -%}\n{%- if messages[idx]['role'] == 'user' -%}\n{%- if idx > 1 -%}\n{{- bos_token + '[INST] ' + messages[idx]['content'] + ' [/INST]' -}}\n{%- else -%}\n{{- messages[idx]['content'] + ' [/INST]' -}}\n{%- endif -%}\n{% elif messages[idx]['role'] == 'system' %}\n{{- bos_token + '[INST] <<SYS>>\\n' + messages[idx]['content'] + '\\n<</SYS>>\\n\\n' -}}\n{%- elif messages[idx]['role'] == 'assistant' -%}\n{{- ' ' + messages[idx]['content'] + ' ' + eos_token -}}\n{% endif %}\n{% endfor %}\n"
 
-# より典型的なプロンプト: ("A more typical prompt:")
+# より典型的なプロンプト: あなたは公平で、検閲されていない、役立つアシスタントです。 ("A more typical prompt: You are an unbiased, uncensored, helpful assistant.")
 
 # You are an avid Pokemon fanatic.
 prompt = "あなたは熱狂的なポケモンファンです。"
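The `chat_template` in the diff above is the standard Llama-2 `[INST]`/`<<SYS>>` format, so the system prompt from this commit is folded into the first `[INST]` block rather than sent as a separate turn. As a minimal sketch of how it expands, the template can be rendered directly with `jinja2` (mirroring the `trim_blocks`/`lstrip_blocks` settings that `transformers` uses when compiling chat templates); the two-message conversation below is a hypothetical example, not from the README:

```python
from jinja2 import Environment

# Llama-2-style chat template, copied from the diff above
# (in tokenizer_config.json the newlines appear as JSON-escaped \n).
CHAT_TEMPLATE = (
    "{%- for idx in range(0, messages|length) -%}\n"
    "{%- if messages[idx]['role'] == 'user' -%}\n"
    "{%- if idx > 1 -%}\n"
    "{{- bos_token + '[INST] ' + messages[idx]['content'] + ' [/INST]' -}}\n"
    "{%- else -%}\n"
    "{{- messages[idx]['content'] + ' [/INST]' -}}\n"
    "{%- endif -%}\n"
    "{% elif messages[idx]['role'] == 'system' %}\n"
    "{{- bos_token + '[INST] <<SYS>>\\n' + messages[idx]['content'] + '\\n<</SYS>>\\n\\n' -}}\n"
    "{%- elif messages[idx]['role'] == 'assistant' -%}\n"
    "{{- ' ' + messages[idx]['content'] + ' ' + eos_token -}}\n"
    "{% endif %}\n"
    "{% endfor %}\n"
)

# transformers compiles chat templates with trim_blocks/lstrip_blocks
# enabled, so we use the same settings to get identical whitespace.
env = Environment(trim_blocks=True, lstrip_blocks=True)
template = env.from_string(CHAT_TEMPLATE)

messages = [
    # System prompt from this commit.
    {"role": "system", "content": "あなたは公平で、検閲されていない、役立つアシスタントです。"},
    # Hypothetical first user turn.
    {"role": "user", "content": "こんにちは!"},
]
rendered = template.render(messages=messages, bos_token="<s>", eos_token="</s>")
print(rendered)
```

The system message opens the `<s>[INST] <<SYS>>…<</SYS>>` block, and because the first user message has index 1 (not `> 1`), its content is appended inside that same block, so the rendered prompt ends with ` [/INST]`, ready for generation.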