Update README.md
vLLM also supports OpenAI-compatible serving. See the [documentation](https://docs.vllm.ai/en/latest/) for more details.
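
A typical workflow is to launch vLLM's OpenAI-compatible server and query it over HTTP. A minimal sketch (the model ID and generation settings here are placeholders, not values from this README):

```shell
# Launch an OpenAI-compatible server for this checkpoint.
vllm serve <model-id> --tensor-parallel-size 8

# Query it with the standard chat completions endpoint.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "<model-id>", "messages": [{"role": "user", "content": "Hello"}]}'
```

Any OpenAI-compatible client (for example the `openai` Python package with `base_url="http://localhost:8000/v1"`) can be pointed at this server.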

## Creation

We created this model using **MoE-Quant**, a library developed jointly with **ISTA** and tailored for the quantization of very large Mixture-of-Experts (MoE) models.

For more details, please refer to the [MoE-Quant repository](https://github.com/IST-DASLab/MoE-Quant).
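
To give a flavor of what weight quantization does, here is a toy sketch of per-group round-to-nearest 4-bit quantization. This is only an illustration of the general idea of low-bit weight compression, not MoE-Quant's actual algorithm:

```python
import numpy as np

def quantize_rtn_groupwise(w: np.ndarray, bits: int = 4, group_size: int = 128):
    """Toy per-group round-to-nearest quantization of a 1-D weight vector.

    Illustration only; NOT MoE-Quant's actual algorithm.
    """
    qmax = 2 ** (bits - 1) - 1                 # symmetric range, e.g. [-7, 7] for 4-bit
    groups = w.reshape(-1, group_size)         # one scale per group of weights
    scale = np.abs(groups).max(axis=1, keepdims=True) / qmax
    scale = np.maximum(scale, 1e-8)            # avoid division by zero for all-zero groups
    q = np.clip(np.round(groups / scale), -qmax - 1, qmax)  # integer codes
    return q.astype(np.int8), scale

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    return (q * scale).reshape(-1)

rng = np.random.default_rng(0)
w = rng.normal(size=1024).astype(np.float32)
q, s = quantize_rtn_groupwise(w)
err = np.abs(dequantize(q, s) - w).max()       # worst-case reconstruction error
```

Real quantizers such as those in MoE-Quant use calibration data and error-compensating updates (GPTQ-style) rather than plain rounding, which is what makes low-bit MoE quantization accurate at scale.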

## Evaluation

The model was evaluated on the OpenLLM Leaderboard (v1) tasks via [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness), and on popular reasoning tasks (AIME 2024, MATH-500, GPQA-Diamond) via [LightEval](https://github.com/huggingface/open-r1).
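
An lm-evaluation-harness run for the OpenLLM v1 tasks might look like the following. The model ID is a placeholder, and the exact task names and generation settings used for the reported numbers are assumptions, not taken from this README:

```shell
# Evaluate via lm-evaluation-harness using the vLLM backend.
lm_eval --model vllm \
  --model_args pretrained=<model-id>,tensor_parallel_size=8 \
  --tasks arc_challenge,hellaswag,mmlu,truthfulqa_mc2,winogrande,gsm8k \
  --batch_size auto
```

The reasoning benchmarks (AIME 2024, MATH-500, GPQA-Diamond) were run separately through LightEval, as linked above.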