medmekk
/

Llama-3.2-1B-ao-int8wo

Text Generation

feature-extraction

torchao-my-repo

text-generation-inference

Model card Files Files and versions

medmekk HF Staff commited on Mar 31

Commit

1760ed9

·

verified ·

1 Parent(s): ea89f19

Upload folder using huggingface_hub

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -1,6 +1,8 @@
 ---
 base_model:
 - medmekk/Llama-3.2-1B-ao-int8wo
 ---
 # medmekk/Llama-3.2-1B-ao-int8wo (Quantized)
@@ -13,3 +15,19 @@ It's quantized using the TorchAO library using the [torchao-my-repo](https://hug
 - **Quantization Type**: int8_weight_only
 - **Group Size**: None

 ---
 base_model:
 - medmekk/Llama-3.2-1B-ao-int8wo
+tags:
+- torchao-my-repo
 ---
 # medmekk/Llama-3.2-1B-ao-int8wo (Quantized)
 - **Quantization Type**: int8_weight_only
 - **Group Size**: None
+# 📄 Original Model Information
+# medmekk/Llama-3.2-1B-ao-int8wo (Quantized)
+## Description
+This model is a quantized version of the original model [`medmekk/Llama-3.2-1B-ao-int8wo`](https://huggingface.co/medmekk/Llama-3.2-1B-ao-int8wo).
+It's quantized using the TorchAO library using the [torchao-my-repo](https://huggingface.co/spaces/pytorch/torchao-my-repo) space.
+## Quantization Details
+- **Quantization Type**: int8_weight_only
+- **Group Size**: None