marcsun13
/

Meta-Llama-3-8B-torchao-int8_weight_only

Model card Files Files and versions

Meta-Llama-3-8B-torchao-int8_weight_only / README.md

marcsun13's picture

marcsun13 HF Staff

Upload folder using huggingface_hub

89de331 verified about 1 year ago

|

569 Bytes

	---
	base_model:
	- meta-llama/Meta-Llama-3-8B
	---

	# meta-llama/Meta-Llama-3-8B (Quantized)

	## Description
	This model is a quantized version of the original model `meta-llama/Meta-Llama-3-8B`. It has been quantized using int8_weight_only quantization with torchao.

	## Quantization Details
	- Quantization Type: int8_weight_only
	- Group Size: None

	## Usage
	You can use this model in your applications by loading it directly from the Hugging Face Hub:

	```python
	from transformers import AutoModel

	model = AutoModel.from_pretrained("meta-llama/Meta-Llama-3-8B")