RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Tags: Text Generation, Transformers, Safetensors, English, llama, nvidia, llama3.1, w4a16, int4, vllm, conversational, text-generation-inference, compressed-tensors
License: llama3.1
Branch: main
Total size: 39.5 GB
2 contributors
History: 2 commits
Latest commit: Eldar Kurtic, "add model" (59eb0db), 11 months ago
File                              Scan  Size       Last commit  Updated        Storage
.gitattributes                    Safe  1.57 kB    add model    11 months ago
README.md                         Safe  9.7 kB     add model    11 months ago
config.json                       Safe  25.7 kB    add model    11 months ago
generation_config.json            Safe  125 Bytes  add model    11 months ago
model-00001-of-00008.safetensors  Safe  4.95 GB    add model    11 months ago  xet
model-00002-of-00008.safetensors  Safe  4.98 GB    add model    11 months ago  xet
model-00003-of-00008.safetensors  Safe  4.98 GB    add model    11 months ago  xet
model-00004-of-00008.safetensors  Safe  4.93 GB    add model    11 months ago  xet
model-00005-of-00008.safetensors  Safe  4.98 GB    add model    11 months ago  xet
model-00006-of-00008.safetensors  Safe  4.98 GB    add model    11 months ago  xet
model-00007-of-00008.safetensors  Safe  4.98 GB    add model    11 months ago  xet
model-00008-of-00008.safetensors  Safe  4.75 GB    add model    11 months ago  xet
model.safetensors.index.json      Safe  210 kB     add model    11 months ago
recipe.yaml                       Safe  433 Bytes  add model    11 months ago
special_tokens_map.json           Safe  296 Bytes  add model    11 months ago
tokenizer.json                    Safe  17.2 MB    add model    11 months ago  xet
tokenizer_config.json             Safe  55.2 kB    add model    11 months ago