LoftQ/Llama-2-7b-hf-4bit-64rank
Tags: Text Generation, Transformers, Safetensors, English, llama, quantization, lora, text-generation-inference, 4-bit precision, bitsandbytes
arxiv: 2310.08659
License: mit
Branch: main
Llama-2-7b-hf-4bit-64rank, 5.45 GB, 2 contributors
History: 25 commits
Latest commit: LoftQ, "Update README.md" (a412479, verified), over 1 year ago
Name                     Size       Flags  Last commit message                    Updated
gsm8k                    -          -      convert to bin                         almost 2 years ago
loftq_init               -          -      Update loftq_init/adapter_config.json  almost 2 years ago
.gitattributes           1.52 kB    Safe   initial commit                         almost 2 years ago
README.md                3.3 kB     Safe   Update README.md                       over 1 year ago
config.json              1.17 kB    Safe   Upload folder using huggingface_hub    over 1 year ago
generation_config.json   183 Bytes  Safe   Upload folder using huggingface_hub    over 1 year ago
model.safetensors        4.17 GB    xet    Upload folder using huggingface_hub    over 1 year ago
special_tokens_map.json  414 Bytes  Safe   Upload LoftQ models                    almost 2 years ago
tokenizer.json           1.84 MB    Safe   Upload LoftQ models                    almost 2 years ago
tokenizer.model          500 kB     xet    Upload LoftQ models                    almost 2 years ago
tokenizer_config.json    867 Bytes  Safe   Upload LoftQ models                    almost 2 years ago