Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
XXXXyu
's Collections
vlut.cpp
vlut.cpp
updated
16 days ago
SOTA ternary-packed versions of 1.58-bit LLMs for efficient on-device inference with vlut.cpp.
Upvote
1
XXXXyu/Llama3-8B-1.58-100B-tokens-vlut-gguf
Text Generation
•
8B
•
Updated
16 days ago
•
176
XXXXyu/bitnet_b1_58-3B-vlut-gguf
Text Generation
•
3B
•
Updated
16 days ago
•
117
XXXXyu/Falcon3-1B-Instruct-1.58bit-vlut-gguf
Text Generation
•
2B
•
Updated
16 days ago
•
108
Upvote
1
Share collection
View history
Collection guide
Browse collections