Susant-Achary commited on
Commit
cb2e929
·
verified ·
1 Parent(s): 043a1c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -2
README.md CHANGED
@@ -97,10 +97,8 @@ While this card is **3-bit**, teams often publish multiple precisions. Use this
97
 
98
  | Variant | Typical Peak RAM | Relative Speed | Typical Behavior | When to choose |
99
  |---|---:|:---:|---|---|
100
- | **2-bit** | ~3.5–4.5 GB | 🔥🔥🔥🔥 | Smallest, most lossy | Absolute minimum RAM / smoke tests |
101
  | **3-bit** *(this repo)* | **~4.4–8.8 GB** | **🔥🔥🔥🔥** | **Direct, concise**, great latency | **Default** on 8–16 GB Macs |
102
  | **4-bit** | ~6–8 GB | 🔥🔥🔥 | Better detail retention vs 3-bit | If 3-bit misses small details |
103
- | **5-bit** | ~8–9.5 GB | 🔥🔥☆ | Higher fidelity | Documents/JSON extraction |
104
  | **6-bit** | ~7.5–12.5 GB | 🔥🔥 | Best quality under quant | Choose if RAM allows |
105
  | **8-bit** | ~9.5–12+ GB | 🔥🔥 | Largest quantized size / highest fidelity | When you prefer simpler 8-bit workflows |
106
 
 
97
 
98
  | Variant | Typical Peak RAM | Relative Speed | Typical Behavior | When to choose |
99
  |---|---:|:---:|---|---|
 
100
  | **3-bit** *(this repo)* | **~4.4–8.8 GB** | **🔥🔥🔥🔥** | **Direct, concise**, great latency | **Default** on 8–16 GB Macs |
101
  | **4-bit** | ~6–8 GB | 🔥🔥🔥 | Better detail retention vs 3-bit | If 3-bit misses small details |
 
102
  | **6-bit** | ~7.5–12.5 GB | 🔥🔥 | Best quality under quant | Choose if RAM allows |
103
  | **8-bit** | ~9.5–12+ GB | 🔥🔥 | Largest quantized size / highest fidelity | When you prefer simpler 8-bit workflows |
104