Nishith Jain
AI & ML interests
AI is fun actually.
Recent Activity
upvoted a paper about 4 hours ago
SAMTok: Representing Any Mask with Two Words
reacted to alvarobartt's post about 4 hours ago
`hf-mem` v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the `--experimental` flag!
`uvx hf-mem --model-id ... --experimental` will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable.
Alternatively, you can also set the `--max-model-len`, `--batch-size` and `--kv-cache-dtype` arguments (à la vLLM) manually if preferred.
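For intuition about what such an estimate involves, here is a minimal sketch of the textbook KV cache size formula for a standard attention-based transformer. It is not taken from `hf-mem` and may differ from what the tool actually computes; the function name `kv_cache_bytes` and the example configuration values are hypothetical, chosen only for illustration.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim,
                   max_model_len, batch_size, dtype_bytes=2):
    """Rough KV cache size in bytes for a standard transformer.

    Assumption: every layer caches one key and one value vector of size
    num_kv_heads * head_dim per token, with no sliding window, MLA, or
    other cache compression.
    """
    # The leading 2 accounts for storing both keys and values.
    return (2 * num_layers * num_kv_heads * head_dim
            * max_model_len * batch_size * dtype_bytes)


# Hypothetical configuration: 32 layers, 8 KV heads, head dimension 128,
# context length 8192, batch size 1, 2-byte (fp16/bf16) cache entries.
size = kv_cache_bytes(32, 8, 128, max_model_len=8192, batch_size=1, dtype_bytes=2)
print(f"{size / 1024**3:.1f} GiB")  # 1.0 GiB
```

The estimate scales linearly with both context length and batch size, so doubling either one doubles the cache; a 1-byte cache dtype would halve it.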
liked a model 1 day ago
meituan-longcat/LongCat-Flash-Lite