Generate a throughput plot for LLMs on devices
Convert models to Safetensors and open a PR
Duplicate Hugging Face repositories