1D-subchaneled DeepSeek checkpoints for usage on Google TPUs
Building on HF
Jacob Platin
jrplatin
AI & ML interests
None yet
Recent Activity
updated
a model
3 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-512
published
a model
3 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-512
updated
a model
3 days ago
jrplatin/DeepSeek-R1-1D-Subchannel-256-Packed
Organizations
None yet