HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation

Paper: https://arxiv.org/pdf/2506.02975

Code: https://github.com/Tencent/HaploVLM/tree/main/haploomni

Downloads last month
37
Safetensors
Model size
9B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support