HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation
Paper: https://arxiv.org/pdf/2506.02975
Code: https://github.com/Tencent/HaploVLM/tree/main/haploomni