Generate a video from a single image
Analyze images to generate descriptive prompts
Engage in multimedia chat with LLMs and other ML models
Generate realistic audio from text
Generate images from text prompts