F5-TTS
π£
2.76k
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate text from an image and question
Generate images based on prompts and input images
Generate images using prompts and LoRA models
More advanced and challenging multi-task evaluation
Display and analyze reward model evaluation results
Generate Python code solutions for coding problems
Annotate and describe images with text prompts
Analyze images to detect objects, generate captions, or perform OCR
Generate captions and analyze images with various tasks
Upload images or text for analysis and responses
a tiny vision language model
Transcribe audio with emotions and events