Generate text and images based on prompts
Search arXiv papers and read them aloud with a TTS voice
Generate high-quality speech from text
Transform music style using a reference track