absl-py
accelerate==1.6.0
aiortc
av
diffusers
librosa
ml-collections
numpy
scipy
soundfile
torch
tqdm
transformers
git+https://github.com/microsoft/VibeVoice.git
pydub>=0.25.1