tarun7r
/

vibevoice-hindi-lora

audio-generation

Model card Files Files and versions

tarun7r commited on 28 days ago

Commit

d8ce093

·

verified ·

1 Parent(s): efedef6

Update README.md

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -35,6 +35,11 @@ LoRA is a parameter-efficient fine-tuning technique that adds trainable rank dec
   - Diffusion head (full fine-tuning)
   - Acoustic and Semantic connectors
 ## What's Included
 This repository contains:
@@ -128,11 +133,6 @@ python demo/gradio_demo.py \
 - Works with both 1.5B and 7B models (ensure checkpoint matches model size)
 - Make sure `hi-Priya_woman.wav` is in the `demo/voices/` directory
-## Demo
-### Sample Output:
-<audio controls src="https://huggingface.co/tarun7r/vibevoice-hindi-lora/resolve/main/demo.wav" style="width: 100%;"></audio>
 **Important Note:** The quality of the generated audio depends heavily on the reference voice file you provide in the `demo/voices/` directory. For best results:
 - Use high-quality, clear voice samples
 - Ensure the reference voice matches the desired speaking style

   - Diffusion head (full fine-tuning)
   - Acoustic and Semantic connectors
+## Demo
+### Sample Output:
+<audio controls src="https://huggingface.co/tarun7r/vibevoice-hindi-lora/resolve/main/demo.wav" style="width: 100%;"></audio>
 ## What's Included
 This repository contains:
 - Works with both 1.5B and 7B models (ensure checkpoint matches model size)
 - Make sure `hi-Priya_woman.wav` is in the `demo/voices/` directory
 **Important Note:** The quality of the generated audio depends heavily on the reference voice file you provide in the `demo/voices/` directory. For best results:
 - Use high-quality, clear voice samples
 - Ensure the reference voice matches the desired speaking style