Update README.md
Browse files
README.md
CHANGED
|
@@ -35,6 +35,11 @@ LoRA is a parameter-efficient fine-tuning technique that adds trainable rank dec
|
|
| 35 |
- Diffusion head (full fine-tuning)
|
| 36 |
- Acoustic and Semantic connectors
|
| 37 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 38 |
## What's Included
|
| 39 |
|
| 40 |
This repository contains:
|
|
@@ -128,11 +133,6 @@ python demo/gradio_demo.py \
|
|
| 128 |
- Works with both 1.5B and 7B models (ensure checkpoint matches model size)
|
| 129 |
- Make sure `hi-Priya_woman.wav` is in the `demo/voices/` directory
|
| 130 |
|
| 131 |
-
## Demo
|
| 132 |
-
|
| 133 |
-
### Sample Output:
|
| 134 |
-
<audio controls src="https://huggingface.co/tarun7r/vibevoice-hindi-lora/resolve/main/demo.wav" style="width: 100%;"></audio>
|
| 135 |
-
|
| 136 |
**Important Note:** The quality of the generated audio depends heavily on the reference voice file you provide in the `demo/voices/` directory. For best results:
|
| 137 |
- Use high-quality, clear voice samples
|
| 138 |
- Ensure the reference voice matches the desired speaking style
|
|
|
|
| 35 |
- Diffusion head (full fine-tuning)
|
| 36 |
- Acoustic and Semantic connectors
|
| 37 |
|
| 38 |
+
## Demo
|
| 39 |
+
|
| 40 |
+
### Sample Output:
|
| 41 |
+
<audio controls src="https://huggingface.co/tarun7r/vibevoice-hindi-lora/resolve/main/demo.wav" style="width: 100%;"></audio>
|
| 42 |
+
|
| 43 |
## What's Included
|
| 44 |
|
| 45 |
This repository contains:
|
|
|
|
| 133 |
- Works with both 1.5B and 7B models (ensure checkpoint matches model size)
|
| 134 |
- Make sure `hi-Priya_woman.wav` is in the `demo/voices/` directory
|
| 135 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 136 |
**Important Note:** The quality of the generated audio depends heavily on the reference voice file you provide in the `demo/voices/` directory. For best results:
|
| 137 |
- Use high-quality, clear voice samples
|
| 138 |
- Ensure the reference voice matches the desired speaking style
|