Dataset

by rahul7star - opened 1 day ago

1 day ago

I will try this soon, but can I train on a dataset that contains only SNAC codes?
Or do I need to feed in an audio dataset? Which of these two datasets would be the better bet?

rahul7star/hindi-speech-dataset

With nano layers
rahul7star/vaani-snac-cleaned

YatharthS

Owner 1 day ago

edited 1 day ago

@rahul7star Unfortunately, you have to do one of the following:

Convert the SNAC codes back into audio, then convert that audio into BiCodec codes (which is what MiraTTS uses).
Convert the audio directly into BiCodec codes. The notebook has a "process audio" section that does this.

So you should probably use rahul7star/hindi-speech-dataset, since it is the audio dataset.
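As a rough illustration of the two routes above, here is a minimal sketch that picks the right path per dataset record. Everything named here is an assumption for illustration: `to_bicodec_codes`, the `audio` and `snac_codes` column names, and the two callables are hypothetical, not part of MiraTTS or the notebook. In practice `encode_bicodec` would wrap the notebook's process audio step, and `decode_snac` would wrap a SNAC decoder.

```python
from typing import Any, Callable, Dict, Optional, Sequence


def to_bicodec_codes(
    record: Dict[str, Any],
    encode_bicodec: Callable[[Sequence[float]], Sequence[int]],
    decode_snac: Optional[Callable[[Any], Sequence[float]]] = None,
    audio_key: str = "audio",       # hypothetical column name
    snac_key: str = "snac_codes",   # hypothetical column name
) -> Sequence[int]:
    """Return BiCodec codes for one record, preferring raw audio when present."""
    audio = record.get(audio_key)
    if audio is not None:
        # Route 2: the audio is available, so encode it directly.
        return encode_bicodec(audio)

    snac_codes = record.get(snac_key)
    if snac_codes is not None:
        # Route 1: only SNAC codes exist, so decode them to audio first.
        if decode_snac is None:
            raise ValueError("record has only SNAC codes but no decoder was given")
        return encode_bicodec(decode_snac(snac_codes))

    raise KeyError(f"record has neither {audio_key!r} nor {snac_key!r}")
```

The direct route avoids an extra decode step, which is why a dataset that already stores audio is the simpler choice.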

rahul7star

about 21 hours ago

Also, you said MiraTTS is a fine-tune of Spark, but I don't see Mira listed under the Spark fine-tunes. Spark's fine-tunes are listed at https://huggingface.co/models?other=base_model:finetune:SparkAudio/Spark-TTS-0.5B. Which one is yours?

rahul7star

about 10 hours ago

edited 18 minutes ago

Does your model work with Hindi? I trained it on a Hindi dataset, but it does not seem to be working.
Space: https://huggingface.co/spaces/rahul7star/Mira-TTS
Model: rahul7star/mir-TTS
