@mrfakename on Hugging Face: "Trained a model for emotion-controllable TTS based on MiMo audio on LAION's…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

posted an update Oct 28, 2025

Post

6472

Trained a model for emotion-controllable TTS based on MiMo audio on LAION's dataset.

Still very early and does have an issue with hallucinating but results seem pretty good so far, given that it is very early into the training run.

Will probably kick off a new run later with some settings tweaked.

Put up a demo here: https://huggingface.co/spaces/mrfakename/EmoAct-MiMo

(Turn 🔊 on to hear audio samples)

victor

Oct 28, 2025

wait how did you do that 🤯

mrfakename

Oct 28, 2025

Fine-tuned MiMo Audio to accept text/emotion captions (e.g. "intense fury, rage, hate") as input, trained a LoRA for 1k steps on LAION's voice acting dataset.

Thanks to HF for the GPUs to train 🤗

AtAndDev

Oct 29, 2025

Whaaaaa damn thats really good!

jsob7

Dec 2, 2025

wow

In this post

mrfakename mrfakename
victor Victor Mustar
blanchon Julien BLANCHON
AtAndDev alkinun
jsob7 9