You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What steps would I need to take to train my own voice model? I have audio files/clips I can use, but I am unfamiliar with the process of training an AI Voice model.
I've trained a stable diffusion textual inversion model on the webui, but am unsure how different this process would be.
The text was updated successfully, but these errors were encountered:
What steps would I need to take to train my own voice model? I have audio files/clips I can use, but I am unfamiliar with the process of training an AI Voice model. I've trained a stable diffusion textual inversion model on the webui, but am unsure how different this process would be.
Hi~ Training a voice conversion model is almost the same with the process you train a stable diffusion model. But there are, of course, some tiny differences. I think a good starting point could be StarGANv2-VC, which is totlally self-supervised and you only need to feed it with audio data. The author has also provided detailed training and inference procedure so it won't be difficult to get started:)
What steps would I need to take to train my own voice model? I have audio files/clips I can use, but I am unfamiliar with the process of training an AI Voice model.
I've trained a stable diffusion textual inversion model on the webui, but am unsure how different this process would be.
The text was updated successfully, but these errors were encountered: