Generating samples from generated Mel-spectrograms #13
Comments
Hi @francislata, So the |
@bshall - Can you be more specific about which parameters need to match? If the Mel-spectrogram I'm given was generated by a TTS system, can't I just take it and put it through the vocoder? The generated audio by following the padding of the Mel-spectrogram in |
Hi @francislata, sorry about the delay. I used Unfortunately, different preprocessing does have a big effect, so it's very important that the preprocessing pipelines for the TTS system and the vocoder line up. |
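One way to check whether the two pipelines line up is to compare their preprocessing settings side by side before feeding a generated Mel-spectrogram to the vocoder. The following is only a minimal sketch: the parameter names (`sample_rate`, `n_fft`, `hop_length`, and so on) and the example values are assumptions for illustration, not the actual configuration keys of this repository or of any particular TTS system.

```python
# Hypothetical parameter names; substitute whatever your TTS system and
# vocoder configs actually call these settings.
KEYS = (
    "sample_rate", "n_fft", "hop_length", "win_length",
    "n_mels", "fmin", "fmax", "preemphasis", "top_db",
)


def preprocessing_mismatches(tts_cfg, vocoder_cfg, keys=KEYS):
    """Return {key: (tts_value, vocoder_value)} for every setting that differs."""
    mismatches = {}
    for key in keys:
        tts_value = tts_cfg.get(key)
        vocoder_value = vocoder_cfg.get(key)
        if tts_value != vocoder_value:
            mismatches[key] = (tts_value, vocoder_value)
    return mismatches


# Example values are made up for illustration only.
tts_cfg = {"sample_rate": 22050, "n_fft": 1024, "hop_length": 256, "n_mels": 80}
vocoder_cfg = {"sample_rate": 16000, "n_fft": 2048, "hop_length": 200, "n_mels": 80}

for key, (tts_value, vocoder_value) in preprocessing_mismatches(tts_cfg, vocoder_cfg).items():
    print(f"{key}: TTS={tts_value} vocoder={vocoder_value}")
```

An empty result only means the listed settings agree; differences in normalization or log scaling that are not captured in the config would still need to be checked by hand.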
@bshall - First of all, thank you for this implementation. In this issue, you pointed out that you've generated sample audio from a Mel-spectrogram produced by the VQVAE. It sounds pretty good.
My question is: how would one go about generating audio from Mel-spectrograms? Do we need to preprocess the Mel-spectrogram, if that's the only thing we're given?