
Generating samples from generated Mel-spectrograms #13

Closed
francislata opened this issue Nov 9, 2019 · 3 comments
francislata commented Nov 9, 2019

@bshall - First of all, thank you for this implementation. In this issue, you pointed out that you generated a sample audio from a Mel-spectrogram produced by the VQ-VAE. It sounds pretty good.

My question is: how would one go about generating audio from a Mel-spectrogram? Do we need to preprocess the Mel-spectrogram if that's the only thing we're given?

francislata changed the title from "Generating samples generated Mel-spectrograms" to "Generating samples from generated Mel-spectrograms" on Nov 9, 2019
bshall (Owner) commented Nov 11, 2019

Hi @francislata,

The generate.py script does generate audio from Mel-spectrograms: if you look at the code, it converts the raw audio into a Mel-spectrogram and then feeds that to the vocoder. If you want to use spectrograms created by another process (like Tacotron, for example), they need to be computed with the same parameters I've used. You can find the parameters in config.json and the preprocessing steps I used in preprocess.py.
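For context, here is a rough sketch (not the repository's actual code) of the kind of librosa-based mel extraction the preprocessing performs. The config key names and fallback values below are assumptions and should be checked against config.json and preprocess.py:

```python
import json
import librosa
import numpy as np

# Read the analysis parameters from config.json so externally produced mels
# can be matched against them. Key names and defaults are assumptions.
with open("config.json") as f:
    cfg = json.load(f)

sr = cfg.get("sample_rate", 16000)              # assumed key name
wav, _ = librosa.load("example.wav", sr=sr)

mel = librosa.feature.melspectrogram(
    y=wav,
    sr=sr,
    n_fft=cfg.get("n_fft", 2048),               # assumed key name
    hop_length=cfg.get("hop_length", 200),      # assumed key name
    win_length=cfg.get("win_length", 800),      # assumed key name
    n_mels=cfg.get("n_mels", 80),               # assumed key name
)
# preprocess.py may use a different floor, scaling, or normalization step.
log_mel = np.log(np.maximum(mel, 1e-5))
```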

francislata (Author) commented Nov 12, 2019

@bshall - Can you be more specific about which parameters need to match?

If the Mel-spectrogram given to me is generated by an arbitrary TTS system, can I not just take it and run it through the vocoder?

When I follow the padding of the Mel-spectrogram in preprocess.py, the generated audio is silent throughout. So I'm wondering how you preprocessed the Mel-spectrogram you sampled here to produce sound without having the reference waveform at all.

bshall (Owner) commented Nov 14, 2019

Hi @francislata, sorry about the delay.

I used librosa to generate the Mel-spectrograms, and the specific parameters (hop_length, win_length, etc.) can be found here. If you've got mels from a TTS system, the best approach would be to retrain the vocoder. To do that, replace the steps in preprocess.py with the exact steps used to preprocess the mels for the TTS system (but include the padding step).

Unfortunately, different preprocessing does have a big effect, so it's very important that the preprocessing pipelines for the TTS system and the vocoder line up.
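As a rough sanity check before retraining, something along these lines can catch mismatched analysis parameters. The config key names and the TTS-side values here are only placeholders, not taken from either codebase:

```python
import json

# Compare the vocoder's analysis parameters (config.json in this repo) with
# the ones the TTS system used to produce its mels. Key names are assumptions.
with open("config.json") as f:
    vocoder_cfg = json.load(f)

# Placeholder values standing in for the TTS system's configuration.
tts_cfg = {"sample_rate": 22050, "hop_length": 256, "win_length": 1024, "n_mels": 80}

for key in ("sample_rate", "hop_length", "win_length", "n_mels"):
    if vocoder_cfg.get(key) != tts_cfg.get(key):
        print(f"Mismatch on {key}: vocoder={vocoder_cfg.get(key)}, tts={tts_cfg.get(key)}")
```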
