This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

Multi-channel Audio Enhancement #140

Open
TankyFranky opened this issue Dec 1, 2022 · 1 comment


@TankyFranky

Hello,

I am wondering what the best approach would be to adapt the denoiser for multi-channel audio. I have a four-microphone array to which I would like to apply the denoiser as a pre-processing step.

Can the model.chin and model.chout parameters be changed when performing inference with a network that has been trained on only one channel? Will the inference/forward step adapt if the input tensor contains multiple channels of audio (all of the same frame size)? I have modified the live.py example to perform sequential forward passes (one per channel), but this obviously tanks the real-time performance.
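One alternative to sequential per-channel passes is to stack the channels along the batch dimension so a single forward call processes all four at once. A minimal sketch, assuming the pretrained model takes tensors shaped (batch, channels, time) with chin=1; `DummyDenoiser` below is a hypothetical stand-in for the real model:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the pretrained denoiser (chin=1, chout=1);
# like the real model, it takes tensors shaped (batch, channels, time).
class DummyDenoiser(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv1d(1, 1, kernel_size=3, padding=1)

    def forward(self, mix):  # mix: (batch, 1, time)
        return self.conv(mix)

model = DummyDenoiser().eval()

# Four-microphone recording: (channels, time).
multi = torch.randn(4, 16000)

with torch.no_grad():
    # Treat each channel as an independent batch item: (4, 1, time).
    batch = multi.unsqueeze(1)
    enhanced = model(batch).squeeze(1)  # back to (channels, time)

print(enhanced.shape)  # torch.Size([4, 16000])
```

This trades memory for latency: one larger forward pass instead of four small ones.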

Any advice on applying denoiser to multi-channel audio would be appreciated.

Thanks.

@adiyoss
Contributor

adiyoss commented Dec 2, 2022

Hi @TankyFranky,
You can definitely reconfigure the model to take more than one channel as input and output. However, if you go that way you will need to train a new model from scratch.
If you want to use the pre-trained models, then what you did (processing each channel independently) is the best/easiest way. Regarding the real-time constraints, maybe you can process the channels in parallel?
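A minimal sketch of that parallel approach, assuming a mono model like the pretrained denoiser (a bare `nn.Conv1d` stands in here) and relying on the fact that PyTorch releases the GIL inside its operators, so plain threads can overlap the per-channel work:

```python
import torch
import torch.nn as nn
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for the pretrained mono model (chin=chout=1).
model = nn.Conv1d(1, 1, kernel_size=3, padding=1).eval()

def denoise_channel(channel):  # channel: (time,)
    with torch.no_grad():
        return model(channel.view(1, 1, -1)).view(-1)

multi = torch.randn(4, 16000)  # (channels, time)

# PyTorch releases the GIL inside its ops, so threads can run concurrently.
with ThreadPoolExecutor(max_workers=4) as pool:
    enhanced = torch.stack(list(pool.map(denoise_channel, multi)))

print(enhanced.shape)  # torch.Size([4, 16000])
```

Whether this helps in practice depends on how well the model's ops parallelize on the target hardware; on a GPU, batching the channels together may be faster than threading.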
