-
Notifications
You must be signed in to change notification settings - Fork 220
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AssertionError: choose a window size 400 that is [2, 1] #133
Comments
16kHz 5second should work, our ESC-50 recipe is in this setting. Which line report this error? is your audio monochannel or multi-channel? check shape of -Yuan |
First of all, thanks for the answer! My waveform looks like this:
80000 because of the 16000Hz and the 5 seconds. And the error happens in:
I know that's a lot to ask, but do you have any ideas about what could be wrong? i'm lost. Thanks a lot. UPDATE: do i use the wrong feature extractor? # Load the model and feature extractor
model_name = "MIT/ast-finetuned-audioset-10-10-0.4593"
model = ASTForAudioClassification.from_pretrained(model_name)
feature_extractor = ASTFeatureExtractor.from_pretrained(model_name) |
I try to use the feature extractor on my audiofiles.
My audio files are all 16000Hz and 5 seconds long.
The
waveform.shape[1]
is 80000I get the error:
AssertionError: choose a window size 400 that is [2, 1]
and I don't really know what to do with it.
Here is the whole thing:
The text was updated successfully, but these errors were encountered: