Last dropout is disconnected from fc_audioset layer #52

ilic-mezza · 2022-03-21T13:42:41Z

It appears that, in all CNN models, the output of the last dropout, i.e., embedding = F.dropout(x, p=0.5, training=self.training), is never passed to the output linear layer, which is instead called on the pre-dropout tensor, i.e., self.fc_audioset(x).
Indeed, the forward method of these models reads:

x = F.relu_(self.fc1(x))
embedding = F.dropout(x, p=0.5, training=self.training)
clipwise_output = torch.sigmoid(self.fc_audioset(x))

From the arXiv paper, it seems that the last dropout should instead sit between the 2048-dimensional embedding layer and the 527-output layer. Indeed, the paper reads:

"Dropout [38] is applied after each downsampling operation and fully connected layers to prevent systems from overfitting."

Therefore, I expected to see the following:

x = F.relu_(self.fc1(x))
embedding = F.dropout(x, p=0.5, training=self.training)
clipwise_output = torch.sigmoid(self.fc_audioset(embedding))
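
To make the difference concrete, here is a minimal, framework-free sketch of the two variants. It is not the repository's code: dropout and linear below are hypothetical pure-Python stand-ins for F.dropout and self.fc_audioset, the 16-element input and unit weights are made up for illustration, and the sigmoid is omitted because it is monotonic and does not change the conclusion. Under these assumptions, the head that reads x gives the same value on every training-mode pass, while the head that reads embedding varies with the dropout mask.

```python
import random


def dropout(values, p, training, rng):
    """Stand-in for F.dropout (inverted dropout): zero each value with
    probability p and scale the survivors by 1 / (1 - p)."""
    if not training or p == 0.0:
        return list(values)
    keep = 1.0 - p
    return [v / keep if rng.random() >= p else 0.0 for v in values]


def linear(values, weights, bias):
    """Stand-in for a single-output linear layer such as fc_audioset."""
    return sum(v * w for v, w in zip(values, weights)) + bias


def head_as_written(x, weights, bias, rng):
    # Mirrors the first snippet: the dropout result is computed,
    # but the output layer still reads the pre-dropout tensor x.
    embedding = dropout(x, p=0.5, training=True, rng=rng)
    return linear(x, weights, bias)


def head_as_expected(x, weights, bias, rng):
    # Mirrors the second snippet: the output layer reads the dropped-out embedding.
    embedding = dropout(x, p=0.5, training=True, rng=rng)
    return linear(embedding, weights, bias)


rng = random.Random(0)
x = [1.0] * 16        # illustrative stand-in for relu(fc1(x))
weights = [1.0] * 16  # illustrative weights, not trained values
bias = 0.0

# Collect the distinct outputs seen over repeated training-mode passes.
as_written = {head_as_written(x, weights, bias, rng) for _ in range(200)}
as_expected = {head_as_expected(x, weights, bias, rng) for _ in range(200)}

print("distinct outputs, as written: ", len(as_written))
print("distinct outputs, as expected:", len(as_expected))
```

In evaluation mode both variants coincide, since dropout is then the identity; the two snippets differ only in whether the output layer is regularized during training.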

Am I missing something?

Thank you,
Alessandro
