What's the meaning of the line 104 of file yolo_net.py? #47

Open
LeoCHANGcn opened this issue Mar 3, 2018 · 1 comment

Comments

@LeoCHANGcn

Why did you set the input dimension of the first fully connected layer to 49 * 1024?

@wuhao5688

In the YOLO paper, the network predicts bounding boxes and class probabilities on a 7x7 grid of cells. So after the conv and pooling layers, the activation maps have spatial dimensions 7x7 (with a depth of 1024). The original input resolution is 448x448. After four stride-2 pooling layers and two stride-2 conv layers (see the inference() function), each spatial dimension is down-sampled to 448/(2^4)/(2^2) = 7. The final activation maps therefore have shape 7x7x1024, which is flattened into a vector of length 49*1024 = 50176 (in the local() function) and fed into the FC layer.
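To double-check the arithmetic, here is a small sketch in plain Python. It is not code from yolo_net.py; the helper name `downsampled_size` and the constants are made up for illustration, and it assumes every stride-2 layer halves the spatial size exactly (as with "SAME"-style padding).

```python
# Hypothetical helper, not part of yolo_net.py: tracks how the spatial
# size shrinks through stride-2 layers, assuming each one halves it.
def downsampled_size(size, num_stride2_layers):
    for _ in range(num_stride2_layers):
        size //= 2
    return size

INPUT_SIZE = 448        # input image is 448x448
NUM_POOL = 4            # stride-2 pooling layers
NUM_STRIDED_CONV = 2    # stride-2 conv layers
DEPTH = 1024            # channels of the last conv layer

grid = downsampled_size(INPUT_SIZE, NUM_POOL + NUM_STRIDED_CONV)
flat_dim = grid * grid * DEPTH  # input dimension of the first FC layer

print(grid, flat_dim)
```

Under these assumptions `grid` comes out as 7 and `flat_dim` as 49 * 1024, which matches the FC layer's input dimension asked about above.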
