Results seem a little off when trained from scratch #22

Open
varunjain3 opened this issue Nov 11, 2022 · 1 comment
Comments

varunjain3 commented Nov 11, 2022

Hi, I trained the model using the provided script and the hyperparameters for IFRNet (base model) on 8x V100 and on 4x V100. I am getting a PSNR of around 34.5 with the model trained on 8 GPUs and around 35 with the model trained on 4 GPUs.

Are there any hyperparameters that I should change to reproduce the results given in the paper? Also, do you have any intuition on why the results vary? I am using the Vimeo90K dataset for training.

I am using a batch size of 55, and my V100s have 32 GB of memory each.

@ltkong218
Owner

Thanks for your question. I think the total number of training iterations should stay the same whether you use 4 or 8 GPUs.

You can try doubling the number of training epochs when you double the number of GPUs. That way, the total number of training iterations stays the same.
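
To make the arithmetic behind this suggestion concrete, here is a minimal sketch. It assumes the per-GPU batch size is fixed and that each epoch's samples are split evenly across GPUs (as a distributed sampler would typically do), so that adding GPUs reduces the number of iterations per epoch. The function names and all numbers (dataset size, batch size, epoch count) are hypothetical and are not taken from the IFRNet training script.

```python
import math


def iterations_per_epoch(num_samples, per_gpu_batch, num_gpus):
    """Optimizer steps in one epoch when samples are split evenly across GPUs."""
    # Assumption: the effective batch per step is per_gpu_batch * num_gpus.
    return math.ceil(num_samples / (per_gpu_batch * num_gpus))


def total_iterations(num_samples, per_gpu_batch, num_gpus, epochs):
    """Total optimizer steps over the whole training run."""
    return iterations_per_epoch(num_samples, per_gpu_batch, num_gpus) * epochs


def epochs_to_match(num_samples, per_gpu_batch, num_gpus, target_iters):
    """Epochs needed on this GPU count to reach at least target_iters steps."""
    per_epoch = iterations_per_epoch(num_samples, per_gpu_batch, num_gpus)
    return math.ceil(target_iters / per_epoch)


# Hypothetical values, chosen only to illustrate the scaling.
NUM_SAMPLES = 48_000
PER_GPU_BATCH = 6
EPOCHS_ON_4_GPUS = 100

iters_4 = total_iterations(NUM_SAMPLES, PER_GPU_BATCH, 4, EPOCHS_ON_4_GPUS)
iters_8 = total_iterations(NUM_SAMPLES, PER_GPU_BATCH, 8, EPOCHS_ON_4_GPUS)
epochs_8 = epochs_to_match(NUM_SAMPLES, PER_GPU_BATCH, 8, iters_4)

print(iters_4)   # 200000 steps on 4 GPUs
print(iters_8)   # 100000 steps on 8 GPUs with the same epoch count
print(epochs_8)  # 200 epochs on 8 GPUs to match the 4-GPU step count
```

Under these assumptions, keeping the epoch count unchanged while doubling the GPUs halves the number of optimizer steps, which is why doubling the epochs restores the same total.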
