
Inquiry: Scaling the Lr #2

Closed
rb876 opened this issue Jun 20, 2019 · 3 comments

rb876 commented Jun 20, 2019

Hi,
it is not clear to me how the learning rate (lr) is scaled during training.

Many Thanks


henripal commented Jun 20, 2019

Here:

optimizer.step(model_desc['lr']/(epoch//model_desc['lr_epoch'] + 1))

Unlike the standard PyTorch optimizers, the SGLD optimizer's `step()` takes an optional lr as an argument!
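
For illustration, here is a minimal sketch of an SGLD-style optimizer whose `step()` accepts an optional learning rate, together with the step-wise decay expression from the line above evaluated with hypothetical values. This is not the repository's actual implementation; the class body, the noise convention, and the `model_desc` values are assumptions.

```python
import torch

class SGLD(torch.optim.Optimizer):
    """Minimal SGLD sketch: gradient descent step plus Gaussian noise."""

    def __init__(self, params, lr=1e-2):
        super().__init__(params, dict(lr=lr))

    @torch.no_grad()
    def step(self, lr=None):
        # An lr passed here overrides the lr stored in the param groups
        # for this single update, which is what allows the per-epoch
        # scaling shown in the comment above.
        for group in self.param_groups:
            eff_lr = lr if lr is not None else group['lr']
            for p in group['params']:
                if p.grad is None:
                    continue
                # Langevin update: follow the negative gradient and inject
                # Gaussian noise with variance 2*lr (one common convention).
                noise = torch.randn_like(p) * (2.0 * eff_lr) ** 0.5
                p.add_(-eff_lr * p.grad + noise)

# The scheduling expression decays the lr in steps. With the hypothetical
# values lr=1e-3 and lr_epoch=10, epochs 0-9 use 1e-3, epochs 10-19 use
# 5e-4, and epochs 20-29 use roughly 3.3e-4.
model_desc = {'lr': 1e-3, 'lr_epoch': 10}
epoch = 15
current_lr = model_desc['lr'] / (epoch // model_desc['lr_epoch'] + 1)
```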


rb876 commented Jun 20, 2019

Oh thanks, and how do you scale the learning rate with respect to the number of data points? Looking at the ChunyuanLI implementation (ChunyuanLI/pSGLD#2) and at the gmarceaucaron implementation (https://github.com/gmarceaucaron/natural-langevin-dynamics-for-neural-networks), they scale the Langevin noise with respect to the number of data points in the training set (Ntrain), or the square root of Ntrain, to allow convergence. Do you do something similar?
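
For context, a rough sketch of what that dataset-size scaling can look like. This is illustrative only: `sgld_step`, `n_train`, and the exact noise factor are assumptions, and the cited implementations differ in whether they use Ntrain or sqrt(Ntrain).

```python
import math
import torch

def sgld_step(param, grad, lr, n_train):
    """Illustrative SGLD update with dataset-size-aware noise.

    When the minibatch loss is an average over examples, the injected
    Gaussian noise is often shrunk by the training-set size (Ntrain or
    sqrt(Ntrain)) relative to the gradient term so that the noise variance
    stays on the right scale for posterior sampling; the exact factor
    varies between the implementations referenced above.
    """
    noise = torch.randn_like(param) * math.sqrt(2.0 * lr / n_train)
    return param - lr * grad + noise
```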

@henripal (Owner)

@rb876 and I are discussing this by email - closing the issue!
