Correct the formulation in the description of the centered rmsprop

PiperOrigin-RevId: 299879992
Change-Id: I8e0ec675cc933a11ff8ac8a7e29ca0f4de030d36
parent 105c91f2c4
commit b5085d2014
@@ -30,7 +30,7 @@ This implementation of RMSProp uses plain momentum, not Nesterov momentum.
 The centered version additionally maintains a moving (discounted) average of the
 gradients, and uses that average to estimate the variance:
 
-mean_grad = decay * mean_square{t-1} + (1-decay) * gradient
+mean_grad = decay * mean_grad{t-1} + (1-decay) * gradient
 mean_square = decay * mean_square{t-1} + (1-decay) * gradient ** 2
 mom = momentum * mom{t-1} + learning_rate * g_t /
     sqrt(mean_square - mean_grad**2 + epsilon)
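The corrected centered update can be illustrated with a minimal scalar sketch in plain Python. It is not the library's implementation: the function name `centered_rmsprop_step`, the dict-based state, the zero initial state, the default hyperparameter values, and the final `var - mom` step are all assumptions made for illustration. Only the three formulas for `mean_grad`, `mean_square`, and `mom` come from the docstring in this change.

```python
import math


def centered_rmsprop_step(var, grad, state, learning_rate=0.001, decay=0.9,
                          momentum=0.0, epsilon=1e-10):
    """Apply one centered-RMSProp update to a scalar (hypothetical helper).

    `state` holds the previous mean_grad, mean_square and mom values.
    """
    # Moving average of the gradient: note that it decays the previous
    # mean_grad, not the previous mean_square (the point of this commit).
    mean_grad = decay * state["mean_grad"] + (1 - decay) * grad
    # Moving average of the squared gradient.
    mean_square = decay * state["mean_square"] + (1 - decay) * grad ** 2
    # mean_square - mean_grad**2 is the variance estimate used for scaling.
    mom = momentum * state["mom"] + learning_rate * grad / math.sqrt(
        mean_square - mean_grad ** 2 + epsilon)
    new_state = {"mean_grad": mean_grad, "mean_square": mean_square, "mom": mom}
    # Assumption: the variable is updated by subtracting mom.
    return var - mom, new_state


# Usage: start from an all-zero state (an assumption) and take one step.
state = {"mean_grad": 0.0, "mean_square": 0.0, "mom": 0.0}
var, state = centered_rmsprop_step(1.0, 2.0, state, learning_rate=0.1)
```

With the pre-fix formula, `mean_grad` would be seeded from the squared-gradient average, so `mean_square - mean_grad**2` would no longer be a variance estimate.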