Fix TPU embedding implementation initialization of proximal yogi optimizer.
As with Adagrad, initializing the "v" accumulator to 0.1 (instead of 0.0) avoids numerical instability issues.

PiperOrigin-RevId: 288327127
Change-Id: I66a89bbb4e18fe653479cb64003b7c42daba7158
parent 28018f2cd7
commit 8033e41d7c
@@ -265,7 +265,7 @@ Status GetOptimizationAlgorithmStateVariables(
     }
     case OptimizationAlgorithm::kProximalYogi: {
       state_variables->push_back(
-          MakeStandardStateVariableSpecification("v", 0.0));
+          MakeStandardStateVariableSpecification("v", 0.1));
       state_variables->push_back(
           MakeStandardStateVariableSpecification("m", 0.0));
       break;
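The commit message does not say what the instability looks like. One plausible reading, sketched below, is that with "v" starting at 0.0 the first update divides by a near-zero denominator, so the step is large relative to the gradient. This is a minimal scalar sketch of a plain (non-proximal) Yogi step, not the TPU embedding implementation: YogiState, YogiStep, FirstStepGain, and all hyperparameter values are hypothetical names and assumptions chosen for illustration.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical scalar optimizer state. The field names mirror the "v" and "m"
// state variables in the diff; everything else here is an assumption.
struct YogiState {
  double v;  // second-moment accumulator
  double m;  // first-moment accumulator
};

// One plain Yogi step for a single weight; returns the weight delta.
// No proximal (l1/l2) terms are modeled. Default hyperparameters are
// illustrative, not taken from the TPU embedding code.
inline double YogiStep(YogiState& s, double grad, double lr = 0.01,
                       double beta1 = 0.9, double beta2 = 0.999,
                       double epsilon = 1e-3) {
  assert(epsilon > 0.0);
  const double g2 = grad * grad;
  s.m = beta1 * s.m + (1.0 - beta1) * grad;
  // Yogi moves v toward g^2 additively, by at most (1 - beta2) * g^2.
  const double diff = s.v - g2;
  const double sign = (diff > 0.0) - (diff < 0.0);
  s.v -= (1.0 - beta2) * sign * g2;
  return -lr * s.m / (std::sqrt(s.v) + epsilon);
}

// Magnitude of the first step divided by lr * |grad|: how strongly the very
// first update amplifies the gradient for a given initial value of v.
inline double FirstStepGain(double initial_v, double grad) {
  YogiState s{initial_v, 0.0};
  const double lr = 0.01;
  return std::fabs(YogiStep(s, grad, lr)) / (lr * std::fabs(grad));
}
```

Under these assumptions, FirstStepGain(0.0, 1e-4) comes out far larger than FirstStepGain(0.1, 1e-4): starting from zero, the denominator after one step is dominated by epsilon, while starting from 0.1 it is dominated by sqrt(0.1), which keeps early steps bounded.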