[Enhancement]: Wrong gains for weight initialization #1559

OliEfr · 2023-06-16T11:20:01Z

Enhancement

The recommended gains for the weight init depend on the used activation function, see torch docs. However, as for now the used gains are statically implemented and always the same in ActorCriticPolicies. See here.

I recommend making the gains dependent on the activation function used(, i.e. probably mainly ReLU and tanh).

If you agree with this, I would like to implement it myself and PR.

Thanks and a good day!

To Reproduce

--

Relevant log output / Error message

--

System Info

--

Checklist

I have checked that there is no similar issue in the repo
I have read the documentation
I have provided a minimal working example to reproduce the bug
I've used the markdown code blocks for both code and stack traces.

araffin · 2023-07-20T12:03:09Z

Hello,
those gains are for orthogonal initialization only (https://pytorch.org/docs/stable/_modules/torch/nn/init.html#orthogonal_), when they are not used, the default pytorch initialization is used.

The gains are from OpenAI Baselines, to keep results consistent, but compared to other initialization, I didn't see any investigation on the effect of the gain so far (this would be already a good contribution), or at least if using tanh/relu with constant gain has an effect.

OliEfr · 2023-07-21T07:44:33Z

Yes, I am talking about orthogonal init. I agree that it is useful to keep it consistent with OpenAI Baselines. A study regarding the effect of gain towards convergence will be useful.

It seems a coincidence (?) that the standard gain listed for ReLU for any initialization is also sqrt(2) Link. (The gain implemented in OpenAI Baselines and sb3 is also sqrt(2). Maybe they just used ReLU by default and never investigated the gain?)

One study that partly investigates impact of weight init is this. They find:

initializing the policy MLP with smaller weights in the last layer

network initialization scheme (C56) does not matter too much

OliEfr added the bug Something isn't working label Jun 16, 2023

OliEfr changed the title ~~[Bug]: wrong gains for weight initialization~~ [Bug]: Wrong gains for weight initialization Jun 16, 2023

araffin added enhancement New feature or request and removed bug Something isn't working labels Jun 16, 2023

OliEfr changed the title ~~[Bug]: Wrong gains for weight initialization~~ [Enhancement]: Wrong gains for weight initialization Jun 17, 2023

araffin self-assigned this Jul 20, 2023

araffin added the help wanted Help from contributors is welcomed label Jul 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Enhancement]: Wrong gains for weight initialization #1559

[Enhancement]: Wrong gains for weight initialization #1559

OliEfr commented Jun 16, 2023 •

edited

Loading

araffin commented Jul 20, 2023

OliEfr commented Jul 21, 2023

[Enhancement]: Wrong gains for weight initialization #1559

[Enhancement]: Wrong gains for weight initialization #1559

Comments

OliEfr commented Jun 16, 2023 • edited Loading

Enhancement

To Reproduce

Relevant log output / Error message

System Info

Checklist

araffin commented Jul 20, 2023

OliEfr commented Jul 21, 2023

OliEfr commented Jun 16, 2023 •

edited

Loading