
SSL: fix MLP head and remove L2 normalization #141

Closed
wants to merge 17 commits

Conversation

mattersoflight
Member

@mattersoflight mattersoflight commented Aug 17, 2024

Fix the sequence of batchnorm and linear in the last MLP layer of the contrastive encoder model.

Remove L2 normalization before computing the triplet loss. This works best when the dimension of the projections is also reduced.

Refactor the light module into representation and translation to separate the pipelining code for the different tasks.

Fix #139, fix #138.
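
For illustration, a minimal sketch of the loss change, assuming a standard PyTorch triplet-margin setup (tensor names and shapes are placeholders, not the actual VisCy code):

```python
import torch
import torch.nn.functional as F

# Illustrative batch of projections with a reduced dimension (e.g. 16).
anchor, positive, negative = (torch.randn(8, 16) for _ in range(3))

# Previously the projections were L2-normalized first, e.g.:
#   anchor = F.normalize(anchor, p=2, dim=1)
# Now the raw projections go straight into the loss, and the optimizer
# is left to control their scale.
loss = F.triplet_margin_loss(anchor, positive, negative, margin=1.0)
```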

@mattersoflight mattersoflight requested a review from ziw-liu August 17, 2024 02:17
@mattersoflight
Member Author

@ziw-liu started the draft. I'll let you complete it. Feel free to make other architectural improvements as you go.

@mattersoflight mattersoflight changed the title from "draft projection head" to "projection head (and other architectural changes." Aug 17, 2024
@mattersoflight mattersoflight changed the title from "projection head (and other architectural changes." to "projection head (and other architectural changes)" Aug 17, 2024
@ziw-liu ziw-liu added the "bug" (Something isn't working) and "breaking" (Breaking changes) labels Aug 21, 2024
The projected features saved during prediction are now *not* normalized.
@ziw-liu ziw-liu added the "enhancement" (New feature or request) label Aug 21, 2024
@Soorya19Pradeep
Contributor

@ziw-liu, I get the following error:

[rank3]: else self._run_ddp_forward(*inputs, **kwargs)
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_test/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 1454, in _run_ddp_forward
[rank3]: return self.module(*inputs, **kwargs) # type: ignore[index]
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1553, in _wrapped_call_impl
[rank3]: return self._call_impl(*args, **kwargs)
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_test/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1562, in _call_impl
[rank3]: return forward_call(*args, **kwargs)
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_test/lib/python3.10/site-packages/lightning/pytorch/strategies/strategy.py", line 633, in wrapped_forward
[rank3]: out = method(*args, **kwargs)
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_infection_phenotyping/VisCy/viscy/representation/engine.py", line 144, in validation_step
[rank3]: self._log_metrics(
[rank3]: File "/hpc/mydata/soorya.pradeep/scratch/viscy_infection_phenotyping/VisCy/viscy/representation/engine.py", line 74, in _log_metrics
[rank3]: cosine_sim_pos = F.cosine_similarity(anchor, positive, dim=1).mean()
[rank3]: IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)

Sanity Checking: | | 0/? [00:00<?, ?it/s]
Sanity Checking: 0%| | 0/2 [00:00<?, ?it/s]
Sanity Checking DataLoader 0: 0%| | 0/2 [00:00<?, ?it/s]
srun: error: gpu-f-2: tasks 0,2: Exited with exit code 1
srun: error: gpu-f-2: task 3: Exited with exit code 1

@ziw-liu
Collaborator

ziw-liu commented Aug 21, 2024

@Soorya19Pradeep Fixed in 11fa65a. This is still very much a work in progress; I have not trained a full model yet.
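
One plausible reading of the failure mode (a guess, not necessarily what 11fa65a changed): `F.cosine_similarity(..., dim=1)` raises exactly this IndexError when the embeddings arrive as 1-D tensors, while indexing the last dimension works for both batched and unbatched inputs:

```python
import torch
import torch.nn.functional as F

anchor = torch.randn(16)    # unbatched, 1-D: only dims -1 and 0 exist
positive = torch.randn(16)

# F.cosine_similarity(anchor, positive, dim=1)  # IndexError, as in the log above
sim = F.cosine_similarity(anchor, positive, dim=-1)  # works for 1-D and (N, D)
```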

@ziw-liu
Collaborator

ziw-liu commented Aug 23, 2024

@mattersoflight With batch normalization in the MLP head, the features still have a much lower rank (11) than the projections (96).

Edit: see full analysis here.
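
For reference, one way such a rank comparison can be made, assuming per-sample vectors are stacked into (N, D) matrices (the shapes here are placeholders):

```python
import torch

features = torch.randn(512, 768)    # encoder features (placeholder shape)
projections = torch.randn(512, 96)  # MLP-head projections (placeholder shape)

# A numerical rank far below D would indicate dimensional collapse.
print(torch.linalg.matrix_rank(features))
print(torch.linalg.matrix_rank(projections))
```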

@mattersoflight
Member Author

@ziw-liu our next steps:

  • Reorder the batchnorm and linear layers at the network's end (MLP -> batchnorm), as sketched after this list.
  • Do not L2-normalize the projections during fitting; let the optimizer enforce normalization.
  • Run two computational experiments with the above changes: projection dim = 128 and projection dim ~ 16.
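
A minimal sketch of the proposed head ordering, assuming a two-layer MLP (the helper name and sizes are hypothetical, not the actual VisCy code):

```python
import torch.nn as nn

def make_projection_head(in_dim: int, hidden_dim: int, out_dim: int) -> nn.Sequential:
    """Hypothetical projection head: batchnorm *after* the final linear layer."""
    return nn.Sequential(
        nn.Linear(in_dim, hidden_dim),
        nn.BatchNorm1d(hidden_dim),
        nn.ReLU(inplace=True),
        nn.Linear(hidden_dim, out_dim),  # MLP -> batchnorm ordering
        nn.BatchNorm1d(out_dim),         # no L2 normalization afterwards
    )

head = make_projection_head(768, 768, 128)  # projection dim = 128, or ~ 16 for the second run
```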

@mattersoflight mattersoflight marked this pull request as ready for review August 27, 2024 21:11
@ziw-liu ziw-liu changed the base branch from main to representation-learning August 28, 2024 00:29
@ziw-liu ziw-liu changed the title from "projection head (and other architectural changes)" to "SSL: fix MLP head and remove L2 normalization" Aug 28, 2024
@ziw-liu ziw-liu deleted the branch representation-learning August 28, 2024 00:40
@ziw-liu ziw-liu closed this Aug 28, 2024