
Commit ddb8048

Myles Bartlett committed: Initial commit. (0 parents)

214 files changed: +13817 -0 lines changed


LICENSE

+674
Large diffs are not rendered by default.

README.md

+64

# Okapi: Generalising Better By Making Statistical Matches Match

Official code for the NeurIPS 2022 paper _Okapi: Generalising Better By Making Statistical Matches Match_.

> We propose Okapi, a simple, efficient, and general method for robust semi-supervised learning based on online statistical matching. Our method uses a nearest-neighbours-based matching procedure to generate cross-domain views for a consistency loss, while eliminating statistical outliers. In order to perform the online matching in a runtime- and memory-efficient way, we draw upon the self-supervised literature and combine a memory bank with a slow-moving momentum encoder. The consistency loss is applied within the feature space, rather than on the predictive distribution, making the method agnostic to both the modality and the task in question. We experiment on the WILDS 2.0 datasets (Sagawa et al.), which significantly expand the range of modalities, applications, and shifts available for studying and benchmarking real-world unsupervised adaptation. Contrary to Sagawa et al., we show that it is in fact possible to leverage additional unlabelled data to improve upon empirical risk minimisation (ERM) results with the right method. Our method outperforms the baseline methods in terms of out-of-distribution (OOD) generalisation on the iWildCam (a multi-class classification task) and PovertyMap (a regression task) image datasets as well as the CivilComments (a binary classification task) text dataset. Furthermore, from a qualitative perspective, we show the matches obtained from the learned encoder are strongly semantically related.

## Requirements

- python >=3.9
- [poetry](https://python-poetry.org/)
- CUDA >=11.3 (if installing with ``install.sh``)

## Installation

We use [poetry](https://python-poetry.org/) for dependency management; installing it is a prerequisite for installing the python dependencies. With poetry installed, the dependencies can then be installed by running ``install.sh``, which requires CUDA >=11.3 when installing to a CUDA-equipped machine. This constraint can be bypassed by manually executing the commands (see the sketch below):

- ``poetry install``
- install the appropriate version of PyTorch and ``torch-scatter`` (required for evaluation with [WILDS](https://github.com/p-lambda/wilds)) for the version of CUDA installed on your machine.
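
For reference, here is a minimal sketch of the manual route, assuming CUDA 11.3; the exact PyTorch version and wheel URLs are illustrative, so substitute the ones matching your machine:

```bash
# Install the python dependencies managed by poetry.
poetry install

# Illustrative: a PyTorch build matching CUDA 11.3.
poetry run pip install torch==1.11.0+cu113 \
  --extra-index-url https://download.pytorch.org/whl/cu113

# Illustrative: the matching torch-scatter wheel (needed for WILDS evaluation).
poetry run pip install torch-scatter \
  -f https://data.pyg.org/whl/torch-1.11.0+cu113.html
```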

## Running the code

We use [hydra](https://github.com/facebookresearch/hydra) for managing the configuration of our experiments. Experiment configurations are grouped by dataset in ``external_confs/experiments`` and can be imported via the commandline with the command ``python main.py +experiment={dataset}/{method}``; one can then override any desired configs/arguments with the syntax ``{config}={name_of_config_file}`` or ``{config}.{attribute}={value}`` (e.g. ``seed=42`` (defined in the main config class), ``backbone=iw/rn50``, ``alg.lr=1.e-5``). A composed example is sketched below.
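
Putting the pieces together, a full invocation might look like the following; ``{dataset}/{method}`` is the README's placeholder, so consult ``external_confs/experiments`` for the available options:

```bash
# Select an experiment config, then override individual values inline.
python main.py +experiment={dataset}/{method} \
  seed=42 \
  backbone=iw/rn50 \
  alg.lr=1.e-5
```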

## Citation

```
@article{bartlett2022okapi,
  title={Okapi: Generalising Better by Making Statistical Matches Match},
  author={Bartlett, Myles and Romiti, Sara and Sharmanska, Viktoriia and Quadrianto, Novi},
  journal={Advances in Neural Information Processing Systems},
  volume={35},
  year={2022}
}
```

external_confs/alg/iwildcam/clip.yaml

+25

---
defaults:
  - /schema/alg: erm
  - defaults
  - _self_

model:
evaluator:
lr: 5e-05
optimizer_cls: 'torch.optim.AdamW'
optimizer_kwargs: null
use_sam: false
sam_rho: 0.05
scheduler_cls: null
scheduler_kwargs: null
lr_sched_interval: step
lr_sched_freq: 1
loss_fn: null
batch_transforms:
  - _target_: ranzen.torch.transforms.RandomCutMix
    alpha: 1.0
    num_classes: 182
  - _target_: ranzen.torch.transforms.RandomMixUp.with_beta_dist
    alpha: 0.2
    num_classes: 182
    inplace: true
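
Configs like this one are selected through Hydra's config groups; assuming the group name mirrors the directory layout under ``external_confs`` (as with ``backbone=iw/rn50`` in the README), selection and inline overrides would look something like:

```bash
# Select the CLIP/ERM algorithm config for iWildCam and adjust its learning rate.
python main.py +experiment={dataset}/{method} alg=iwildcam/clip alg.lr=1.e-5
```
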
+20

---
defaults:
  - /schema/alg: fixmatch
  - defaults
  - _self_

model:
evaluator:
lr: 3e-05
optimizer_cls: 'torch.optim.AdamW'
optimizer_kwargs: null
use_sam: false
sam_rho: 0.05
scheduler_cls: null
scheduler_kwargs: null
lr_sched_interval: step
lr_sched_freq: 1
batch_transforms: null
confidence_threshold: 0.70
loss_u_weight: 1.0
temperature: 1.0
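
The FixMatch-specific knobs above (pseudo-label confidence threshold, unlabelled-loss weight, temperature) can likewise be overridden from the commandline; treating ``alg`` as the group name is again an assumption based on the directory layout:

```bash
# Require higher-confidence pseudo-labels and sharpen the predictive
# distribution (temperature < 1 sharpens).
python main.py +experiment={dataset}/{method} \
  alg.confidence_threshold=0.95 alg.temperature=0.5
```
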
+7

---
defaults:
  - /schema/backbone: convnext
  - _self_
version: TINY
in_channels: 3
pretrained: true

external_confs/backbone/iw/rn50.yaml

+7

---
defaults:
  - /schema/backbone: resnet
  - _self_
version: RN50
in_channels: 3
pretrained: true

+8

---
defaults:
  - /schema/backbone: convnext
  - _self_
in_channels: 8
pretrained: true
version: TINY
checkpoint_path: ''

+7

---
defaults:
  - /schema/backbone: resnet
  - _self_
version: RN18
in_channels: 8
pretrained: true

external_confs/checkpointer/cc.yaml

+6

---
defaults:
  - /schema/checkpointer: base
  - _self_
monitor: "validate/OOD/acc_wg"
mode: 'max'

+6

---
defaults:
  - /schema/checkpointer: base
  - _self_
monitor: "validate/OOD/F1-macro_all"
mode: 'max'

external_confs/checkpointer/pm.yaml

+6

---
defaults:
  - /schema/checkpointer: base
  - _self_
monitor: "validate/OOD/r_wg"
mode: 'max'
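
The three checkpointer configs each monitor a worst-group validation metric, presumably matching CivilComments (``acc_wg``), iWildCam (``F1-macro_all``), and PovertyMap (``r_wg``). Under the same directory-layout assumption, one would be selected with:

```bash
# Checkpoint on worst-group Pearson r over the OOD validation split (PovertyMap).
python main.py +experiment={dataset}/{method} checkpointer=pm
```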

external_confs/dm/erm_no_aug.yaml

+31

defaults:
  - iwildcam
  - _self_

groupby_fields: ['location']
train_batch_size_l: 24
training_mode: step

train_transforms_l:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

test_transforms:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

external_confs/dm/iwildcam/clip.yaml

+36

---
defaults:
  - iwildcam
  - _self_

train_batch_size_l: 24
training_mode: step
target_resolution: 224

train_transforms_l:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ target_resolution }
    - _target_: torchvision.transforms.RandomHorizontalFlip
    - _target_: torchvision.transforms.RandAugment
      num_ops: 2
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [0.48145466, 0.4578275, 0.40821073]
      std: [0.26862954, 0.26130258, 0.27577711]

test_transforms:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [0.48145466, 0.4578275, 0.40821073]
      std: [0.26862954, 0.26130258, 0.27577711]

+35

---
defaults:
  - iwildcam
  - _self_

groupby_fields: ['location']
train_batch_size_l: 24
training_mode: step

train_transforms_l:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.RandomHorizontalFlip
    - _target_: torchvision.transforms.RandAugment
      num_ops: 2
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

test_transforms:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

+67

---
defaults:
  - iwildcam
  - _self_

training_mode: step
use_unlabeled: true
target_resolution: 448
train_batch_size_l: 16
train_batch_size_u: 16

train_transforms_l:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.RandomHorizontalFlip
    - _target_: torchvision.transforms.RandAugment
      num_ops: 2
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

train_transforms_u:
  _target_: src.transforms.FixMatchTransform
  shared_transform_start:
    _target_: torchvision.transforms.Compose
    transforms:
      - _target_: torchvision.transforms.Resize
        size: ${ dm.target_resolution }
  strong_transform:
    _target_: torchvision.transforms.Compose
    transforms:
      - _target_: torchvision.transforms.RandomHorizontalFlip
      - _target_: torchvision.transforms.RandomCrop
        size: ${ dm.target_resolution }
      - _target_: src.transforms.FixMatchRandAugment
        num_ops: 2
  weak_transform:
    _target_: torchvision.transforms.Compose
    transforms:
      - _target_: torchvision.transforms.RandomHorizontalFlip
      - _target_: torchvision.transforms.RandomCrop
        size: ${ dm.target_resolution }
  shared_transform_end:
    _target_: torchvision.transforms.Compose
    transforms:
      - _target_: torchvision.transforms.ToTensor
      - _target_: torchvision.transforms.Normalize
        mean: [ 0.485, 0.456, 0.406 ]
        std: [ 0.229, 0.224, 0.225 ]

test_transforms:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

external_confs/dm/iwildcam/okapi.yaml

+37

---
defaults:
  - iwildcam
  - _self_

training_mode: step
groupby_fields: [location]
train_batch_size_l: 16
train_batch_size_u: 16
use_unlabeled: true

train_transforms_l:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.RandomHorizontalFlip
    - _target_: torchvision.transforms.RandAugment
      num_ops: 2
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]

test_transforms:
  _target_: torchvision.transforms.Compose
  transforms:
    - _target_: torchvision.transforms.Resize
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.CenterCrop
      size: ${ dm.target_resolution }
    - _target_: torchvision.transforms.ToTensor
    - _target_: torchvision.transforms.Normalize
      mean: [ 0.485, 0.456, 0.406 ]
      std: [ 0.229, 0.224, 0.225 ]
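
The ``${ dm.target_resolution }`` entries are OmegaConf interpolations that resolve against the datamodule's ``target_resolution`` when the config is composed, so a single override rescales every Resize/CenterCrop transform; treating ``dm`` as the config-group name is an assumption based on the directory layout:

```bash
# One override propagates to all interpolated transform sizes.
python main.py +experiment={dataset}/{method} dm=iwildcam/okapi dm.target_resolution=448
```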

external_confs/dm/pm/erm.yaml

+12

---
defaults:
  - /schema/dm: poverty_map
  - _self_

fold: A
train_batch_size_l: 128
training_mode: step
use_unlabeled: false
groupby_fields: ['country']
train_transforms_l:
  _target_: src.transforms.Identity
