
Supervised finetuning minor changes #456

Merged: theblackcat102 merged 8 commits into main from sft-gptjt-qa-labels on Jan 7, 2023

Conversation

sanagno (Collaborator) commented Jan 6, 2023

Small changes

  • small typo when masking with padding (see the sketch after this list)
  • better collator with comments
  • started quantization
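
For context, a minimal sketch of the kind of padding masking the first bullet refers to; the helper name and the -100 ignore-index convention are illustrative assumptions, not the exact code in this PR:

```python
import torch


def mask_padding_in_labels(labels: torch.Tensor, pad_token_id: int) -> torch.Tensor:
    """Hypothetical helper: replace padded positions so the loss ignores them."""
    labels = labels.clone()
    labels[labels == pad_token_id] = -100  # -100 is CrossEntropyLoss's default ignore_index
    return labels
```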

sanagno added the ml label Jan 6, 2023
sanagno mentioned this pull request Jan 6, 2023

theblackcat102 (Collaborator) left a comment

Can we force label_mask to be boolean at the collator stage, so that we don't need .bool() in trainer.py line 73?
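
A minimal sketch of what a boolean-at-the-collator approach could look like; the function and field names here are illustrative, not the PR's actual collator:

```python
import torch


def collate_label_masks(masks: list[list[int]], max_len: int) -> torch.Tensor:
    """Hypothetical collator step: pad per-example masks and return a bool tensor."""
    label_mask = torch.zeros(len(masks), max_len, dtype=torch.bool)
    for i, mask in enumerate(masks):
        label_mask[i, : len(mask)] = torch.tensor(mask, dtype=torch.bool)
    # Returning bool here would make the .bool() cast in trainer.py unnecessary.
    return label_mask
```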

theblackcat102 (Collaborator) commented

But I think it's just minor, so I'm going to merge it.

theblackcat102 merged commit 64a8543 into main Jan 7, 2023
theblackcat102 deleted the sft-gptjt-qa-labels branch January 7, 2023 00:27

sanagno (Collaborator, Author) commented Jan 7, 2023

Good point, that's how I had it, but it didn't work with DeepSpeed; I assume it casts tensors in specific ways. I am going to fix it in the next commit.
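
For reference, a sketch of one way to read the workaround above: keep label_mask as an integer tensor through the collator and cast it to bool only where it is consumed in the trainer. The function name and shapes are assumptions for illustration:

```python
import torch
import torch.nn.functional as F


def masked_cross_entropy(logits: torch.Tensor, targets: torch.Tensor, label_mask: torch.Tensor) -> torch.Tensor:
    """Cast the mask at the use-site instead of in the collator."""
    mask = label_mask.bool().view(-1)  # the trainer-side cast this thread discusses
    loss = F.cross_entropy(logits.view(-1, logits.size(-1)), targets.view(-1), reduction="none")
    return (loss * mask).sum() / mask.sum()
```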

@@ -0,0 +1,187 @@
# Taken from https://github.com/sleekmike/Finetune_GPT-J_6B_8-bit/blob/master/gpt-j-6b-8-bit.py

import torch

mrcabbage972 (Contributor) commented Jan 8, 2023

@theblackcat102 Maybe instead use bitsandbytes for a generic solution for 8-bit quantization? Are there any downsides to that?
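
For reference, a hedged sketch of what the bitsandbytes route can look like via the transformers 8-bit integration (assuming bitsandbytes and accelerate are installed); the checkpoint is a placeholder and this is not code from this PR:

```python
from transformers import AutoModelForCausalLM

# Load weights quantized to int8 by bitsandbytes instead of a model-specific conversion script.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",  # placeholder checkpoint
    load_in_8bit=True,
    device_map="auto",      # let accelerate place layers across available devices
)
```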

sanagno (Collaborator, Author) replied

Not really! Feel free to edit.

mrcabbage972 (Contributor) commented Jan 9, 2023

Ok, I'm adding an option to train with 8-bit Adam from BNB, as suggested here.

Also, I'm pretty sure I've previously seen a generic library that implements a whole pile of different adapter types; it could be convenient.
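
A minimal sketch of the 8-bit Adam option from bitsandbytes; the stand-in model and hyperparameters are placeholders, not the trainer's actual setup:

```python
import torch
import bitsandbytes as bnb

model = torch.nn.Linear(16, 16)  # stand-in for the actual SFT model
# bitsandbytes' 8-bit Adam stores optimizer state in 8-bit, reducing optimizer memory.
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-5, betas=(0.9, 0.95))
```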

mrcabbage972 (Contributor) commented

@sanagno Opened PR here
