add training code for reward model #222

theblackcat102 · 2023-01-01T02:57:53Z

trainer code to train a single score reward model. Currently support webgpt and raw datasets from humanfeed back summary by openai. See readme and rank_datasets.py for more details.

andreaskoepf

Thanks for the PR! First step: Please run/install pre-commit, it is mandatory for all code that enters this repo.

andreaskoepf · 2023-01-01T10:41:12Z

.vscode/settings.json

@@ -1,4 +1,4 @@
 {
-  "python.formatting.provider": "black",


Sorry, but we require all contributors to use the same pre-commit rules.

just updated, please revise

I still see the provider as autopep8, when it should be black. do you maybe have a local commit that you didn't push yet?

andreaskoepf

Overall nice training code! Thanks a lot ... also for instantly responding to change requests on discord.

model/reward/instructor/requirements.txt

andreaskoepf · 2023-01-01T12:07:24Z

model/reward/instructor/tests/test_dataset.py

+from rank_datasets import DataCollatorForPairRank, HFSummary, WebGPT
+from torch.utils.data import DataLoader
+from transformers import AutoTokenizer
+


very useful file, a short docstring at the beginning would be nice to explain how it is used during dev/purpose (e.g. dataloader test, batch-shape inspection)

yk

just reset the formatting provider in settings.json to black, otherwise LGTM, thank you very much!

yk · 2023-01-01T12:22:09Z

.vscode/settings.json

@@ -1,4 +1,4 @@
 {
-  "python.formatting.provider": "black",


I still see the provider as autopep8, when it should be black. do you maybe have a local commit that you didn't push yet?

theblackcat102 · 2023-01-01T13:31:04Z

@yk yeah, it's my problem. just reset the format setting

theblackcat102 and others added 13 commits December 30, 2022 17:25

[feature] add rank dataset for webgpt and human feedback summary

ad98a28

[feature] working trainer code

bcd5c52

[fix] Fix missing accuracy and eval loss

b2ef469

[fix] Fix truncation in collate fn

3a10f10

[fix] Add drop_token_type to use galactica

d2572d0

[feature] added configs argument for parameters training and recording

f3c2997

[fix] Fix missing configs

24e0662

[feature] Add galactica training config

918b7b7

[fix] fix freeze top N layers

ba336fb

[feature] update reamde

c5b31d0

[feature] Add support for bloomz

0119ee6

[fix] Tidy up todo and trainer comments

e27a3eb

[merge] most of the bugs should be fixed. LAION-AI#77

a5a2625

theblackcat102 requested review from yk and andreaskoepf as code owners January 1, 2023 02:57

theblackcat102 added 4 commits January 1, 2023 03:07

[fix] Use official split for eval

4b7f1f2

[feature] remove dependency to download hfsummary manually

8b15536

[fix] dataset split name

1197dcc

[feature] added summary quality rater

168e9ca

andreaskoepf requested changes Jan 1, 2023

View reviewed changes

theblackcat102 added 2 commits January 1, 2023 11:35

[fix] remove vscode settings

1ddd915

[fix] pre-commit update

fe99b46

andreaskoepf self-requested a review January 1, 2023 11:50

theblackcat102 added 3 commits January 1, 2023 11:56

[fix] rerun pre-commit

4d01704

Merge branch 'fix'

a388d1c

[fix] Revert deleted vscode

28e0b4f

andreaskoepf approved these changes Jan 1, 2023

View reviewed changes

yk approved these changes Jan 1, 2023

View reviewed changes

[fix] Fix provider

8f0028b

yk merged commit 29c6491 into LAION-AI:main Jan 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add training code for reward model #222

add training code for reward model #222

theblackcat102 commented Jan 1, 2023

andreaskoepf left a comment

andreaskoepf Jan 1, 2023

theblackcat102 Jan 1, 2023

yk Jan 1, 2023

andreaskoepf left a comment

andreaskoepf Jan 1, 2023

yk left a comment

yk Jan 1, 2023

theblackcat102 commented Jan 1, 2023

add training code for reward model #222

add training code for reward model #222

Conversation

theblackcat102 commented Jan 1, 2023

andreaskoepf left a comment

Choose a reason for hiding this comment

andreaskoepf Jan 1, 2023

Choose a reason for hiding this comment

theblackcat102 Jan 1, 2023

Choose a reason for hiding this comment

yk Jan 1, 2023

Choose a reason for hiding this comment

andreaskoepf left a comment

Choose a reason for hiding this comment

andreaskoepf Jan 1, 2023

Choose a reason for hiding this comment

yk left a comment

Choose a reason for hiding this comment

yk Jan 1, 2023

Choose a reason for hiding this comment

theblackcat102 commented Jan 1, 2023