Fix supervised pretraining bugs and add new datasets #699

theblackcat102 · 2023-01-14T06:50:20Z

Code change

Mainly fixes for typos and logic errors introduced by a previous PR merge.

Code refactor in datasets

Dataset change

Added these datasets to the training list:

SODA
Joke Explanation
GSM8k

theblackcat102 · 2023-01-14T08:13:32Z

One other big change: the question and answer tokens are now <human> and <bot> for better clarity, since some of the datasets (e.g. SODA) contain multi-turn dialogue.
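To illustrate why speaker tokens suit multi-turn data better than a question/answer pair, here is a minimal sketch of how alternating turns might be flattened into one training string. Only the token strings <human> and <bot> come from this PR; the function name `format_dialogue`, the human-first turn order, and the plain concatenation are assumptions for illustration, not the repository's actual implementation.

```python
from typing import List

# Token strings as named in the comment above. How the real training code
# registers or joins them is not shown in this PR, so this is an assumption.
HUMAN_TOKEN = "<human>"
BOT_TOKEN = "<bot>"


def format_dialogue(turns: List[str]) -> str:
    """Prefix alternating turns with speaker tokens, assuming the human speaks first."""
    parts = []
    for i, turn in enumerate(turns):
        # Even-indexed turns are treated as human, odd-indexed as bot.
        token = HUMAN_TOKEN if i % 2 == 0 else BOT_TOKEN
        parts.append(f"{token}{turn.strip()}")
    return "".join(parts)


if __name__ == "__main__":
    print(format_dialogue(["Hi, how are you?", "Fine, thanks. And you?", "Doing well."]))
```

With a single pair of question/answer markers, a third turn would have no natural label; speaker tokens extend to any number of turns.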

model/supervised_finetuning/configs/config.yaml

sanagno

Thanks a lot for taking care of this! This was a pain.

theblackcat102 · 2023-01-14T12:19:54Z

Thanks a lot for taking care of this! This was a pain.

We should only allow PRs that come with test cases.

theblackcat102 · 2023-01-14T12:20:07Z

Going to merge now

theblackcat102 added 5 commits January 14, 2023 03:49

[fix] @ekurtulus major logic bug in summarization

9451aff

Merge branch 'main' of github.com:LAION-AI/Open-Assistant into main

3cfb501

[fix] Fix summarizer bug and QA typo issue

3966024

[feature] added GSM8k and code refactoring

1546111

[fix] Disable task specific evaluation

6f6c590

theblackcat102 requested a review from sanagno as a code owner January 14, 2023 06:50

theblackcat102 added the ml label Jan 14, 2023

sanagno reviewed Jan 14, 2023

model/supervised_finetuning/configs/config.yaml (outdated)

sanagno approved these changes Jan 14, 2023

[fix] Fix config typo

670be60

theblackcat102 merged commit dbf8f77 into main Jan 14, 2023

theblackcat102 deleted the sft-fixes branch January 14, 2023 12:20
