A more general support for QA and summarization (review and feedback needed) #576

ekurtulus · 2023-01-09T19:16:27Z

With this update, I am aiming for generalizing our training framework. I need feedback and review on the structure that I propose. Accordingly, I will make the codebase fully functional and extend it. Additionally, I added more papers to the docs side.

see this issue

I think we need support not only for QA but also summarization. In this PR, I wanted to extend the existing datasets, add support for seq2seq models like T5, PolyLoss and few other modifications. Normally, I was also planning to add support for Sharpness Aware Training, but it requires overriding the Huggingface Trainer's _inner_training_loop function. I can do that but I am sure if it is something we want. If we decide to do so, I can submit another PR.

Furthermore, if the structure of this commit is approved, I can add more tasks, a script for synthetic data generation through data augmentation, and more.

yk · 2023-01-09T20:14:42Z

you have some merge conflict markers in your files

sanagno · 2023-01-09T22:40:56Z

Looks good, after the merge! Do you know which models use the SAM optimizer?

andrewm4894 · 2023-01-09T22:42:21Z

docs/docs/research/general.md

@@ -2,6 +2,43 @@

 This page lists research papers that are relevant to the project.

+<<<<<<< HEAD


i think this means you might need to update from upstream/master

@ekurtulus it might be cleaner if you move your work onto a feature branch in your fork. That way you can keep your main in sync with LAION-AI/Open-Assistant:main and might make it easier for then updating your feature branch from main. I find this approach a bit easier to work with given how often main is changing with so much work going on.

ekurtulus · 2023-01-10T06:56:37Z

Looks good, after the merge! Do you know which models use the SAM optimizer?

None. It is an optimizer you use in conjunction with other optimizers. Here is the link to the paper. There are papers claiming it to be increasing generalization like this one.

sanagno · 2023-01-10T10:43:35Z

Looks great, If you can merge the final conflicts and add a default accuracy metric as before would be great!

yk · 2023-01-10T20:02:52Z

@ekurtulus did you close this on purpose?

ekurtulus · 2023-01-10T20:04:57Z

@ekurtulus did you close this on purpose?

Yes, I realized that I was not using pre-commit files and not branching properly. While trying to set everything up properly, I had to delete and re-fork the main repository. So, it was closed automatically. I will submit a new PR.

ekurtulus · 2023-01-11T08:41:47Z

Please see PR 619.

ekurtulus requested review from andrewm4894, andreaskoepf, yk, theblackcat102 and sanagno as code owners January 9, 2023 19:16

ekurtulus mentioned this pull request Jan 9, 2023

Try Supervised Fine-Tuning on pseudo-QA-data #48

Closed

andrewm4894 reviewed Jan 9, 2023

View reviewed changes

yk added the ml label Jan 9, 2023

ekurtulus closed this Jan 10, 2023

ekurtulus force-pushed the main branch from 04fe1c6 to 8ce84ec Compare January 10, 2023 18:03

ekurtulus mentioned this pull request Jan 11, 2023

Continuing #576 #619

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A more general support for QA and summarization (review and feedback needed) #576

A more general support for QA and summarization (review and feedback needed) #576

ekurtulus commented Jan 9, 2023 •

edited

Loading

yk commented Jan 9, 2023

sanagno commented Jan 9, 2023

andrewm4894 Jan 9, 2023

andrewm4894 Jan 9, 2023

ekurtulus commented Jan 10, 2023

sanagno commented Jan 10, 2023

yk commented Jan 10, 2023

ekurtulus commented Jan 10, 2023

ekurtulus commented Jan 11, 2023

		@@ -2,6 +2,43 @@

		This page lists research papers that are relevant to the project.

		<<<<<<< HEAD

A more general support for QA and summarization (review and feedback needed) #576

A more general support for QA and summarization (review and feedback needed) #576

Conversation

ekurtulus commented Jan 9, 2023 • edited Loading

yk commented Jan 9, 2023

sanagno commented Jan 9, 2023

andrewm4894 Jan 9, 2023

Choose a reason for hiding this comment

andrewm4894 Jan 9, 2023

Choose a reason for hiding this comment

ekurtulus commented Jan 10, 2023

sanagno commented Jan 10, 2023

yk commented Jan 10, 2023

ekurtulus commented Jan 10, 2023

ekurtulus commented Jan 11, 2023

ekurtulus commented Jan 9, 2023 •

edited

Loading