
Simplify embedding model support and loading #569

Merged: 2 commits into main on Jul 31, 2023

Conversation

joshdevins (Member)
We were attempting to load SentenceTransformers by looking at the model prefix, but SentenceTransformers can also be loaded from other orgs in the model hub, as well as from local disk, and the prefix check failed in both of those cases. To simplify the loading logic and the decision of which wrapper to use, we've removed support for loading a plain Transformer for text_embedding tasks. We now support only DPR embedding models and SentenceTransformer embedding models. If you try to load a plain Transformer model, it will be loaded via SentenceTransformers, and the SentenceTransformer library will automatically add a mean pooling layer. Since we no longer automatically support models that are neither DPR nor SentenceTransformers, we should include example code somewhere showing how to load a custom model without DPR or SentenceTransformers.

Note: This change will allow us to support E5 embeddings uploaded with eland. It does not yet solve adding the preamble instructions to query and index embeddings.
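For context on that remaining gap: E5 models expect each input text to carry an instruction prefix, "query: " for search queries and "passage: " for indexed documents. A minimal sketch of that preprocessing step (the helper name is illustrative, not part of eland):

```python
def add_e5_prefix(texts: list[str], kind: str) -> list[str]:
    """Prepend the E5 instruction prefix ("query: " or "passage: ") to each text."""
    if kind not in ("query", "passage"):
        raise ValueError("kind must be 'query' or 'passage'")
    return [f"{kind}: {t}" for t in texts]

print(add_e5_prefix(["how do embeddings work"], "query"))
# ['query: how do embeddings work']
```

This is the piece that would need to happen at ingest and query time, outside the model upload itself.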

See: https://github.com/UKPLab/sentence-transformers/blob/v2.2.2/sentence_transformers/SentenceTransformer.py#L801
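For reference, the pooling layer added at the linked code path is a masked mean over token embeddings. A minimal NumPy sketch of that operation (illustrative only; the actual implementation is the SentenceTransformer library's Pooling module, which uses PyTorch):

```python
import numpy as np

def mean_pool(token_embeddings: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Masked mean pooling: average token embeddings, ignoring padding positions.

    token_embeddings: shape (batch, seq_len, dim)
    attention_mask:   shape (batch, seq_len), 1 for real tokens, 0 for padding
    """
    mask = attention_mask[..., None].astype(token_embeddings.dtype)  # (batch, seq_len, 1)
    summed = (token_embeddings * mask).sum(axis=1)                   # (batch, dim)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)                   # avoid divide-by-zero
    return summed / counts

# Two real tokens and one padding token: the padded row is ignored.
emb = np.array([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = np.array([[1, 1, 0]])
print(mean_pool(emb, mask))  # [[2. 3.]]
```

This is what "a mean pooling layer will automatically be added" amounts to for a plain Transformer loaded through SentenceTransformers.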

Resolves #531

@joshdevins added labels bug (Something isn't working), enhancement (New feature or request), and topic:NLP (Issue or PR about NLP model support and eland_import_hub_model) on Jul 26, 2023
@joshdevins joshdevins requested review from davidkyle and tveasey July 26, 2023 09:19
@joshdevins joshdevins self-assigned this Jul 26, 2023
@joshdevins joshdevins force-pushed the joshdevins/sbert-e5 branch 2 times, most recently from 4208a28 to d3b74b5 Compare July 26, 2023 11:15
joshdevins (Member, Author)

Blog post to follow, draft WIP

@joshdevins joshdevins force-pushed the joshdevins/sbert-e5 branch from d3b74b5 to d285822 Compare July 26, 2023 13:09
This is mostly to force a rebuild of the PR workflow actions.
@joshdevins joshdevins closed this Jul 26, 2023
@joshdevins joshdevins reopened this Jul 26, 2023
davidkyle (Member)

@picandocodigo the old Jenkins CI is still present in Eland PRs. You mentioned last week that it will go away once the infra repository is updated. Is there any progress?

#561 (comment)

The good news is that buildkite is working

joshdevins (Member, Author)

@picandocodigo I've also rebased after you added that "ignore" file, and I've tried opening a new PR, but I get the same build error.

picandocodigo (Member)

@davidkyle @joshdevins great that Buildkite is working now! I removed clients-ci from the requested checks in the branch protection rules, so it should be possible to merge without that passing. But there's some work still pending on infrastructure for this to be fully removed, I'll update you soon!

tveasey left a comment
LGTM

davidkyle (Member) left a comment

LGTM

I tested loading ST models from local disk and they were correctly identified as SentenceTransformer models.

@joshdevins joshdevins merged commit f26fb8a into main Jul 31, 2023
@joshdevins joshdevins deleted the joshdevins/sbert-e5 branch July 31, 2023 16:18
Linked issue: [NLP] Add option for using the pooling layer in Text Embedding models