[ML] Refactor OpenAI to use ConstructingObjectParser and consolidate class locations #124380

jonathan-buttner · 2025-03-07T20:59:58Z

This PR refactors the OpenAI response parsing logic.

Switches to use a ConstructingObjectParser for the response parsing logic from OpenAI
Consolidates the OpenAI classes into external.openai

…tor-openai-2

jonathan-buttner · 2025-03-07T21:00:36Z

...va/org/elasticsearch/xpack/inference/external/openai/OpenAiChatCompletionResponseEntity.java

-            ensureExpectedToken(XContentParser.Token.START_OBJECT, token, jsonParser);
-
-            positionParserAtTokenAfterField(jsonParser, "choices", FAILED_TO_FIND_FIELD_TEMPLATE);
+        try (var p = XContentFactory.xContent(XContentType.JSON).createParser(XContentParserConfiguration.EMPTY, response.body())) {


The major changes here are to use the ConstructingObjectParser instead of iterating by token.

jonathan-buttner · 2025-03-07T21:01:04Z

...n/java/org/elasticsearch/xpack/inference/external/openai/OpenAiEmbeddingsResponseEntity.java

@@ -0,0 +1,107 @@
+/*


This class was moved and transitioned to use the ConstructingObjectParser.

jonathan-buttner · 2025-03-07T21:01:20Z

...g/elasticsearch/xpack/inference/external/response/openai/OpenAiEmbeddingsResponseEntity.java

@@ -1,110 +0,0 @@
-/*


This was moved to external.openai and transitioned to use the ConstructingObjectParser.

jonathan-buttner · 2025-03-07T21:02:47Z

...elasticsearch/xpack/inference/external/action/azureopenai/AzureOpenAiActionCreatorTests.java

@@ -219,7 +219,7 @@ public void testCreate_AzureOpenAiEmbeddingsModel_FailsFromInvalidResponseFormat
            PlainActionFuture<InferenceServiceResults> listener = new PlainActionFuture<>();
            action.execute(new DocumentsOnlyInput(List.of("abc")), InferenceAction.Request.DEFAULT_TIMEOUT, listener);

-            var failureCauseMessage = "Failed to find required field [data] in OpenAI embeddings response";
+            var failureCauseMessage = "Required [data]";


If we encounter a parsing failure where a field is missing, the error message will be decorated on the way back in one of the upstream listeners. The error message will have the openai information included elsewhere.

jonathan-buttner · 2025-03-07T21:04:00Z

.../java/org/elasticsearch/xpack/inference/external/action/openai/OpenAiActionCreatorTests.java

@@ -533,7 +533,7 @@ public void testCreate_OpenAiChatCompletionModel_FailsFromInvalidResponseFormat(
            PlainActionFuture<InferenceServiceResults> listener = new PlainActionFuture<>();
            action.execute(new ChatCompletionInput(List.of("abc")), InferenceAction.Request.DEFAULT_TIMEOUT, listener);

-            var failureCauseMessage = "Failed to find required field [choices] in OpenAI chat completions response";
+            var failureCauseMessage = "Required [choices]";
            var thrownException = expectThrows(ElasticsearchStatusException.class, () -> listener.actionGet(TIMEOUT));
            assertThat(
                thrownException.getMessage(),


Here's an example of the message containing openai and chat completions information already.

elasticsearchmachine · 2025-03-07T22:12:03Z

Pinging @elastic/ml-core (Team:ML)

jonathan-buttner added 4 commits March 7, 2025 15:31

Switching openai to ConstructingObjectParser

4aa5ad0

Moving files

8c5bd9d

Merge branch 'main' of github.com:elastic/elasticsearch into ml-refac…

18f1e3f

…tor-openai-2

Fixing package errors

96fc06a

jonathan-buttner added >non-issue :ml Machine learning Team:ML Meta label for the ML team auto-backport Automatically create backport pull requests when merged v8.19.0 v9.1.0 Feature:GenAI Features around GenAI labels Mar 7, 2025

jonathan-buttner commented Mar 7, 2025

View reviewed changes

jonathan-buttner marked this pull request as ready for review March 7, 2025 22:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Refactor OpenAI to use ConstructingObjectParser and consolidate class locations #124380

[ML] Refactor OpenAI to use ConstructingObjectParser and consolidate class locations #124380

jonathan-buttner commented Mar 7, 2025

jonathan-buttner Mar 7, 2025

jonathan-buttner Mar 7, 2025

jonathan-buttner Mar 7, 2025

jonathan-buttner Mar 7, 2025

jonathan-buttner Mar 7, 2025

elasticsearchmachine commented Mar 7, 2025

[ML] Refactor OpenAI to use ConstructingObjectParser and consolidate class locations #124380

Are you sure you want to change the base?

[ML] Refactor OpenAI to use ConstructingObjectParser and consolidate class locations #124380

Conversation

jonathan-buttner commented Mar 7, 2025

jonathan-buttner Mar 7, 2025

Choose a reason for hiding this comment

jonathan-buttner Mar 7, 2025

Choose a reason for hiding this comment

jonathan-buttner Mar 7, 2025

Choose a reason for hiding this comment

jonathan-buttner Mar 7, 2025

Choose a reason for hiding this comment

jonathan-buttner Mar 7, 2025

Choose a reason for hiding this comment

elasticsearchmachine commented Mar 7, 2025