test: Add OpenAI frontend testing for LLM API backend #8040
base: main
Conversation
LGTM! Thanks!
python/openai/tests/conftest.py
Outdated
if LLMAPI_SETUP:
    model = "tensorrt_llm"
else:
    model = "tensorrt_llm_bls"
Does it make sense to set backend = "llmapi" and use the backend fixture to check for llmapi in the tests, following similar patterns for vllm/trtllm in the existing tests?
ex:
server/python/openai/tests/test_chat_completions.py
Lines 313 to 316 in 205f13c
if backend != "tensorrtllm":
    pytest.skip(
        reason="Only used to test TRT-LLM-specific temperature behavior"
    )
Not sure what the config.pbtxt backend field will be for LLM API though - so let me know what you're thinking.
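For example, a rough sketch of what I mean (names here are just placeholders, assuming the conftest exposed backend == "llmapi" for the LLM API setup):

import pytest


def test_llmapi_specific_behavior(backend):
    # Sketch only: mirrors the tensorrtllm-specific skip pattern quoted above.
    if backend != "llmapi":
        pytest.skip(reason="Only used to test LLM API-specific behavior")
    # ... LLM API-specific assertions would go here ...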
The config.pbtxt backend field will be python for the LLM API backend.
I thought that the LLM API backend is one of the tensorrtllm backends, so I didn't add a separate "llmapi" backend fixture.
I agree that avoiding an ENV variable makes more sense. However, right now the logic to determine which backend to use is to import the tensorrt_llm or vllm module, check whether there's an ImportError, and then set the backend and model name accordingly.
server/python/openai/tests/conftest.py
Lines 47 to 54 in 205f13c
try:
    import tensorrt_llm as _

    backend = "tensorrtllm"
    model = "tensorrt_llm_bls"
    return backend, model
except ImportError:
    print("No tensorrt_llm installation found.")
Both the general tensorrtllm and the llmapi backend cases will import the tensorrt_llm module successfully, so I think some sort of flag would still be required to know whether this is a general trtllm model or an llmapi one.
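For illustration, the detection could look roughly like this (sketch only; it reuses the LLMAPI_SETUP env flag from this PR, since the import check alone can't distinguish the two setups):

import os

try:
    import tensorrt_llm as _

    backend = "tensorrtllm"
    # The import succeeds for both the plain TRT-LLM and the LLM API setups,
    # so an explicit flag is still needed to pick the right model name.
    if os.environ.get("LLMAPI_SETUP", "0") not in ("", "0"):
        model = "tensorrt_llm"
    else:
        model = "tensorrt_llm_bls"
except ImportError:
    print("No tensorrt_llm installation found.")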
The config.pbtxt backend field will be python for the LLM API backend.
Do we plan to add a python-based backend, like vllm is?
Do you mean making it backend: llmapi in the config.pbtxt? I don't think there's a plan to do so right now. I could see it being less confusing to have that once the pivot is more finalized; right now it's more like adding another python model just like the original one.
Oh right, because they're not using the runtime config.pbtxt feature currently - they just swapped the backend to python instead. Gotcha, makes sense.
Will the llmapi model.py replace the existing model.py in the trtllm backend? Hopefully we're not planning to support 3 implementations for now 😅
Agree - confirming w/ TRT-LLM team on this.
python/openai/tests/conftest.py
Outdated
### TEST ENVIRONMENT SETUP ###
LLMAPI_SETUP = os.environ.get("LLMAPI_SETUP", 0)
Would be nice to clean up some of the if/else scattered everywhere for the llmapi vs trtllm handling, but hopefully we can consolidate the supported backend implementations once it's a bit more mature.
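e.g. a single conftest fixture could keep that branching in one place (rough sketch; the fixture name is just a placeholder):

import os

import pytest


@pytest.fixture(scope="session")
def is_llmapi() -> bool:
    # Placeholder fixture: tests would depend on this instead of repeating
    # the LLMAPI_SETUP / backend checks themselves.
    return os.environ.get("LLMAPI_SETUP", "0") not in ("", "0")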
I'll be able to merge this PR only after the LLM API backend MR is merged. It's also pending on CI - we're having some build issues that are not related to the openai changes. I'll ask for a final review once we're ready to merge. Thanks!
Marking as draft for clarity - feel free to unmark it when ready.
What does the PR do?
Add OpenAI frontend testing for the LLM API backend. Some of the test cases are skipped for now due to issues with the random seed sampling parameter; will follow up with the TRT-LLM team.
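For context, the skipped cases would follow the usual pytest skip pattern, roughly like this (illustrative only; the test name and reason text are placeholders):

import pytest


@pytest.mark.skip(reason="Random seed sampling parameter issue with the LLM API backend; following up with the TRT-LLM team")
def test_seed_sampling():
    ...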
Checklist
<commit_type>: <Title>
Commit Type: Check the conventional commit type box here and add the label to the GitHub PR.
Related PRs: LLM API backend MR.
Where should the reviewer start?
Test plan:
Caveats:
Background
Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)