Actions: neuralmagic/vllm

pre-commit

88 workflow runs

[doc] clarify multi-node serving doc (#13558)
pre-commit #63: Commit ad5a35c pushed by tlrmchlsmth
February 19, 2025 15:54 4m 52s main
[CI/Build] migrate static project metadata from setup.py to pyproject…
pre-commit #62: Commit a02c86b pushed by tlrmchlsmth
February 18, 2025 16:10 4m 37s main
[WIP] Working Grouped gemm with group ID
pre-commit #61: Pull request #48 synchronize by ElizaWszola
February 18, 2025 13:54 4m 49s grouped-gemm-with-group-id
[Bugfix] Fix VLLM_USE_MODELSCOPE issue (#13384)
pre-commit #60: Commit ce77eb9 pushed by tlrmchlsmth
February 17, 2025 14:55 4m 46s main
[V1][PP] Cache Intermediate Tensors (#13353)
pre-commit #59: Commit e18227b pushed by tlrmchlsmth
February 16, 2025 18:03 4m 35s main
[Bugfix] Fix 2 Node and Spec Decode tests (#13341)
pre-commit #58: Commit 5d2965b pushed by kylesayrs
February 16, 2025 15:09 4m 38s main
[ci/build] update flashinfer (#13323)
pre-commit #57: Commit 54ed913 pushed by tlrmchlsmth
February 15, 2025 14:39 4m 31s main
[V1][Core] min_p sampling support (#13191)
pre-commit #56: Commit a12934d pushed by tlrmchlsmth
February 14, 2025 23:59 4m 42s main
[Hardware][Gaudi][Bugfix] Fix error for guided decoding (#12317)
pre-commit #55: Commit c9e2d64 pushed by tlrmchlsmth
February 14, 2025 14:23 5m 33s main
[WIP] Working Grouped gemm with group ID
pre-commit #54: Pull request #48 synchronize by ElizaWszola
February 14, 2025 07:44 4m 49s grouped-gemm-with-group-id
Revert "Add label if pre-commit passes" (#13242)
pre-commit #53: Commit e38be64 pushed by tlrmchlsmth
February 14, 2025 00:50 4m 44s main
Add label if pre-commit passes (#12527)
pre-commit #52: Commit bffddd9 pushed by tlrmchlsmth
February 13, 2025 21:33 4m 50s main
[Frontend] Add /v1/audio/transcriptions OpenAI API endpoint (#12909)
pre-commit #51: Commit d84cef7 pushed by SageMoore
February 13, 2025 18:20 5m 27s main
[V1][Bugfix] Copy encoder input ids to fix set iteration issue during…
pre-commit #50: Commit 4c0d93f pushed by tlrmchlsmth
February 12, 2025 21:06 4m 46s main
February 12, 2025 17:47 5m 30s
[CI/Build][Bugfix] Fix CPU backend default threads num (#13077)
pre-commit #48: Commit 565c1ef pushed by rahul-tuli
February 11, 2025 17:36 5m 59s main
[V1][Metrics] Add several request timing histograms (#12644)
pre-commit #47: Commit 75e6e14 pushed by SageMoore
February 11, 2025 15:23 5m 38s main
[Bugfix] Clean up and fix multi-modal processors (#13012)
pre-commit #46: Commit 51f0b5f pushed by varun-sundar-rabindranath
February 10, 2025 14:18 5m 33s main
[ROCm] [Feature] [Doc] [Dockerfile] [BugFix] Support Per-Token-Activa…
pre-commit #45: Commit eaa92d4 pushed by tlrmchlsmth
February 7, 2025 16:54 5m 20s main
Prevent unecessary requests to huggingface hub (#12837)
pre-commit #44: Commit 6e1fc61 pushed by varun-sundar-rabindranath
February 7, 2025 08:46 4m 34s main
[MISC] Check space in the file names in the pre commit checks (#12804)
pre-commit #43: Commit 741429a pushed by tlrmchlsmth
February 7, 2025 00:03 5m 35s main
[V1] LoRA Support (#10957)
pre-commit #42: Commit 467a96a pushed by tlrmchlsmth
February 6, 2025 18:47 4m 44s main
[Attention] Use FA3 for MLA on Hopper (#12807)
pre-commit #41: Commit c786e75 pushed by dsikka
February 6, 2025 13:25 4m 40s main
[VLM] Qwen2.5-VL
pre-commit #40: Commit bf3b79e pushed by tlrmchlsmth
February 5, 2025 22:11 4m 46s main
pre-commit
pre-commit #39: by varun-sundar-rabindranath
February 5, 2025 15:43 4m 47s main