Skip to content

Actions: vllm-project/vllm

PR Reminder Comment Bot

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,340 workflow runs
4,340 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Quantization][FP8] Adding support for fp8 gemm layer input in fp8
PR Reminder Comment Bot #4341: Pull request #14578 opened by gshtras
March 10, 2025 21:15 12s
March 10, 2025 21:15 12s
[V1] Cache logits buffer in sampler
PR Reminder Comment Bot #4340: Pull request #14577 opened by WoosukKwon
March 10, 2025 21:12 13s
March 10, 2025 21:12 13s
[V1] Prevent xgrammar from breaking TPU support
PR Reminder Comment Bot #4339: Pull request #14575 opened by russellb
March 10, 2025 20:35 11s
March 10, 2025 20:35 11s
[Bugfix][TPU][V1] Disable StructuredOutputManager import on TPU
PR Reminder Comment Bot #4338: Pull request #14573 opened by NickLucche
March 10, 2025 17:26 4m 21s
March 10, 2025 17:26 4m 21s
[BugFix/Build] Fix sparse kernels not getting built on hopper
PR Reminder Comment Bot #4337: Pull request #14572 opened by LucasWilkinson
March 10, 2025 16:34 11s
March 10, 2025 16:34 11s
[Minor] Update the tqdm bar for parallel sampling
PR Reminder Comment Bot #4336: Pull request #14571 opened by WoosukKwon
March 10, 2025 16:31 16s
March 10, 2025 16:31 16s
Mseznec/flash attention fp8
PR Reminder Comment Bot #4335: Pull request #14570 opened by mickaelseznec
March 10, 2025 15:40 13s
March 10, 2025 15:40 13s
permute/unpermute kernel for moe optimization
PR Reminder Comment Bot #4334: Pull request #14568 opened by CalebDu
March 10, 2025 14:26 15s
March 10, 2025 14:26 15s
benchmarks: simplify test jsonschema
PR Reminder Comment Bot #4333: Pull request #14567 opened by russellb
March 10, 2025 14:26 14s
March 10, 2025 14:26 14s
Fix typo in benchmark_serving_structured_output.py
PR Reminder Comment Bot #4332: Pull request #14566 opened by russellb
March 10, 2025 14:11 13s
March 10, 2025 14:11 13s
[Doc] Update PaliGemma note to a warning
PR Reminder Comment Bot #4331: Pull request #14565 opened by DarkLight1337
March 10, 2025 14:09 12s
March 10, 2025 14:09 12s
[Hardware][Intel GPU] upgrade IPEX dependency to 2.6.10.
PR Reminder Comment Bot #4330: Pull request #14564 opened by jikunshang
March 10, 2025 13:44 14s
March 10, 2025 13:44 14s
[Build/CI] Upgrade xgrammar to >=0.1.15
PR Reminder Comment Bot #4329: Pull request #14563 opened by russellb
March 10, 2025 13:30 15s
March 10, 2025 13:30 15s
Correct capitalisation: VLLM -> vLLM
PR Reminder Comment Bot #4328: Pull request #14562 opened by hmellor
March 10, 2025 13:26 12s
March 10, 2025 13:26 12s
Correct capitalisation: Github -> GitHub
PR Reminder Comment Bot #4327: Pull request #14561 opened by hmellor
March 10, 2025 13:07 14s
March 10, 2025 13:07 14s
[Misc] Correct deepseek-vl2 chat template
PR Reminder Comment Bot #4326: Pull request #14558 opened by Isotr0py
March 10, 2025 12:08 11s
March 10, 2025 12:08 11s
[Docs] Make installation URLs nicer
PR Reminder Comment Bot #4325: Pull request #14556 opened by hmellor
March 10, 2025 11:41 11s
March 10, 2025 11:41 11s
[BugFix][TritonMLA] Process weights after model loading
PR Reminder Comment Bot #4324: Pull request #14555 opened by tywuAMD
March 10, 2025 11:39 11s
March 10, 2025 11:39 11s
[Bugfix][v1] fixed llava-hf/llava-1.5-7b-hf is broken on V1
PR Reminder Comment Bot #4323: Pull request #14554 opened by chaunceyjiang
March 10, 2025 11:33 15s
March 10, 2025 11:33 15s
Add max output length limit
PR Reminder Comment Bot #4322: Pull request #14553 opened by tjandy98
March 10, 2025 11:27 12s
March 10, 2025 11:27 12s
[Docs] Mention model_impl arg when explaining Transformers fallback
PR Reminder Comment Bot #4321: Pull request #14552 opened by hmellor
March 10, 2025 11:21 11s
March 10, 2025 11:21 11s
[Frontend] Skip stop in reasoning content
PR Reminder Comment Bot #4320: Pull request #14550 opened by gaocegege
March 10, 2025 11:06 11s
March 10, 2025 11:06 11s
Move dockerfiles into their own directory
PR Reminder Comment Bot #4319: Pull request #14549 opened by hmellor
March 10, 2025 11:02 13s
March 10, 2025 11:02 13s
[V1][Bugfix] Fix handing of second_per_grid_ts for Qwen2-VL & Qwen2.5-VL
PR Reminder Comment Bot #4318: Pull request #14548 opened by ywang96
March 10, 2025 09:22 11s
March 10, 2025 09:22 11s
[Core] Refactor QKVCrossParallelLinear implementation to support BNB 4-bit quantization
PR Reminder Comment Bot #4317: Pull request #14545 opened by Isotr0py
March 10, 2025 08:55 12s
March 10, 2025 08:55 12s