[Bug]: init_mm_limits_per_prompt not being called when using V1 + TensorSplit + Qwen2VL #12245
Closed
Labels: bug
Your current environment
The output of `python collect_env.py`
Model Input Dumps
No input.
🐛 Describe the bug
The V1 engine works with Qwen2-VL on a single GPU, but not with tensor splitting across multiple GPUs. In the multi-GPU case, `init_mm_limits_per_prompt` is not called.