fix: allow pass image kwargs to image processor

#90

by bigmoyan - opened Mar 13

base: refs/heads/main

←

from: refs/pr/90

Discussion Files changed

-1

bigmoyan

Mar 13

No description provided.

allow pass images kwargs to image-processor7a5529b1

bigmoyan

Mar 13

Why we need this?

(1) From a design standpoint, I see no justification for why image_processor does not accept images_kwargs.
(2) This leads to a critical bug: passing an image with height=3 and a specific width causes the vLLM service to crash immediately. The crash occurs because height=3 images resolve to different dimensions during embedding calculation versus token calculation, causing a length mismatch between the computed token count and the actual embeddings produced. Without support for **image_kwargs, we have no workaround from the client side.

I will submit a PR to vLLM to fix this issue on the vLLM side, but the prerequisite is that image_processor must accept externally passed **kwargs.

bigmoyan changed pull request status to open Mar 13

bigmoyan

Mar 13

•

edited Mar 13

see: https://github.com/vllm-project/vllm/pull/36959 @Tingquan @ChengCui

bigmoyan

Mar 20

someone pls take a look @Tingquan @ChengCui

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment