adapt llava pipeline to latest Transformers #1344

Spycsh · 2025-02-28T03:07:26Z

Description

OH main branch is using a old Transformers version which causes a security issue. This PR upgrades this microservice to use latest TF and use the image-text-to-text pipeline for general VLM image understanding QA tasks.

There are existing code that handle multi-image/non-image cases. However, the handling logics does not work with the new image-text-to-text task name. Furthermore, explicitly overwriting the preprocess method only works with specific TF version and is prone to version upgrade. Batching inference also cannot explain the coherence between images (e.g. What are the differences between those 2 given images). Therefore for simplicity, we decide to remove that handling of multi-image inputs but keep the non-image inputs.

change TF pipeline name from image-to-text to image-text-to-text
simplify the preprocessing logics, keep the handling when image is empty string, and remove the multi-image inference in one run
fix few README errors

Issues

n/a

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

latest TF instead of 4.45.2 that currently used by OH

Tests

UT

- change TF pipeline name from image-to-text to image-text-to-text huggingface/transformers#34769 - simplify the preprocessing logics, keep the handling when image is empty string, and remove the multi-image inference in one run - fix few README errors

for more information, see https://pre-commit.ci

…o llava_pipe

Spycsh requested a review from lvliang-intel as a code owner February 28, 2025 03:07

pre-commit-ci bot and others added 3 commits February 28, 2025 03:07

[pre-commit.ci] auto fixes from pre-commit.com hooks

e24c5d6

for more information, see https://pre-commit.ci

fix torch

17abf1b

Merge branch 'llava_pipe' of https://github.com/Spycsh/GenAIComps int…

57a4910

…o llava_pipe

lvliang-intel approved these changes Mar 1, 2025

View reviewed changes

lvliang-intel requested a review from letonghan March 1, 2025 13:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adapt llava pipeline to latest Transformers #1344

adapt llava pipeline to latest Transformers #1344

Spycsh commented Feb 28, 2025 •

edited

Loading

adapt llava pipeline to latest Transformers #1344

Are you sure you want to change the base?

adapt llava pipeline to latest Transformers #1344

Conversation

Spycsh commented Feb 28, 2025 • edited Loading

Description

Issues

Type of change

Dependencies

Tests

Spycsh commented Feb 28, 2025 •

edited

Loading