multi-image support for llama3.2 #705
base: habana_main
Conversation
Force-pushed from 8477d17 to 08541ff
@kdamaszk @michalkuligowski @kzawora-intel, can you help review this PR? Already tested on 11B and 90B.
Force-pushed from c9bfe70 to 7223fb8
@yma11 I observed an accuracy regression on the MMMU val dataset with Llama 3.2 11B Vision Instruct. Let's sync offline.
Force-pushed from 69411f3 to 0c2759e
LGTM
@PatrykWo please review
@kdamaszk I see some errors in CI. Please rebase onto the base branch so you are up to date and we can confirm that the new changes do not influence the PR.
Signed-off-by: yan ma <[email protected]>
Signed-off-by: yan ma <[email protected]>
Signed-off-by: yan ma <[email protected]>
I've tested the multi-image support and it seems to work properly, but with Fused SDPA in the cross-attention layers accuracy is worse. Please revert the changes made to the SDPA computation.
is_causal=False)
output = output.permute(2, 0, 1, 3).reshape(
    q_len, self.num_local_heads * self.head_dim)
return output
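For context, below is a minimal sketch of the non-causal cross-attention path the reviewers ask to keep, built around torch's F.scaled_dot_product_attention. The function name, argument list, and the 4-D (kv_heads, queries_per_kv, seq_len, head_dim) layout are assumptions chosen to match the permute/reshape in the snippet above; this is not code taken verbatim from the PR.

```python
import torch
import torch.nn.functional as F


def cross_attention_sdpa(q: torch.Tensor,
                         k: torch.Tensor,
                         v: torch.Tensor,
                         attention_mask: torch.Tensor,
                         num_local_heads: int,
                         head_dim: int) -> torch.Tensor:
    """Illustrative cross-attention using F.scaled_dot_product_attention.

    Assumes q, k, v are 4-D tensors laid out as
    (kv_heads, queries_per_kv, seq_len, head_dim); these shapes are an
    assumption made to match the permute/reshape shown above.
    """
    q_len = q.shape[-2]
    # Cross-attention over image tokens is non-causal, hence is_causal=False.
    output = F.scaled_dot_product_attention(q, k, v,
                                            attn_mask=attention_mask,
                                            is_causal=False)
    # (kv_heads, queries_per_kv, q_len, head_dim)
    #   -> (q_len, kv_heads, queries_per_kv, head_dim),
    # then flatten the head dimensions into the hidden size.
    output = output.permute(2, 0, 1, 3).reshape(
        q_len, num_local_heads * head_dim)
    return output
```

Per the review comments, the fused SDPA kernel would stay out of these cross-attention layers until the accuracy gap is understood.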
@yma11 This PR works fine, but using Fused SDPA in cross attention results in worse accuracy. It would be better to keep F.scaled_dot_product_attention here.
No description provided.