
ValueError: Input image size (490*490) doesn't match model (336*336). #461

Open
ZTWHHH opened this issue Dec 25, 2024 · 4 comments

ZTWHHH commented Dec 25, 2024

When I ran the example inference code for the xcomposer2-vl-7b model provided on its Hugging Face page:

import torch
from transformers import AutoModel, AutoTokenizer

torch.set_grad_enabled(False)

# init model and tokenizer
model = AutoModel.from_pretrained('internlm/internlm-xcomposer2-vl-7b', trust_remote_code=True).cuda().eval()
tokenizer = AutoTokenizer.from_pretrained('internlm/internlm-xcomposer2-vl-7b', trust_remote_code=True)

# query with an image placeholder, plus the path to the local image to describe
query = '<ImageHere>Please describe this image in detail.'
image = 'Our image path'

# run inference under mixed precision
with torch.cuda.amp.autocast():
    response, _ = model.chat(tokenizer, query=query, image=image, history=[], do_sample=False)
print(response)

I got an error:
ValueError: Input image size (490*490) doesn't match model (336*336)
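Since the failure turns out to depend on the installed transformers release (see the last comment in this thread), a minimal sketch for recording the relevant versions when reproducing (not part of the original report):

# Minimal environment dump for reproducing this report; the size check that
# raises the ValueError depends on the installed transformers release.
import torch
import transformers

print("torch:", torch.__version__)
print("transformers:", transformers.__version__)
print("CUDA available:", torch.cuda.is_available())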


dle666 commented Dec 30, 2024

I had the same problem. Did you solve it?


ZTWHHH commented Jan 11, 2025

> I had the same problem. Did you solve it?

I haven't solved it, but XComposer-2.5 works.


Moshindeiru commented Jan 11, 2025 via email

@stickydream commented

This is caused by the transformers package version being too new. Downgrading it to 4.40.0 lets the code run normally.
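For reference, a minimal sketch of that workaround, assuming pip manages the environment and taking 4.40.0 as the last known-good release reported here:

# Guard the inference script against a transformers release newer than 4.40.0,
# the version reported to work in this thread. If the check fails, downgrade
# from the shell, e.g.: pip install "transformers==4.40.0"
from packaging import version
import transformers

if version.parse(transformers.__version__) > version.parse("4.40.0"):
    raise RuntimeError(
        f"transformers {transformers.__version__} may reject the 490*490 input; "
        "try downgrading to 4.40.0"
    )

Pinning transformers==4.40.0 in the project's requirements achieves the same thing without a runtime check.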
