Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Evaluation]Evaluation with Otter pre-trained model on COCO dataset return 0 CIDER score #287

Open
Thedatababbler opened this issue Nov 1, 2023 · 4 comments

Comments

@Thedatababbler
Copy link

Hi,
I used the evaluate.py file in the pipeline to evaluate my models. To make sure this evaluation can run on my single GPU node, I made some minor changes on the code for initialization of multi-gpu environments and remained other thing the same. The modified shell script is like the following:
`
#!/bin/bash

export CUDA_VISIBLE_DEVICES="0"
export MASTER_ADDR="localhost"
export MASTER_PORT="29501"
export WORLD_SIZE=4
export RANK=0

cd /path/to/Otter
realpath .
python -m pipeline.eval.evaluate
--model=otter
--results_file="OTTER_mpt1b_origin.json"
--model_path="luodian/OTTER-MPT1B-RPJama-Init" \

--precision="bf16" \
--batch_size=1 \
--eval_coco \
--device="cuda" \
--coco_train_image_dir_path "/path/to/images/train2014" \
--coco_val_image_dir_path "/path/to/coco/images/val2014" \
--coco_karpathy_json_path "/path/to/dataset_coco.json" \
--coco_annotations_json_path "/path/to/captions_val2014.json" \

`
Above shell script is used for running evaluation on COCO dataset with the pre-trained Otter 1b model. However, this evaluation result returns a 0 CIDEr store for all few-shots tests.

Magically, after I include the below argument in the run script, the evaluation returns a normal number for all tests.
`--checkpoint_path="path/to/checkpoint/OTTER-MPT1B-RPJama-Init/final_weights.pt'
Where the pt file here is a model fine-tuned by myself.
It seems like the model didn't properly loaded the pre-trained weights? That's why when my personalized ckpt file was loaded, it can return the results.

Could you help loacte the problem, which part of the code could possibly be blamed of this bug? Thank you!

@Luodian
Copy link
Owner

Luodian commented Nov 1, 2023 via email

@Thedatababbler
Copy link
Author

I tried to use "luodian/OTTER-Image-MPT7B" to replace the "luodian/OTTER_MPT1B_RPJama-Init" for the --model_path argument and the evaluation again. The CIDEr score is still 0.0 for all shots. It's really weird. What did I do wrong?

@Luodian
Copy link
Owner

Luodian commented Nov 2, 2023

@pufanyi Could Fanyi take a look at this issue, I thought we did the OTTER-MPT7B evaluation and report good numbers on COCO.

@chilljudaoren
Copy link

hei @pufanyi , You haven't come to solve this problem

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants