Correct the configuration of LLaVA-CoT by XuGW-Kevin · Pull Request #705 · open-compass/VLMEvalKit

XuGW-Kevin · 2024-12-31T03:06:16Z

First of all, we sincerely appreciate the work of VLMEvalKit and the tremendous contributions it has made to the entire VLM community! However, the current configuration of LLaVA-CoT (e.g., max_new_tokens) is incorrect, leading to significant deviations in the benchmark test results. This PR aims to correct the configuration of LLaVA-CoT.
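As a rough illustration of why a generation cap such as `max_new_tokens` can shift benchmark scores for a chain-of-thought model, consider the minimal sketch below. It is not code from VLMEvalKit or LLaVA-CoT: the helper names, the `ANSWER:` marker, and the token counts are all hypothetical and chosen only to show the effect of truncating a long response before its final answer.

```python
# Hypothetical illustration only; none of these names come from VLMEvalKit.

def truncate(tokens, max_new_tokens):
    """Keep at most `max_new_tokens` tokens, as a generation cap would."""
    return tokens[:max_new_tokens]


def has_final_answer(tokens, marker="ANSWER:"):
    """Return True if the (made-up) answer marker survived truncation."""
    return marker in tokens


# A made-up chain-of-thought response: 50 reasoning tokens, then the answer.
response = ["step"] * 50 + ["ANSWER:", "B"]

# A cap smaller than the reasoning length cuts the answer off entirely.
short = truncate(response, 32)

# A generous cap leaves the whole response, including the answer, intact.
full = truncate(response, 2048)

print(has_final_answer(short), has_final_answer(full))
```

With a cap that is too small, an answer extractor never sees the conclusion, so the sample is scored as wrong regardless of the model's reasoning.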

* update vlrewardbench
* pre-commit fix
* formatter
* [Improvement] Better `AUTO_SPLIT` and model split for InternVL2
* [Minor] Improve CC-OCR Import
* [Model] Support QVQ
* [Model] Update Molmo Eval to Match Official Implementation (#648)
* add molmo prompts
* fix lint format
* [Fix] Refine Qwen-VL2 device assignment
* [Fix] Fix RealWorldQA md5
* update MMMU_DEV_VAL tsv
* [Fix] Fix confusing image width&height (#704)

Co-authored-by: Yuan Ye <yuany2@chinatelecom.cn>

* Update llama_vision.py (#705)
* [Fix] Fix Lint
* Fix Lint
* Fix Lint

---------

Co-authored-by: kennymckormick <dhd.efz@gmail.com>
Co-authored-by: jamespark3922 <jspark96@cs.washington.edu>
Co-authored-by: CMeteor <CMeteor@users.noreply.github.com>
Co-authored-by: Yuan Ye <yuany2@chinatelecom.cn>
Co-authored-by: Guowei Xu <113534787+XuGW-Kevin@users.noreply.github.com>


Update llama_vision.py

adb6075

kennymckormick merged commit 6e1a59a into open-compass:main Dec 31, 2024

kennymckormick pushed a commit to TobiasLee/VLMEvalKit that referenced this pull request Jan 1, 2025

Update llama_vision.py (open-compass#705)

3691698

Mercury7353 pushed a commit to Mercury7353/VLMEvalKit that referenced this pull request Apr 28, 2025

Update llama_vision.py (open-compass#705)

96b5bce

Koii2k3 pushed a commit to wjnwjn59/VLMEvalKit that referenced this pull request Nov 13, 2025

Update llama_vision.py (open-compass#705)

804f33d

kennymckormick merged 1 commit into open-compass:main from XuGW-Kevin:main

XuGW-Kevin commented Dec 31, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

XuGW-Kevin commented Dec 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

XuGW-Kevin commented Dec 31, 2024 •

edited

Loading