ValueError: Unknown vision tower: /share/junjie/shuyan/clip-vit-large-patch14-336 #16

Traceback (most recent call last):
  File "/home/LLM/videoxl/videoxl/infer.py", line 17, in <module>
    tokenizer, model, image_processor, _ = load_pretrained_model(model_path, None, "llava_qwen", device_map="cuda:0")
  File "/home/LLM/videoxl/videoxl/videoxl/model/builder.py", line 215, in load_pretrained_model
    model = LlavaQwenForCausalLM.from_pretrained(model_path, low_cpu_mem_usage=True, attn_implementation=attn_implementation, **kwargs)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1498, in from_pretrained
    model, loading_info = super().from_pretrained(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3404, in from_pretrained
    model = cls(config, *model_args, **model_kwargs)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1466, in __init__
    self.model = LlavaQwenModel(config)
  File "/home/LLM/videoxl/videoxl/videoxl/model/language_model/llava_qwen.py", line 1454, in __init__
    super(LlavaQwenModel, self).__init__(config)
  File "/home/LLM/videoxl/videoxl/videoxl/model/llava_arch.py", line 40, in __init__
    self.vision_tower = build_vision_tower(config, delay_load=delay_load)
  File "/home/LLM/videoxl/videoxl/videoxl/model/multimodal_encoder/builder.py", line 23, in build_vision_tower
    raise ValueError(f"Unknown vision tower: {vision_tower}")
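
The traceback ends in `build_vision_tower`, which rejects the configured vision tower path. One possible cause (an assumption, not confirmed in this issue) is that the checkpoint's `config.json` stores a path that exists only on the machine where the model was trained. The sketch below checks for that. The key name `mm_vision_tower` follows the LLaVA convention and is assumed here, and `check_vision_tower` is a hypothetical helper, not part of Video-XL.

```python
import json
import os


def check_vision_tower(model_dir, key="mm_vision_tower"):
    """Return (value, exists_locally) for the vision tower entry in config.json.

    `key` is an assumption borrowed from LLaVA-style configs; adjust it if
    the checkpoint uses a different field name.
    """
    config_path = os.path.join(model_dir, "config.json")
    with open(config_path, encoding="utf-8") as f:
        config = json.load(f)

    value = config.get(key)
    # A missing key or a path that is absent on this machine both count as "not found".
    exists = isinstance(value, str) and os.path.exists(value)
    return value, exists
```

Calling `check_vision_tower(model_path)` with the same `model_path` passed to `load_pretrained_model` returns the stored value and whether it exists locally. If it does not exist, pointing that entry at a local copy of `clip-vit-large-patch14-336` may resolve the error, assuming the builder accepts any existing path.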

