Why is the output of siglip+resampler different between torch and llama.cpp? #18

@Xwmiss

Hi, and thanks for your work.
When I feed the same picture to the torch model and the llama.cpp model, the result of the siglip+resampler part (shape (1, 96, 4096)) differs between the two: the cosine similarity is only 0.75.
I am confused about why this happens.
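
For anyone trying to reproduce this, here is a minimal sketch of how the comparison could be done. It assumes both outputs have already been dumped to arrays of the same shape, for example (1, 96, 4096). The function name `cosine_report` and the file names in the usage note are hypothetical and not part of either codebase. It reports one cosine similarity over the flattened tensors plus one value per token.

```python
import numpy as np


def cosine_report(a, b, eps=1e-12):
    """Compare two embedding arrays of identical shape (..., tokens, dim).

    Returns (flat, per_token):
      flat      - cosine similarity of the two flattened arrays
      per_token - cosine similarity along the last axis, shape (..., tokens)
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")

    # Single number over everything, comparable to the 0.75 reported above.
    flat = float(
        a.ravel() @ b.ravel()
        / (np.linalg.norm(a) * np.linalg.norm(b) + eps)
    )

    # One value per token: dot product over the feature axis divided by
    # the product of the two per-token norms.
    num = (a * b).sum(axis=-1)
    den = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1) + eps
    per_token = num / den
    return flat, per_token
```

A possible usage, assuming the two outputs were saved as `torch_out.npy` and `llamacpp_out.npy` (hypothetical names): load both with `np.load`, call `flat, per_token = cosine_report(torch_out, llamacpp_out)`, and print `per_token.min()` and `per_token.mean()`. The per-token values may help show whether the mismatch is spread evenly over all 96 tokens or concentrated in a few of them.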
