Question about some operations #3

Hi, I'm following this work and have some questions:
1. `modified_clip/model.py` and `modified_clip/open_model.py` perform different operations: [model.py](https://github.com/linsun449/cliper.code/blob/master/modified_clip/model.py#L119) uses `img_features[kth] = ln_x`, while [open_model.py](https://github.com/linsun449/cliper.code/blob/master/modified_clip/open_model.py#L123) uses `img_features[kth] = ln_x - img_features[kth]`. Why is there such a difference? As I understand it, the two files should differ only in which ViT-based CLIP models they load, through the CLIP and OpenCLIP packages respectively.
2. In the [forward of model.py](https://github.com/linsun449/cliper.code/blob/master/modified_clip/model.py#L173-183), are operations such as concatenating `fg_text_features.mean(0, True)` into `text_features`, and `seg_last[seg_last < seg_last.amax(0, keepdim=True) * 0.2] = 0`, used to improve performance? How was the threshold of 0.2 determined?
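
To check that I'm reading the thresholding line correctly, here is a minimal sketch on a toy tensor. The shape convention (rows as spatial positions, columns as classes) and the values are my own assumptions, not taken from the repo; only the last operation is copied from `model.py`.

```python
import torch

# Toy stand-in for seg_last. Assumed layout: rows are spatial positions,
# columns are classes. The values are made up for illustration.
seg_last = torch.tensor([[0.9, 0.2],
                         [0.1, 0.5],
                         [0.5, 0.05]])

# Per-column maximum over dim 0, kept with shape (1, 2) so it broadcasts
# against every row of seg_last.
col_max = seg_last.amax(0, keepdim=True)

# Same operation as in the issue: zero every entry that is below 20% of
# the maximum of its own column.
seg_last[seg_last < col_max * 0.2] = 0

print(seg_last)
```

If this reading is right, the 0.2 factor is a relative cutoff per column rather than an absolute one, which is why I'm curious how it was chosen.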

BTW, this code is simple yet elegant. Thanks for your impressive work. 
