Hi, thank you for your work! Just want to ask how are your visual features extracted? Saw in the paper that you used a combination of depthwise and pointwise convolutions. However, the image features are already extracted in the dataset that you have downloaded. Is there any script for how you extracted the features and trained the convolutional module?
Hi, thank you for your work! Just want to ask how are your visual features extracted? Saw in the paper that you used a combination of depthwise and pointwise convolutions. However, the image features are already extracted in the dataset that you have downloaded. Is there any script for how you extracted the features and trained the convolutional module?