The working mechanism of the classifier #17

zhangyupeng123 · 2024-08-30T06:53:59Z

Dear author, thank you very much for your excellent work. I have a question that I would like to ask you. Is the classifier designed to calculate the cosine similarity between images and text in the same way as CLIP, or is it designed differently? I don't seem to have found detailed information on this part.

machuofan · 2024-09-01T14:28:31Z

Hi there, thank you for your interest in our work. Yes, the classifier works in the same way as CLIP, i.e., the classifier weights are essentially composed of text embeddings.
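For readers following along, here is a minimal sketch of what "classifier weights composed of text embeddings" usually means in a CLIP-style setup. It is not taken from this repository: the function names, the `logit_scale` value, and the random vectors standing in for real encoder outputs are all assumptions made for illustration.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so that a dot product equals cosine similarity."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def clip_style_logits(image_features, text_embeddings, logit_scale=100.0):
    """Score each image against each class.

    image_features:  (num_images, dim) array from a visual encoder (assumed).
    text_embeddings: (num_classes, dim) array, one row per class prompt; these
                     rows play the role of the classifier weight matrix.
    logit_scale:     temperature multiplier; 100.0 is an illustrative value.
    """
    img = l2_normalize(image_features)
    txt = l2_normalize(text_embeddings)
    return logit_scale * (img @ txt.T)  # (num_images, num_classes) cosine logits


# Random stand-ins for encoder outputs: 3 classes, 8-dimensional embeddings.
rng = np.random.default_rng(0)
text_embeddings = rng.normal(size=(3, 8))
# Two "images" that lie close to the embeddings of class 2 and class 0.
image_features = text_embeddings[[2, 0]] + 0.01 * rng.normal(size=(2, 8))

logits = clip_style_logits(image_features, text_embeddings)
predictions = logits.argmax(axis=1)
```

Because both sides are normalized, the highest logit is simply the class whose text embedding has the largest cosine similarity with the image feature.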

zhangyupeng123 · 2024-10-09T04:16:46Z

When training, is the input on the text side the image's title, or is it just a template like "a photo of " or "a "?

zhangyupeng123 · 2024-10-10T03:25:50Z

@machuofan When training, is the input on the text side the image's title, or is it just a template like "a photo of " or "a "?

machuofan · 2024-10-14T05:18:52Z

It's 'a xxx'.
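As a rough illustration of the 'a xxx' template, the sketch below fills the placeholder with category names. That "xxx" stands for the class name is an assumption on the reader's side, and `build_prompts` and the example class names are hypothetical, not part of this repository.

```python
def build_prompts(class_names, template="a {}"):
    """Fill a prompt template with each class name (assumed meaning of 'a xxx')."""
    return [template.format(name) for name in class_names]


# Hypothetical class names, for illustration only.
prompts = build_prompts(["cat", "dog"])
```

Each resulting string would then be passed through the text encoder, and the encoded vectors stacked to form the classifier weights.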
