Why 26 encoders instead of 27?

#2 by niktheod - opened

Hi and thank you for your great work,

I noticed that your vision tower has 26 transformer encoder layers rather than the 27 in siglip-so400m-patch14-384. Could you explain this discrepancy, please?
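
For reference, the depth of the original SigLIP checkpoint can be read directly from its config (a minimal sketch with `transformers`; the commented-out repo id is a placeholder for this model, and whether its vision tower config is exposed the same way is an assumption):

```python
from transformers import AutoConfig

# Depth of the reference SigLIP checkpoint mentioned above.
ref = AutoConfig.from_pretrained("google/siglip-so400m-patch14-384")
print(ref.vision_config.num_hidden_layers)  # -> 27

# Hypothetical: if this model's config exposes the vision tower the same way,
# the same field would show the truncated depth (26).
# vlm = AutoConfig.from_pretrained("<this-repo-id>")
# print(vlm.vision_config.num_hidden_layers)
```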