You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Aftering some digging in an onnx file, I've figured out why this is. For others wondering, pangu's window_partition returns a 4-D tensor, instead of the standard 3-D tensor in other swin-transformers:
Hi, thanks for the clarification. As far as I can see, the understanding is correct. Just to say, the window_partition function naturally returns a 4D tensor because we are dealing with 3D data while the standard Swin transformer is dealing with 2D images.
I am confused about the way the pseudo code defined
type_of_windows
:Given that input to the attention:
and type_of_windows defined in the pseudocode:
Its size would be:
which contradicates the size prescribed in the paper:
Mlat*Mpl
, which presumably should be:It would not work in actual code unless
type_of_window
is set to:which is in turn not the way
position_index
is designed to do.Any help is appreicated
The text was updated successfully, but these errors were encountered: