
The ge_layer implementation is not exactly the same as that in the paper #1

Open · zding047 opened this issue Nov 18, 2022 · 1 comment

zding047 commented Nov 18, 2022

In the paper, when stride=2, the left branch (Figure 5 (c)) has two depth-wise layers. Their output channels are the same: expand_ratio * C. But in the code, because the second depth-wise layer is chained onto the output of the first, its output has expand_ratio * expand_ratio * C channels.

Also, the expansion is with respect to the input dimension, instead of the output dimension.


bisenetv2-tf2/model.py, lines 14 to 37 in e018b3b:

def ge_layer(x_in, c, e=6, stride=1):
    x = layers.Conv2D(filters=c, kernel_size=(3,3), padding='same')(x_in)
    x = layers.BatchNormalization()(x)
    x = layers.Activation('relu')(x)
    if stride == 2:
        x = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), strides=2, padding='same')(x)
        x = layers.BatchNormalization()(x)
        y = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), strides=2, padding='same')(x_in)
        y = layers.BatchNormalization()(y)
        y = layers.Conv2D(filters=c, kernel_size=(1,1), padding='same')(y)
        y = layers.BatchNormalization()(y)
    else:
        y = x_in
    x = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), padding='same')(x)
    x = layers.BatchNormalization()(x)
    x = layers.Conv2D(filters=c, kernel_size=(1,1), padding='same')(x)
    x = layers.BatchNormalization()(x)
    x = layers.Add()([x, y])
    x = layers.Activation('relu')(x)
    return x
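
For illustration only, here is a minimal sketch of how the stride=2 path could avoid the double expansion: the second depth-wise convolution in the main branch uses depth_multiplier=1, so both depth-wise layers output e * c channels rather than the second one growing to e * e * c. The function name ge_layer_fixed is made up for this sketch, the shortcut branch is left exactly as in the quoted snippet, and this is not necessarily the change made in the referenced pull request; it also does not address the input-vs-output-dimension point.

    from tensorflow.keras import layers

    def ge_layer_fixed(x_in, c, e=6, stride=1):
        # 3x3 conv: channels -> c
        x = layers.Conv2D(filters=c, kernel_size=(3,3), padding='same')(x_in)
        x = layers.BatchNormalization()(x)
        x = layers.Activation('relu')(x)
        if stride == 2:
            # first depth-wise conv expands once: channels -> e * c
            x = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), strides=2, padding='same')(x)
            x = layers.BatchNormalization()(x)
            # second depth-wise conv keeps e * c channels instead of expanding again
            x = layers.DepthwiseConv2D(depth_multiplier=1, kernel_size=(3,3), padding='same')(x)
            x = layers.BatchNormalization()(x)
            # shortcut branch, unchanged from the quoted code
            y = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), strides=2, padding='same')(x_in)
            y = layers.BatchNormalization()(y)
            y = layers.Conv2D(filters=c, kernel_size=(1,1), padding='same')(y)
            y = layers.BatchNormalization()(y)
        else:
            # stride 1: single depth-wise expansion, identity shortcut
            x = layers.DepthwiseConv2D(depth_multiplier=e, kernel_size=(3,3), padding='same')(x)
            x = layers.BatchNormalization()(x)
            y = x_in
        # 1x1 projection back to c channels, then residual add
        x = layers.Conv2D(filters=c, kernel_size=(1,1), padding='same')(x)
        x = layers.BatchNormalization()(x)
        x = layers.Add()([x, y])
        x = layers.Activation('relu')(x)
        return x

With c=32 and e=6, the main branch in this sketch carries 192 channels after both depth-wise layers, whereas the quoted code reaches 6 * 6 * 32 = 1152 channels after the second one.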

markus-k added a commit that referenced this issue Nov 18, 2022
markus-k (Owner) commented

Thank you for reporting the issue! I have to admit that I'm quite out of the loop regarding ML, as this was part of my studies. I have created a pull request that should fix the double expansion; feel free to have a look.

> Also, the expansion is with respect to the input dimension, instead of the output dimension.

I'm not entirely sure what you mean by that, or how I'd go about fixing it. I'd be happy to accept a pull request :)
