-
-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Satellite Unet in multi-gpu #12
Comments
This comment has been minimized.
This comment has been minimized.
Hi @LJ-20 |
These are the lines for each of the codes: model = custom_unet( (256,256,3), num_classes=1, use_batch_norm=True, upsample_mode='deconv', use_dropout_on_upsampling=False, dropout=0.0, dropout_change_per_layer=0.0, filters=64, num_layers=4, output_activation='sigmoid') with the command: model = multi_gpu_model(model, gpus=4,cpu_relocation=True) The implentation followed the tensorflow documentation. https://www.tensorflow.org/api_docs/python/tf/keras/utils/multi_gpu_model?version=stable The error was the following: |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
@LJ-20 So it's not that you're not able to run satellite unet on multi gpu but more about it's not converging in multi gpu setup.. Hmm very interesting, thank you for brining that up! I will look into it and try debugging it however as of right now I don't see anything that could be causing this from the model implementation perspective.
Let me know how did it go! EDIT: |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
@muminoff I was able to reproduce your problem and debug it and it is related to Keras/tf.keras dependencies mismatch. |
@karolzak the pixel range is read from 0 to 1, the numpy arrays are dtype float32 and also tried the multi_gpu parameters. Our first thought was the batch normalization or the way the weights are merged in multi_gpu but we didn't have this problem with the custom_unet using the exact same code. |
Update. Upon revision, it seems like the problem was the float32. I had it set up as float64. Why is this an issue? |
@LJ-20 , so you used |
I used float64 for both custom_unet and satellite_unet and it only worked with custom_unet |
Hello
I wasn't able to run the Satellite Unet in multi-gpu. I didn't have this problem with the custom unet.
The text was updated successfully, but these errors were encountered: