If Ok, please give as many details as possible to help us solve the problem more efficiently.
Configuration
Version of DeepDetect:
Locally compiled on:
Ubuntu 14.04 LTS
Mac OSX
Other:
Docker
Amazon AMI
Commit (shown by the server when starting): ecdfad8
Your question / the problem you're facing:
I launched a training run for an image model. The LMDB creation went well (no errors seen), but at some point during training I got a core dump.
Note that this happened during the second epoch, so all the training data had already been seen and the test set had been predicted once.
Error message (if any) / steps to reproduce the problem:
Here are the logs I obtained when it core dumped:
Server log output:
libpng warning: Ignoring bad adaptive filter type
libpng warning: Ignoring bad adaptive filter type
libpng warning: Ignoring bad adaptive filter type
libpng warning: Ignoring bad adaptive filter type
libpng warning: Ignoring bad adaptive filter type
libpng error: IDAT: CRC error
[2020-07-24 10:06:14.222] [caffe] [error] Could not decode datum
terminate called after throwing an instance of 'CaffeErrorException'
what(): src/caffe/data_transformer.cpp:895 / Check failed (custom): cv_cropped_image.data
[1] 5337 abort (core dumped) ./dede --port 8081
I've searched a bit and it might be due to a corrupted image, but if that is the case I don't understand how the first epoch completed correctly.
My hunch is that you have a bad PNG somewhere. I can't say much about the epochs: data augmentation is randomized and datums are prefetched by three threads, so the point at which a bad image triggers the failure may vary.
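One way to follow up on the bad-PNG hunch is to scan the source images before rebuilding the LMDB. The following is a minimal sketch, not part of DeepDetect: the function names `png_problems` and `scan_directory` are made up for illustration, and it assumes the original PNG files are still available on disk. It only walks the PNG chunks and verifies each stored CRC (the check behind `libpng error: IDAT: CRC error`); it does not decode the pixel data, so it would not catch the "bad adaptive filter type" warnings.

```python
import struct
import zlib
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_problems(data: bytes) -> list:
    """Walk the chunks of a PNG byte string and list structural problems.

    Hypothetical helper: checks the signature, chunk lengths and chunk CRCs
    only. It does not decompress or decode the image data.
    """
    if not data.startswith(PNG_SIGNATURE):
        return ["missing PNG signature"]

    problems = []
    pos = len(PNG_SIGNATURE)
    seen_iend = False

    # Each chunk is: 4-byte length, 4-byte type, body, 4-byte CRC.
    while pos + 8 <= len(data):
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        name = ctype.decode("latin-1")
        body_end = pos + 8 + length
        if body_end + 4 > len(data):
            problems.append(f"{name}: truncated chunk")
            break

        (stored_crc,) = struct.unpack(">I", data[body_end:body_end + 4])
        # The CRC covers the chunk type and body, not the length field.
        actual_crc = zlib.crc32(data[pos + 4:body_end]) & 0xFFFFFFFF
        if stored_crc != actual_crc:
            problems.append(f"{name}: CRC error")

        if ctype == b"IEND":
            seen_iend = True
            break
        pos = body_end + 4

    if not seen_iend and not problems:
        problems.append("no IEND chunk")
    return problems


def scan_directory(root: str) -> dict:
    """Return {path: problems} for every *.png under root that has problems."""
    report = {}
    for path in sorted(Path(root).rglob("*.png")):
        found = png_problems(path.read_bytes())
        if found:
            report[str(path)] = found
    return report
```

Calling `scan_directory` on the training image folder returns a dictionary mapping each suspicious file to the problems found, so those files can be removed or re-exported before the dataset is rebuilt. A file that passes this check can still fail to decode, so a full decode with an image library remains the stronger test.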