-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversion error: Data rank or shape of sizes input is required to static #46
Comments
Hi @leets0429 , |
@tersekmatija could someone from your team look into it? I think it would be valuable to also have a tutorial on how to convert&deploy |
@Erol444 I think the problem with general support for models from Azure ML is that likely not all models are supported from the drag-and-drop tools they provide. Also, customers can train their own custom models. For example the model from @leets0429 seems to be like that - either custom one or one provided by Azure ML. It seems to have some operations that cause the shape to be dynamic, which is not supported.
Is it possible to change the model and retrain it inside Azure ML @leets0429 ? Or get more information about the model itself - what do the predictions look like, is there some metadata available, etc.? |
@tersekmatija It is our custom model. The model is a maskrcnn_resnet50_fpn, that is use to recognize sugarbeet and the output are bounding boxes, labels, scores (confidence level), mask. When validation was done in Azure ML studio it works. I think Azure by default uses Pytorch but model can be exported in onnx format. Nevertheless, I'll ask my colleague to retrain the model with fixed orientation. The training was done using landscape and portrait images. |
@leets0429 Best, Matija |
I've recreate a new model (Yolov5) with static images yet the problem still persists. How should I resolved the problem? Do I need to manually convert it instead of using the blob converter. Do you have any resources regarding such topics? I've attached the model here |
@leets0429 So, it seems that generated ONNX does not have a fixed batch size. This can be solved by specifying flag Your export will still fail, that is because of the ScatterND operation in ONNX graph. This happens when you are directly changing the slices, like
Currently, the only solution is to modify the code directly. See this issue for an example. For Yolo models however, we offer the NMS and post-processing directly on device. If you can download best.pt weights saved during the model training, I'd recommend you to use our tools. Simply upload best.pt weights, define input shape, and choose Yolo version. Then open the link in the left panel for instructions on how to run your model. |
@tersekmatija Correct me if I'm wrong, in order to remove the ScatterND, I have to change the code during the conversion from Pytorch to ONNX? |
@leets0429 Yes it's likely the post-processing code in the Detect layer. I'd still suggest you to use tools.luxonis.com for YoloV5-V7. |
@tersekmatija I've tried using your Yolo tool but it said error while loading the model. here's my pt file |
Hey @leets0429 , thanks. We will give this a look. In the meantime can you tell me if official YoloV5 repository was used to produce this .pt? |
@tersekmatija I have no information on the Yolo repo. The model was trained from using Azure Automated ML. |
Hm, thanks. It could be that they are using some other source code inside their tool rather than the official repo. We will investigate this with the .pt file. Can you please also send me the ONNX model that you can export with their tool? @leets0429 |
The onnx file can be found here #46 (comment) |
@tersekmatija any updates on the Yolo tool issue? |
@leets0429 Could you please share with us some examples of images and the expected outputs? So that we could test the model. |
@leets0429 The issue with the model that you've shared with us is that it is a custom implementation of YoloV5 which seems to have a slightly different postprocessing. We are looking into that. I am not sure a standard support in tools.luxonis.com will be possible, but in case we can match their postprocessing with ours we can share an example of how you can export it yourself. Besides an example of an input, do you maybe have access to the source code? For the best plug-and-play experience I'd recommend training your model with official Ultralytics' YoloV5 GitHub repository. Not sure how feasible this is with Azure ML. |
In the "input" folder, you will find the required images. As for right now, I am only able to deploy the onnx model from Azure locally. You can find it in the onnx_local_deploy folder. It is not the same model, but it uses Yolov5 too and trained through Azure. The output should be bounding boxes with label. I tried to deploying the Pytorch model from Azure on my laptop but the parameter names are not configured correctly. I think this is the same issue you are facing now too. I've contacted support and hopefully I could get a solution from them in a few days. I can find the source code but it uses Azure services, requiring certain credentials, hence I don't think you can use it. Regarding the ScatterND problem, shouldn't your Blob Converter considered all kinds of ONNX Layer? As far as I understand ONNX was supposed to be standardized format to ease deployment. As for right now, I could manually convert pt to onnx to avoid the ScatterND issues, but in the long term, it seems troublesome to deal with such exception everytime. |
Thanks, @leets0429, @HonzaCuhel will give it a look and give you some updates on the export process when we have them. The reason I am asking for the source code is to determine whether some different post-processing is used in their YoloV5 version or not, also during the training (for example some constant subtraction). This would help us properly export the model and test it out to make sure it works. We've modified the ONNX model in the way that our DepthAI API expects it, but are seeing some false positive detections. Regarding the ScatterND, blobconverter is using OpenVINO model optimizer and compile tool in the background which seems to support it, but they don't support ScatterNDUpdate. So, even if ONNX, which is supposed to be a general format supports some layer, it doesn't mean that all of them are supported in OpenVINO and consequently on OAK-D. You can see only supported layers and their limitations here. As mentioned, this happens due to updating the box "slices" in the postprocessing code in YoloV5. Latest version seems to avoid that by concatenating the box locations and sizes, while this was not the case in the older versions. If you modify the postporcessing code to match that, you should be able to export it without a problem, at least to the NMS layer which could be causing some troubles as well. Note that if you succeed, NeuralNetwork node does all off the post-processing. If you train your model with official YoloV5 repository and export it using tools.luxonis.com, post-processing is not done in the NeuralNetwork, but still completely on device. The benefit is that it enables you to directly use our YoloDetectionNetwork, which you can combine with SpatialLocationCalculator for example. |
@tersekmatija @HonzaCuhel I just found out that the model provided was not a well trained model. Hence false positive detections are expected. Could you provided me the modified model and also the way how you modify it so in future if I have a better model, I can do it myself? Thanks |
Thank you for the info, and I apologize for the longer delay in my answer, @leets0429. Here is a Colab notebook that takes care of the conversion. I tried to export both models, and the second one (that detects salads and weed) works, but because the model is rather heavy, the performance was around 6fps when I tested it on OAK-1. So for better performance, I'd recommend using a lighter version of YoloV5 if possible. |
@HonzaCuhel Hi thank you very much for the script. Could you send me the converted blob file so that I can test it on my OAK right away? |
@leets0429, no problem! Yeah, absolutely. Here are the converted files. |
@HonzaCuhel I'm not sure whether I deployed the model on the OAK correctly. This is my result after increasing the threshold to 0.8 or else I receive error about to much metadata. I suspect I interpret the result incorrectly hence the weird bounding boxes. I've seen you have attached a json file too. Is that relevant to interpretation of the result? If so, how should I used it in the OAK pipeline. I've faced similar issue when I try to use the ONNX model on my laptop but it was resolved after correctly interpreting the result. Hence, I suspect what I got now is due to the same cause. I've attached my OAK pipeline script (oak_stream_custom_nn_noros.py) and also the local deployment onnx script (onnx_local_deploy_example/examples) |
I haven't checked the export script, but the processing you are doing in the Python script is indeed incorrect. You are using a MobileNetDetectionNetwork which we use for SSDLite based detectors. You should replace it with YoloDetectionNetwork, similar as here (L66). Note that for Yolo you need to use the items from the JSON and set them as parameters of the detection network, like here (L82-L90). You can give it a quick try using
|
@tersekmatija The model is working now but the fps is quite low (around 5 fps). What are the possible ways that I can increase it? We are using the device on the moving vehicle, hence fps is quite important for us. Another problem is how do I change the exposure of the camera with this script I've tried using the updateColorCamConfig() but I received the following error. So far I can't find any documentation about the PipelineManager in your website. Here is my script. |
@leets0429 As far as the script goes, |
I have a model that was created using Azure Machine Learning Studio.
When I try to convert the onnx model, I received the following error. What does it mean and how should I correct it?
The text was updated successfully, but these errors were encountered: