-
Thank you for your contribution, it's really helpful for a newcomer! How to extend SACPID to multi-constraint situation? Although I've noticed similar discussions[#258], I still haven't figured out how to do it. I think more cost_critics should be introduced to estimate the cost, but I don't know how to deal with these λs. |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 11 replies
-
Thanks for your recognition, and it is a good question. Suppose you have multiple critics, then the formulation of CMDPs with multiple costs is: |
Beta Was this translation helpful? Give feedback.
-
Thank you very much for providing your implementation reference for supporting multiple costs in OmniSafe's Logger for the open-source community. Congratulations on the positive results from your experiments. However, due to significant differences between your code and OmniSafe's structure, we are having difficulty pinpointing the issues quickly. Could you please try incorporating your changes to the Logger directly in |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
Hey, I modified the logger, and the algorithm seems to be experimentally correct, but features like tensorboard don't work. . .
I'm not a professional programmer and my code may be inefficient and have potential logic errors. I would be very happy if you could help me check if there are any errors in the modified code and improve it. And how to make tansorboard work properly?
new_algo.txt