🌼
Inner peace
Interested in RL, Robotics, and LLMs.
-
KAUST
- Saudi Arabia
-
15:58
(UTC +03:00) - https://mengdi-li.github.io/
- @limengdi2
- in/limengdi2
Pinned Loading
-
robotic-occlusion-reasoning
robotic-occlusion-reasoning PublicPyTorch implementation of "Robotic Occlusion Reasoning for Efficient Object Existence Prediction" (IROS 2021)
Python 8
-
internally-rewarded-rl
internally-rewarded-rl Public[ICML 2023] Code for paper "Internally Rewarded Reinforcement Learning"
Jupyter Notebook 10
-
awesome-RLAIF
awesome-RLAIF PublicA continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
-
vanilla-RLAIF-pipeline
vanilla-RLAIF-pipeline PublicAn implementation of a vanilla RLAIF pipeline, utilizing GPT-2-Large for the summarization task with the TL;DR dataset.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.