mengdi-li

Follow

🌼

Inner peace

Mengdi Li mengdi-li

🌼

Inner peace

Follow

Interested in RL, Robotics, and LLMs.

14 followers · 23 following

KAUST
Saudi Arabia
15:58 (UTC +03:00)
https://mengdi-li.github.io/
@limengdi2
in/limengdi2

Achievements

Achievements

Pinned Loading

robotic-occlusion-reasoning robotic-occlusion-reasoning Public

PyTorch implementation of "Robotic Occlusion Reasoning for Efficient Object Existence Prediction" (IROS 2021)

Python 8
internally-rewarded-rl internally-rewarded-rl Public

[ICML 2023] Code for paper "Internally Rewarded Reinforcement Learning"

Jupyter Notebook 10
awesome-RLAIF awesome-RLAIF Public

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

153 4
vanilla-RLAIF-pipeline vanilla-RLAIF-pipeline Public

An implementation of a vanilla RLAIF pipeline, utilizing GPT-2-Large for the summarization task with the TL;DR dataset.

Python 1 1