Name	Name	Last commit message	Last commit date
parent directory ..
configs	configs	Change name of InterpolatedPolicy to PerStepSwitchPolicy.	Mar 5, 2019
testdata	testdata	Release open-source implementation of Quillen et al. 2018.	Jan 11, 2019
README.md	README.md	Add an image of the grasping setup to the README.	Jan 11, 2019
__init__.py	__init__.py	internal	Jan 22, 2020
collect_eval.py	collect_eval.py	internal	Jan 22, 2020
cross_entropy.py	cross_entropy.py	internal	Jan 22, 2020
ddpg_graph.py	ddpg_graph.py	internal	Jan 22, 2020
episode_to_transitions.py	episode_to_transitions.py	internal	Jan 22, 2020
gin_imports.py	gin_imports.py	internal	Jan 22, 2020
grasping_env.py	grasping_env.py	internal	Jan 22, 2020
grasping_setup.png	grasping_setup.png	Add an image of the grasping setup to the README.	Jan 11, 2019
input_data.py	input_data.py	Adding average episode steps logging.	Feb 13, 2020
kuka.py	kuka.py	internal	Jan 22, 2020
policies.py	policies.py	internal	Jan 22, 2020
policies_test.py	policies_test.py	internal	Jan 22, 2020
q_graph.py	q_graph.py	Adding average episode steps logging.	Feb 13, 2020
requirements.txt	requirements.txt	Bump pillow and werkzeug versions	Dec 3, 2019
run.sh	run.sh	internal	Jan 22, 2020
run_env.py	run_env.py	Adding average episode steps logging.	Feb 13, 2020
run_env_test.py	run_env_test.py	internal	Jan 22, 2020
run_random_collect_oss.sh	run_random_collect_oss.sh	internal	Jan 22, 2020
run_train_collect_eval.py	run_train_collect_eval.py	internal	Jan 22, 2020
run_train_collect_eval_oss.sh	run_train_collect_eval_oss.sh	internal	Jan 22, 2020
schedules.py	schedules.py	internal	Jan 22, 2020
schedules_test.py	schedules_test.py	internal	Jan 22, 2020
tf_critics.py	tf_critics.py	internal	Jan 22, 2020
tf_modules.py	tf_modules.py	Explicitly replace "import tensorflow" with "tensorflow.compat.v1" fo…	Feb 14, 2020
train_collect_eval.py	train_collect_eval.py	internal	Jan 22, 2020
train_collect_eval_test.py	train_collect_eval_test.py	internal	Jan 22, 2020
train_ddpg.py	train_ddpg.py	Internal change	Feb 12, 2020
train_q.py	train_q.py	Internal change	Feb 12, 2020
writer.py	writer.py	internal	Jan 22, 2020

Name

Last commit message

Last commit date

configs

Change name of InterpolatedPolicy to PerStepSwitchPolicy.

Mar 5, 2019

testdata

Release open-source implementation of Quillen et al. 2018.

Jan 11, 2019

README.md

Add an image of the grasping setup to the README.

Jan 11, 2019

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

episode_to_transitions.py

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

Add an image of the grasping setup to the README.

Jan 11, 2019

input_data.py

Adding average episode steps logging.

Feb 13, 2020

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

Adding average episode steps logging.

Feb 13, 2020

requirements.txt

Bump pillow and werkzeug versions

Dec 3, 2019

run.sh

internal

Jan 22, 2020

run_env.py

Adding average episode steps logging.

Feb 13, 2020

run_env_test.py

internal

Jan 22, 2020

run_random_collect_oss.sh

internal

Jan 22, 2020

run_train_collect_eval.py

internal

Jan 22, 2020

run_train_collect_eval_oss.sh

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

Jan 22, 2020

Explicitly replace "import tensorflow" with "tensorflow.compat.v1" fo…

Feb 14, 2020

train_collect_eval.py

internal

Jan 22, 2020

train_collect_eval_test.py

Jan 22, 2020

Feb 12, 2020

Feb 12, 2020

Jan 22, 2020

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparison of Off-Policy Methods

This codebase implements learning algorithms and experiments from Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparison of Off-Policy Methods (ICRA 2018).

If you use this codebase for your research, please cite the paper:

@article{quillen2018deep,
  title={Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods},
  author={Quillen, Deirdre and Jang, Eric and Nachum, Ofir and Finn, Chelsea and Ibarz, Julian and Levine, Sergey},
  journal={IEEE International Conference on Robotics and Automation},
  year={2018}
}

Features

Several grasping environments with varying degrees of grasping difficulty.
Customizable DQL, MC, Supervised, Corr-MC, DDPG, PCL algorithms.
MC returns and elibility traces for biased returns.
Bash scripts for gathering data from random policies and running synchronous on-policy or off-policy experiments that alternate between training and evaluation.
Scripts to run grid search over hyperparameters.

Getting Started

The recommended way to set up these experiments is via a virtualenv

sudo apt-get install python-pip
python -m pip install --user virtualenv
python -m virtualenv ~/env
source ~/env/bin/activate

Then install the project dependencies in that virtualenv:

pip install -r dql_grasping/requirements.txt

The first step is then to collect off-policy grasping data with a random policy.

sh dql_grasping/run_random_collect_oss.sh

Then you can train with onpolicy re-collection. By default this runs Deep Q-Learning on the env_procedural environment.

sh dql_grasping/run_train_collect_eval_oss.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

dql_grasping

dql_grasping

README.md

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparison of Off-Policy Methods

Features

Getting Started

Files

dql_grasping

Directory actions

More options

Directory actions

More options

Latest commit

History

dql_grasping

Folders and files

parent directory

README.md

Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparison of Off-Policy Methods

Features

Getting Started