PyTorch implementation of Context Encoders: Feature Learning by Inpainting (Pathak et al., 2016), based on the official Lua implementation.
The implementation supports the following datasets:
- CIFAR-10 / CIFAR-100
- SVHN
- Caltech101 / Caltech256
- STL10
- HAM10000
- ImageNet
Required Python packages are listed in `requirements.txt`. All dependencies can be installed using pip:

```
pip install -r requirements.txt
```

or using conda:

```
conda install --file requirements.txt
```
Context encoder training is started by running the following command (add `--pbar` to show a progress bar during training):

```
python main.py
```
All command-line arguments that can be used to adapt the configuration of the context encoder are defined and described in `arguments.py`.
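Assuming the arguments are parsed with Python's standard `argparse` (a common setup for such scripts; the definitive list is in `arguments.py`), the full set of options can also be printed directly:

```
python main.py --help
```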
By default, the following configuration is run:

```
dataset: 'cifar10'
epochs: 50
batch_size: 64
lr: 0.0002
beta1: 0.5
beta2: 0.999
w_rec: 0.999
overlap: 0
bottleneck: 4000
image_size: 128
mask_area: 0.25
device: 'cuda'
out_dir: 'context_encoder'
```
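Any of these defaults can be overridden on the command line. As an example, the invocation below assumes flag spellings that mirror the configuration keys above (the authoritative spellings are defined in `arguments.py`):

```
python main.py --dataset cifar100 --epochs 100 --batch-size 128 --pbar
```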
Note: if `--random-masking` is not set, `int(mask_area * image_size)` has to be in `[32, 64, 128]`, as specified in the paper.
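For the defaults above the constraint is satisfied, since `int(0.25 * 128) = 32`. A minimal sketch of the check (the variable names simply follow the configuration keys; the actual validation lives in the repository code):

```python
# Mask-size constraint for fixed (non-random) masking, mirroring the note above.
mask_area = 0.25   # default mask_area
image_size = 128   # default image_size

mask_size = int(mask_area * image_size)  # 32 with the defaults
assert mask_size in (32, 64, 128), \
    f"mask size {mask_size} requires --random-masking"
```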
In addition to these, the following arguments can be used to further configure the context encoder training process (an example invocation is shown after the list):

- `--random-masking`: if this flag is specified, the context encoder is trained to inpaint images in which regions are randomly masked out.
- `--device <cuda / cpu>`: specify whether training should be run on the GPU (if available) or the CPU.
- `--num-workers <num_workers>`: number of workers used by the torch dataloader.
- `--resume <path to run_folder>`: resumes a training run saved at the specified path, e.g. `out/context_encoder_training/run_0`. Dataset splits, model state, optimizer state, etc. are loaded and training is resumed with the specified arguments.
- See `arguments.py` for more.
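For instance, assuming `--resume` is passed to `main.py` like the other training arguments, a previous run can be continued with a progress bar as follows:

```
python main.py --resume out/context_encoder_training/run_0 --pbar
```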
Alternatively, the `polyaxon.yaml` file can be used to start the context encoder training on a Polyaxon cluster:

```
polyaxon run -f polyaxon.yaml -u
```

For a general introduction to Polyaxon and its command-line client, please refer to the official documentation.
The training progress (loss, accuracy, etc.) can be monitored using TensorBoard as follows:

```
tensorboard --logdir <result_folder>
```

This starts a TensorBoard instance at `localhost:6006`, which can be opened in any common browser.
A trained context encoder can be evaluated by running:

```
python3 eval.py --run-path out/context_encoder_training/run_0 --pbar --device <cuda / cpu>
```

where `--run-path` specifies the path at which the run to be evaluated is saved. Additionally, the `eval.py` script saves a grid of reconstructed images to the run folder (if `--save-path` is not specified). Alternatively, one can also check all metrics over all epochs using the TensorBoard file.
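If the metrics are needed outside the TensorBoard UI, the event file can also be read programmatically. A minimal sketch using TensorBoard's `EventAccumulator` (the run path is the example from above; the available scalar tag names depend on what the training script actually logs):

```python
from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

# Point the accumulator at a run folder containing a TensorBoard events file.
acc = EventAccumulator("out/context_encoder_training/run_0")
acc.Reload()  # parse the event file from disk

print(acc.Tags()["scalars"])        # list all logged scalar tags
tag = acc.Tags()["scalars"][0]      # pick any logged metric
for event in acc.Scalars(tag):
    print(event.step, event.value)  # training step and metric value
```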
Citation for the original paper:

```
@inproceedings{pathak2016context,
  title={Context encoders: Feature learning by inpainting},
  author={Pathak, Deepak and Krahenbuhl, Philipp and Donahue, Jeff and Darrell, Trevor and Efros, Alexei A},
  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
  pages={2536--2544},
  year={2016}
}
```