v0.4.4
API Change
- context in new task pipeline is implemented by dataclass now, rather than dict
- recommended visualization tool is wandb now, rather than tensorboard
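
To illustrate what moving the context from a dict to a dataclass means in practice, here is a minimal sketch. The class name `SketchContext` and its fields are hypothetical and chosen only for illustration; they are not the actual Context definition shipped in this release.

```python
from dataclasses import dataclass, field, asdict


@dataclass
class SketchContext:
    # Hypothetical fields for illustration only; not the real Context class.
    env_step: int = 0
    train_iter: int = 0
    extras: dict = field(default_factory=dict)  # per-instance mutable default


# Dict-style context: fields are implicit and accessed by string key.
legacy_ctx = {"env_step": 0, "train_iter": 0}
legacy_ctx["env_step"] += 64

# Dataclass-style context: fields are declared once, with types and defaults,
# and accessed as attributes.
ctx = SketchContext()
ctx.env_step += 64
ctx.train_iter += 1

# A dataclass can still be converted to a plain dict when needed (e.g. logging).
ctx_as_dict = asdict(ctx)
print(ctx_as_dict)
```

Code written against a dict-style context (`ctx["env_step"]`) would need to switch to attribute access (`ctx.env_step`) under this kind of change.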
Env
- add modified gym-hybrid including moving, sliding and hardmove (#505) (#519)
- add evogym support (#495) (#527)
- add save_replay_gif option (#506)
- adapt minigrid_env and related config to latest MiniGrid v2.0.0 (#500)
Algorithm
- add pcgrad optimizer (#489)
- add some features in MLP and ResBlock (#511)
- delete mcts related modules (#518) (we will release an MCTS repo in the future)
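
For readers unfamiliar with the PCGrad idea behind the optimizer listed above, the following is a minimal pure-Python sketch of the gradient projection step. It is not the implementation added in #489; the function names `dot` and `pcgrad_combine` are hypothetical, and gradients are represented as flat lists of floats for simplicity.

```python
import random


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def pcgrad_combine(grads, seed=0):
    """PCGrad-style sketch: project away conflicting components, then sum.

    grads: list of per-task gradients, each a flat list of floats.
    """
    rng = random.Random(seed)
    projected = [list(g) for g in grads]  # working copies, one per task
    for i, g_i in enumerate(projected):
        others = [j for j in range(len(grads)) if j != i]
        rng.shuffle(others)  # other tasks are visited in random order
        for j in others:
            g_j = grads[j]  # always project against the ORIGINAL gradient
            inner = dot(g_i, g_j)
            if inner < 0:  # negative inner product means the tasks conflict
                scale = inner / dot(g_j, g_j)
                for k in range(len(g_i)):
                    g_i[k] -= scale * g_j[k]
    # Combined update direction: sum of the projected task gradients.
    return [sum(col) for col in zip(*projected)]


task_grads = [[1.0, 0.0], [-1.0, 1.0]]  # the two tasks conflict (dot = -1)
combined = pcgrad_combine(task_grads)
print(combined)  # [0.5, 1.5]
```

In this toy case a plain sum of the two gradients would give `[0.0, 1.0]`; after projection, each task's gradient no longer has a component opposing the other task.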
Enhancement
- add wandb middleware and demo (#488) (#523) (#528)
- add new properties in Context (#499)
- add single env policy wrapper for policy deployment (demo)
- add custom model demo and doc (doc)
Fix
- fix build logger args and unittests (#522)
- fix total_loss calculation in PDQN (#504)
- fix save gif function bug
- fix level sample unittest bug
Style
- update contact email address (#503)
- polish env log and resblock name
- add details button in readme
New Repo
- DI-1024: Deep Reinforcement Learning + 1024 Game
Contributors: @PaParaZz1 @puyuan1996 @karroyan @hiha3456 @davide97l @Weiyuhong-1998 @zjowowen @norman26625