Release v0.1.8
What's Changed
- Support the maximize parameter for adam when dtype is torch.half by @alphaGem in #35
- add iter to make TransformerBlockList Iterable by @MayDomine in #37
- Support pytorch 1.12.0 #38
- Set default rank and world_size when bmtrain is not initialized. #38
New Contributors
Full Changelog: 0.1.7...0.1.8