PyTorch-based key-value memory network, based on the code released
The L2 regularizer on the loss is not implemented, because there is no standard implementation and PyTorch's Adagrad already performs a soft L2.
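As a rough illustration of the "soft L2" remark, the sketch below shows one Adagrad step in plain Python with an L2 penalty folded into the gradient. It assumes the remark refers to the `weight_decay` argument of `torch.optim.Adagrad`, which defaults to 0 and only has an effect when set to a nonzero value. The function `adagrad_step` and its scalar-only form are hypothetical and written for illustration; this is not the code used in this repository.

```python
import math


def adagrad_step(param, grad, state_sum, lr=0.01, weight_decay=0.0, eps=1e-10):
    """One Adagrad update for a single scalar parameter (illustrative sketch)."""
    # Fold the L2 penalty into the gradient:
    # d/dw (weight_decay / 2 * w^2) = weight_decay * w
    grad = grad + weight_decay * param
    # Accumulate the squared gradient.
    state_sum = state_sum + grad * grad
    # Scale the step by the root of the accumulated squares.
    param = param - lr * grad / (math.sqrt(state_sum) + eps)
    return param, state_sum


# Even with a zero loss gradient, the decay term pulls the weight toward zero.
w, s = adagrad_step(1.0, 0.0, 0.0, lr=0.1, weight_decay=0.1)
```

With a real model, the comparable setup would presumably be `torch.optim.Adagrad(model.parameters(), lr=..., weight_decay=...)`, where `model` and the values are placeholders.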