Deep learning of an embedding mapping using t-SNE as a loss function on top of a 3-hidden-layer neural network. Use pytorch !
learn a direct deep embedding using a DNN with t-SNE as loss function (batch version of computing P)
Instead of computing global P, a local P is computed for each mini batch. This does work, it's promising in terms of scaling up.