Sparse and structured attention mechanisms

Efficient implementation of structured, sparsity-inducing attention mechanisms: fusedmax, oscarmax, and sparsemax.
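
These mechanisms are designed as drop-in replacements for softmax when computing attention weights. A minimal sketch of that use (the scores and values tensors and their shapes here are hypothetical; the Fusedmax call follows the usage example further below):

import torch
import torchsparseattn

# hypothetical attention scores for a single sequence of length 4
scores = torch.tensor([1.0, 0.5, 2.0, 1.5], dtype=torch.double)
lengths = torch.tensor([4])  # true (unpadded) sequence length

# fusedmax maps scores to sparse attention weights on the simplex
# (nonnegative, summing to 1)
fusedmax = torchsparseattn.Fusedmax(alpha=0.1)
weights = fusedmax(scores, lengths)

# hypothetical value vectors; the context is their convex combination
values = torch.randn(4, 8, dtype=torch.double)
context = weights @ values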

Note: If you are just looking for sparsemax, I recommend the implementation in OpenNMT-py.

Currently available for PyTorch >= 0.4.1 (for older PyTorch versions, use a previous release of this package). Requires Python >= 2.7, Cython, NumPy, and SciPy.
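
Judging by the PyPI badge in the original README, the package is published on PyPI; assuming the distribution name matches the import name used below, installation would be:

pip install torchsparseattn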

Usage example:

In [1]: import torch
In [2]: import torchsparseattn
In [3]: a = torch.tensor([1, 2.1, 1.9], dtype=torch.double)
In [4]: lengths = torch.tensor([3])
In [5]: fusedmax = torchsparseattn.Fusedmax(alpha=.1)
In [6]: fusedmax(a, lengths)
Out[6]: tensor([0.0000, 0.5000, 0.5000], dtype=torch.float64)
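
Oscarmax and sparsemax are exposed through the same interface. A minimal sketch, assuming Oscarmax takes the same alpha regularization weight as Fusedmax and Sparsemax takes no parameters (check the module signatures in the source if these differ):

In [7]: sparsemax = torchsparseattn.Sparsemax()
In [8]: sparsemax(a, lengths)
In [9]: oscarmax = torchsparseattn.Oscarmax(alpha=.1)
In [10]: oscarmax(a, lengths)

In each case, lengths holds the true length of each sequence in the (possibly padded) batch, so the projection only covers valid positions.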

For details, check out our paper:

Vlad Niculae and Mathieu Blondel. A Regularized Framework for Sparse and Structured Neural Attention. In: Proceedings of NIPS, 2017. https://arxiv.org/abs/1705.07704

See also:

André F. T. Martins and Ramón Fernandez Astudillo. From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification. In: Proceedings of ICML, 2016. https://arxiv.org/abs/1602.02068

X. Zeng and M. Figueiredo. The ordered weighted L1 norm: Atomic formulation, dual norm, and projections. arXiv eprint, 2014. http://arxiv.org/abs/1409.4271
