FORK NOTE

This fork removes all embedding specific code and generalizes the models to work more seamlessly with any sequential input. It also introduces a dedicated output dimension for the xLSTM model.

As described in the paper, the goal was to challenge transformers, hence the models were meant to be used with some kind of embedding.

Anyway, as the core of the xLSTM are still LSTM cells, any sequential input should work.

If this really makes sense is another question, but I wanted to try it out.

xLSTM barebone in PyTorch Lightning

This repo contains the unofficial implementation of xLSTM model as introduced in Beck et al. (2024). This repo is developed mainly for didactic purposes to spell out the details of a modern Long-Short Term Memory with competitive performances against modern Transformers or State-Space models (e.g. Mamba).

Usage

To train the model, simply run the following command:

from xlstm import xLSTM
import torch

batch_size = 32
seq_len = 100
inp_dim = 16

xlstm = xLSTM(
            num_layers = 2,
            signature = (7, 1),
            inp_dim= inp_dim,
            head_dim= 8,
            head_num= 4,
            out_dim= 24,
            p_factor= (2, 4/3),
            ker_size = 4,
            only_last = False
        )


seq = torch.randn(batch_size, seq_len, inp_dim)
out = xlstm(seq)

Requirements

Code was tested with Python 3.11+. To install the required dependencies simply run pip install -r requirements.txt.

torch==2.3.0
PyYAML==6.0.1
einops==0.8.0
lightning==2.2.4
setuptools==69.5.1

Citations

@article{beck2024xlstm,
  title={xLSTM: Extended Long Short-Term Memory},
  author={Beck, Maximilian and P{\"o}ppel, Korbinian and Spanring, Markus and Auer, Andreas and Prudnikova, Oleksandra and Kopp, Michael and Klambauer, G{\"u}nter and Brandstetter, Johannes and Hochreiter, Sepp},
  journal={arXiv preprint arXiv:2405.04517},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
test		test
xlstm		xlstm
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FORK NOTE

xLSTM barebone in PyTorch Lightning

Usage

Requirements

Citations

About

Releases

Packages

Languages

dmnkf/x-lstm

Folders and files

Latest commit

History

Repository files navigation

FORK NOTE

xLSTM barebone in PyTorch Lightning

Usage

Requirements

Citations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages