This project develops a temporal Vision Transformer (ViT) using the MNIST dataset.
To set up the environment, run the following command:
source environments/setup_envs.sh
We download and preprocess the MNIST dataset, applying a sliding window over the images to create a temporal dataset. We then train a temporal ViT model on this dataset.
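
As a rough illustration of the windowing step, here is a minimal sketch in Python with NumPy. It assumes one possible reading of the description: a full-height window slides left to right across each 28x28 image, and each window position becomes one frame of a sequence. The function name `image_to_sequence` and the `window_width` and `stride` values are hypothetical and not taken from this project's code; the actual preprocessing may differ (for example, it may slide over a sequence of images rather than within one image).

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def image_to_sequence(image, window_width=14, stride=2):
    """Slide a full-height window left to right across a 2-D image.

    Hypothetical helper: window_width and stride are illustrative defaults.
    Returns an array of shape (num_frames, height, window_width).
    """
    # Windows along the column axis: shape (height, num_positions, window_width)
    windows = sliding_window_view(image, window_width, axis=1)
    # Keep every `stride`-th position and move the time axis to the front
    frames = windows[:, ::stride, :].transpose(1, 0, 2)
    # Copy so the result is a normal, writable, contiguous array
    return np.ascontiguousarray(frames)


if __name__ == "__main__":
    # Stand-in for a single MNIST digit (real data would come from the dataset)
    fake_digit = np.random.rand(28, 28).astype(np.float32)
    sequence = image_to_sequence(fake_digit)
    print(sequence.shape)
```

Under these assumptions, frame `t` is the crop `image[:, t * stride : t * stride + window_width]`, so each image becomes a short sequence that a temporal model can consume one frame at a time.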