This is a simple program for training and testing a Vision Transformer (ViT). Key requirements: torch, torchvision, and timm.
I have included 5 categories from the CUB classification dataset for simple training. You can train on your own dataset by arranging its file directory in the same structure.
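Before training on your own data, it can help to check that the directory looks the way you expect. The sketch below assumes a layout with one subfolder per class, each holding that class's images; this layout, the helper name `count_images_per_class`, and the list of image extensions are my assumptions for illustration, so compare them against the included CUB folders first.

```python
from pathlib import Path

# Assumed image extensions; adjust to match your own files.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def count_images_per_class(root):
    """Return {class_folder_name: image_count} for each subfolder of root.

    Assumes one subfolder per class (hypothetical layout, check the repo).
    """
    counts = {}
    for class_dir in sorted(Path(root).iterdir()):
        if class_dir.is_dir():
            counts[class_dir.name] = sum(
                1
                for f in class_dir.iterdir()
                if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts
```

Calling `count_images_per_class("path/to/your/train")` should list every class folder with a non-zero count; an empty or missing class usually points to a misplaced folder.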
The num-worker setting is zero for running on CPU; I suggest increasing it when you switch to GPU.
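One possible way to choose the worker count is sketched below. The helper `pick_num_workers` and the cap of 4 are hypothetical and not part of this program; the returned value is meant to be passed as the `num_workers` argument of `torch.utils.data.DataLoader`.

```python
import os


def pick_num_workers(use_gpu, max_workers=4):
    """Return 0 for CPU runs, otherwise a small positive worker count.

    max_workers=4 is an arbitrary starting point (assumption); tune it
    for your machine.
    """
    if not use_gpu:
        return 0  # load data in the main process, as in the CPU default
    # os.cpu_count() may return None, so fall back to a single worker.
    return min(max_workers, os.cpu_count() or 1)
```

For example, `pick_num_workers(torch.cuda.is_available())` keeps the CPU default of zero and only adds worker processes when a GPU is present.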
I have also included 5 pictures for testing the model. You should change the class-dict when you use your own dataset.
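As a rough illustration of what updating the class-dict involves, the sketch below builds an index-to-name mapping and uses it to label a prediction. It assumes the class-dict maps integer indices to class names and that indices follow the sorted order of the class folder names (the convention torchvision's `ImageFolder` uses); the actual format in this program may differ, and the helper names and class names here are hypothetical.

```python
def build_class_dict(class_names):
    """Map index -> class name, with indices assigned in sorted name order."""
    return {index: name for index, name in enumerate(sorted(class_names))}


def predict_label(scores, class_dict):
    """Return the class name whose score is highest.

    scores is a plain list of per-class scores, e.g. model output
    converted with .tolist().
    """
    best_index = max(range(len(scores)), key=lambda i: scores[i])
    return class_dict[best_index]


# Hypothetical class names; replace them with your own folder names.
class_dict = build_class_dict(["class_b", "class_a", "class_c"])
label = predict_label([0.1, 0.7, 0.2], class_dict)
```

The number of entries in the class-dict should match the number of classes the model was trained on, otherwise the lookup can fail or return the wrong name.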