ViT-Truck-Images This is a image classification task using vision transformer. Dataset Truck images with YOLO boundry box coordinates file (x,y,w,h).