[About the Model]ViT → Image Patching However, it’s about Multi-axis Visual Transformer (MaxVit) [About the Code] [Result] Size Resolution was the worst issue to consider..??