Computer vision has changed substantially over recent decades, advancing from basic image processing tasks to enabling sophisticated technologies such as autonomous driving and augmented reality. Recent progress has been driven chiefly by deep learning, which has brought considerable improvements in image recognition, object detection, and semantic segmentation. Transformative approaches, including Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs),...