News

This repository contains an implementation of the Vision Transformer (ViT) architecture for object detection. The model is designed to detect objects in images and predict their bounding boxes and ...
Abstract: Several factors may compromise the effectiveness of algorithms for relatively localizing specific objects ... on instantaneous detection may not be reliable in such application scenarios. In ...
This repository holds the implementation of YOLOX-ViT ... object detection in side-scan sonar images based on transformer-yolov5) proposed a YOLOv5-TR for containers and shipwreck detection. Aubard et ...