Introduced in the paper "Roboflow 100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models", RF100-VL is a large-scale collection of 100 multi-modal datasets with diverse concepts ...
Abstract: Remote sensing image object detection (RSIOD) aims to identify and locate specific objects within satellite or aerial imagery. However, there is a scarcity of labeled data in current RSIOD ...
Abstract: Multimodal object detection is crucial for autonomous driving and intelligent surveillance. However, existing methods face critical challenges, such as cross-modal semantic misalignment ...