Several popular drone datasets on GitHub

With the explosive growth of the low-altitude economy, the widespread adoption of drones has brought increasingly prominent safety risks, making counter-drone technology a fiercely contested domain in the security field. However, confronted with detection challenges such as “extremely small targets, ultra-fast flight speeds, and complex backgrounds,” algorithm performance is often bottlenecked by data scarcity. In the era of deep learning, high-quality data is the fuel that powers progress. To address the resource-hunting difficulties faced by developers worldwide, this article systematically reviews mainstream publicly available drone detection datasets globally, encompassing multimodal resources from ground-to-air visible light (RGB) and infrared thermal imaging to radio frequency signals. It provides an in-depth analysis of their applicable scenarios and core characteristics, helping you shatter data silos and rapidly build high-precision drone detection systems.

VisDrone-Dataset

VisDrone is a large-scale multi-scene drone vision benchmark, offering over 2.6 million finely annotated boxes with rich attributes to advance drone-based vision. This is not a drone image dataset, please note

Quantity Information:

Video clips: 288
Total frames: 261,908 frames
Static images: 10,209
Annotated bounding boxes: Over 2.6 million
Covered cities: 14 Chinese cities

Core Advantages:

Large scale with fine-grained annotations: Massive data volume, providing over 2.6 million meticulously annotated target bounding boxes.
High diversity: Covers various drone models, diverse scenarios (urban and rural), and a wide range of weather and lighting conditions.
Comprehensive coverage: Includes rich target categories (pedestrians, vehicles, bicycles, etc.) and scenes with varying densities (from sparse to crowded).
Rich attributes: Additionally provides key attributes such as scene visibility, object categories, and occlusion, enhancing the practical value of the data.
Application-oriented: Specifically designed to advance the integration of computer vision and drone technology, supporting a variety of visual tasks.

DroneDetectionThesis

This is a multimodal drone detection dataset designed to advance the development and evaluation of counter-drone systems. It integrates infrared (IR), visible light, and audio data, enabling comprehensive sensor fusion for robust drone identification in complex environments.
Quantity Information:

Video clips: 650 (365 IR + 285 visible)
Audio clips: 90
Total annotated images (extracted from videos): 203,328
Video labels: Airplane, Bird, Drone, Helicopter
Audio labels: Drone, Helicopter, Background

Core Advantages:

Multimodal data: Integrates infrared (IR), visible light, and audio modalities, ideal for training and evaluating comprehensive drone detection sensors and systems.
Rich labeling: Provides detailed video and audio annotations with diverse target categories (including drones, helicopters, airplanes, birds, and background).
Large-scale image extraction: Yields over 200,000 annotated images when frames are extracted from videos, enabling robust model training.
Fully open access: Free to download, use, and modify; includes annotation files in .mat format (generated via MATLAB Video Labeler).
User-friendly resources: Accompanied by a detailed video description file (“Video_dataset_description.xlsx”) and a script with instructions and examples (“Create_a_dataset_from_videos_and_labels.m”) for seamless dataset creation and utilization.

DroneAudioDataset

A specialized drone audio dataset featuring indoor-recorded propeller noise, augmented with diverse environmental sounds (ESC-50), white noise (Speech Commands), and custom silence clips. Organized for binary/multiclass classification, it supports deep learning-based drone detection. Tied to the paper Audio Based Drone Detection and Identification using Deep Learning by Sara Al-Emadi.

Quantity Information:

Audio clips: Drone propeller noise (indoor recordings) + artificially augmented with random noise
Noise categories:
- “Unknown” clips sourced from ESC-50 (environmental sounds)
- White noise from Speech Commands dataset
- Custom-created silence clips for class balance
Organized in binary and multiclass folder structures

Core Advantages:

Specialized drone audio focus: Captures real indoor drone propeller sounds, enhanced via controlled augmentation for robust deep learning training.
Rich noise diversity: Integrates open-source environmental noise (ESC-50), speech-derived white noise, and synthetic silence to simulate real-world acoustic variability.
Research-backed: Directly tied to the conference paper “Audio Based Drone Detection and Identification using Deep Learning” by Sara Al-Emadi.
Open and extensible: Leverages established public datasets (ESC-50, Speech Commands) and includes custom silence augmentation for balanced classification.
Ready for acoustic AI: Structured for binary/multiclass drone detection tasks, ideal for developing noise-robust audio-based counter-drone systems.

dasmehdix/drone-dataset

Patch Antenna, also called Microstrip Antenna, is a compact, planar antenna widely used in C-UAS detection and portable drone jammers.

Key Uses in C-UAS:

Detection: Integrates into handheld drone detectors and wearable RF scanners to monitor 2.4GHz/5.8GHz ISM bands.
Jamming: Arrays of high-power patches form lightweight jamming panels in anti-drone guns and vehicle-mounted systems.

Advantages:

Ultra-low profile (<1cm thick)
Lightweight, conformal to surfaces
Easy to array for sector coverage
Cost-effective mass production

Disadvantages:

Narrow bandwidth (~5–10%)
Low gain (5–8 dBi) vs. Yagi/LPDA
Sensitive to ground plane size
Poor efficiency at <1GHz

Best for compact multi-band C-UAS sensors and urban drone defense.

In summary, GitHub hosts a rich variety of drone-related datasets, spanning multiple modalities: aerial imagery/video datasets (e.g., VisDrone, UAVDT), flight trajectory datasets (for prediction and anomaly detection), high-resolution drone image datasets (e.g., DroneVehicle, UAV123), and audio datasets (e.g., indoor-recorded propeller noise with augmentation). Each modality drives distinct technical approaches: image data relies on object detection and tracking; trajectory data enables behavioral analysis and intent inference—a key non-traditional counter-drone technique; audio datasets focus on acoustic feature extraction and deep classification, ideal for silent or nighttime detection. Developers can selectively integrate these resources to build multimodal, high-precision anti-drone systems tailored to real-world security needs.