Abstract

The wide range of potential real-world applications (e.g. smart city, traffic management, crash detection) for the Multi- Camera Vehicle Tracking (MCVT) problem makes it a worthwhile research topic in the computer vision field. In general, there are two approaches to address the MCVT problem: the global approach, which processes detections to create unified tracks directly, and the more commonly used two-step hierarchical approach, which involves separate stages for intracamera and inter-camera tracking. Typically, the two-step hierarchical MCVT approach can be further divided into four modules: object detection, feature extraction, single camera tracking and multi camera tracking. Each module plays a distinct role in enhancing the overall effectiveness of MCVT solutions. To date, there has only been limited research thoroughly examining how these modules individually affect the overall tracking performance. This paper presents an ablation study on the MCVT problem as a case study using the CityFlow V2 dataset. Using a benchmark MCVT framework, various state-of-art algorithms for each module have been implemented back-to-back to assess the impact of these algorithms. The effectiveness of these algorithms is assessed through two key metrics: IDF1 score performance and computational complexity. The study provides a comprehensive comparison study to understand the contributions of different algorithms in each module. Among all those modules, automatically generated spatial-temporal constraints maintains the computational efficiency while also contribute a lot on IDF1 score performance which could be the focusing point for future research on real-time real-world application
Original languageEnglish
Title of host publicationProceedings Volume 13517, Seventeenth International Conference on Machine Vision (ICMV 2024)
ISBN (Electronic)9781510688285
DOIs
Publication statusPublished - 24 Feb 2025

Fingerprint

Dive into the research topics of 'Ablation study for multicamera vehicle tracking using CityFlow dataset'. Together they form a unique fingerprint.

Cite this