The plug-in comes at a time when the UAE’s smart city market is projected to grow from $216.51 million in 2025 to $328.19 million by 2030, reflecting an 8.67% CAGR. It aligns with Dubai’s push to leverage AI for real-time city operations, public safety, and urban efficiency.
Milestone Systems, a global leader in data-driven video technology, has announced the upcoming launch of a generative AI-powered video analytics plug-in for its XProtect® video management software, developed in collaboration with NVIDIA. The tool is designed to help operators contextualize alarms, automate video review, and reduce false positives, potentially cutting operator alarm fatigue by up to 30%. General availability is expected later this year.
The plug-in comes at a time when the UAE’s smart city market is projected to grow from $216.51 million in 2025 to $328.19 million by 2030, reflecting an 8.67% CAGR. It aligns with Dubai’s push to leverage AI for real-time city operations, public safety, and urban efficiency.
Modern video systems capture massive volumes of data, but reviewing footage remains time-consuming. Milestone’s solution addresses this challenge by automatically summarizing, contextualizing, and validating video content in real time. Key features include automated incident reports, event validation to reduce false positives, and contextual bookmark summaries that provide natural-language summaries of footage for faster triage. The plug-in integrates seamlessly with the XProtect rule engine and can be deployed on-premises or in the cloud to ensure flexibility and compliance.
Louise Bou Rached, Director for Middle East, Turkey, and Africa at Milestone Systems, noted that the solution supports the UAE’s ambition to become the world’s smartest and safest city by 2031. It enhances real-time situational awareness, improves response times, and optimizes city operations including traffic management and public safety while maintaining ethical and regulatory standards.
Built on Milestone’s Hafnia Vision Language Model (VLM), trained on 75,000 hours of ethically sourced real-world video data, the platform leverages NVIDIA Cosmos Curator for data preparation and NVIDIA Cosmos Reason VLM for processing. Thomas Jensen, CEO of Milestone Systems, emphasized that the plug-in enables cities and organizations to unlock new levels of operational efficiency and insight, making advanced video intelligence widely accessible and responsibly deployed.
Early adopters, including the cities of Genoa, Italy, and Dubuque, Iowa, are already exploring the solution to enhance traffic and safety management. In addition, Milestone is introducing VLM-as-a-Service via APIs, allowing developers and partners to build their own generative AI solutions on any video management platform, further fostering ecosystem innovation.

