AI-Driven Multi-Robot Systems for Autonomous Maintenance

Leveraging  robotics and AI for automation

While robots promise enhanced efficiency and precision, complex maintenance tasks often require multiple, specialised systems from different vendors. Reply has engineered an advanced, AI-driven approach that orchestrates these diverse robotic systems, enabling them to collaborate seamlessly on unified maintenance operations.

The Criticality of Timely and Coordinated Maintenance

The effectiveness of any maintenance process hinges on its timeliness. The total reaction time, from the initial detection of an anomaly to the final execution of a remedy, is a critical variable that can have a substantial impact on operational continuity and safety. This process can be broken down into three principal phases: the alert, which signifies the detection of an issue; the decision, where a course of action is determined; and the action itself, which involves executing the corrective measures. In any environment, be it a production facility or a secure perimeter, minimising the time spent in each of these phases is essential, as delays in either detection or reaction can lead to significant consequences.

The Integration Hurdle in Multi-Vendor Environments

Resolving complex maintenance issues often requires a sequence of distinct actions performed by specialised robotic systems. An effective response might necessitate the use of various robots, including patrol drones, four-legged robots, autonomous mobile robots (AMRs) for transport, and robotic arms for manipulation. A significant barrier arises from the fact that these robots are typically sourced from different vendors and operate on proprietary, vertical software stacks designed for intra-fleet communication only. This makes it incredibly difficult to integrate and synchronise a diverse fleet, preventing them from collaborating in an orderly, sequential manner to complete a single, overarching task. The challenge lies in creating a unified system that can overcome these silos and manage disparate robots as a single, cohesive unit.

A Unified Architecture for Diverse Robotic Fleets

At the heart of Reply’s approach is the Robotic Coordinator, engineered to orchestrate and synchronise heterogeneous robotic fleets from multiple vendors. Built upon the scalable ROS 2 framework, the coordinator provides a collection of reusable libraries and tools that enable seamless interoperability. Its architecture includes several key components: the Coordinator Core, which prevents traffic conflicts and negotiates passage between robots; the Traffic Editor, for creating shared maps and defining traffic patterns; and individual Fleet Adapters, which connect each vendor's specific API to the central core, ensuring compatibility without overhauling the native robot software. A comprehensive Web interface allows operators to monitor the status of all robotic assets on a 2D floor plan. This architecture makes it possible for different robots to collaborate flawlessly on a complex maintenance workflow.

Advanced Perception through On-Edge Computer Vision

Reply’s approach places strong emphasis on rapid and accurate issue detection. For example, a use case focuses on border patrol, where a wheeled Jackal robot is tasked with conducting automated inspections using computer vision. Powerful hardware like 3D LiDAR for robust autonomous navigation has been retrofitted with a camera and an integrated NVIDIA Jetson Orin board. This approach enables AI models to be executed directly on the robot, thanks to the edge computing. This on-edge inference capability is crucial, as it eliminates reliance on the cloud, significantly reduces latency, and improves data privacy. Furthermore, it ensures reliable performance even in environments with limited or unstable network connectivity, as the robot only needs to transmit concise, vital information (such as the type and geolocation of a detected anomaly) rather than a continuous video stream.

Leveraging Generative AI for Robust Model Training

A common obstacle in developing reliable computer vision models is the scarcity of training data for highly specific or rare events, such as finding images of a broken fence for a perimeter security use case. To overcome this, Reply’s approach leverages Generative AI to create customised synthetic data tailored to these unique needs. By using models like Fluxnel, it is possible to generate a large volume of realistic images depicting rare or complex scenarios, such as holes in a chain-link fence. Although this process requires filtering, with approximately 4-5% of the generated images proving suitable for training, it effectively eliminates the challenges and high costs associated with real-world data collection. This approach enhances the robustness and performance of the AI models in their targeted contexts.

Natural Language for Simplified Human-Robot Interaction

To make this advanced technology accessible to all operators, regardless of their technical expertise, the system includes an AI Task Decoder. This intelligent agent functions as an intuitive interface, allowing users to issue commands in natural language. The system's Large Language Model (LLM) then decodes this request, identifying the core task, the objects involved, and the required sequence of actions like 'pick' and 'drop'. Finally, the agent serialises this information into a structured JSON format compatible with the Robotic Coordinator, which then dispatches the appropriate robots to execute the task. This natural language interface streamlines the entire operational experience, making interaction with the complex multi-robot system effortless.

Enhancing Efficiency Across Industries

The flexibility of this AI-driven multi-robot system allows its application across a wide array of use cases, far-beyond perimeter security. In manufacturing, it can be deployed to reduce bottlenecks and enhance productivity by automating intralogistics and material handling tasks. In airport environments, the system can lower operational costs and improve security through automated perimeter surveillance, detecting unauthorised entry of people or animals and ensuring fence integrity. For the retail sector, it offers a means to support increased throughput and maintain operational flexibility in warehouses and distribution centres. By coordinating multiple robot fleets to optimise traffic flow and task allocation, Reply’s approach reduces costly downtime and allows human resources to be allocated more efficiently.

The Path Forward: Expanding Capabilities

Reply’s experts are now focused on continuous expansion and enhancement. A key objective is to broaden its hardware-agnostic capabilities by integrating an even wider range of robots from various vendors, allowing for greater customisation to meet specific use-case requirements. Another priority is to deepen the system's interaction with its environment by extending its communication protocols to interface with smart infrastructure like automatic doors, gates, and elevators, thereby enabling greater autonomy. Furthermore, there are plans to continually expand the catalogue of available computer vision models to address new detection challenges and to adapt the versatile system for new industrial contexts beyond manufacturing and surveillance, building new references in other domains.

Concept Reply is an IoT software developer specializing in the research, development and validation of innovative solutions and supports its customers in the automotive, manufacturing, smart infrastructure and other industries in all matters relating to the Internet of Things (IoT) and cloud computing. The goal is to offer end-to-end solutions along the entire value chain: from the definition of an IoT strategy, through testing and quality assurance, to the implementation of a concrete solution.