: Scenarios were built in Unity 3D to mimic real-world construction tasks, such as collaborative excavation.
: The video frames were used to train YOLOv7 (You Only Look Once) and Mask R-CNN models to detect objects and estimate distances accurately in real time.
: The system distinguishes between workers, excavators, and forklifts.
: The system significantly decreased the number of "nuisance" alarms compared to static sensors, because it recognizes when a worker or another machine is approaching safely for collaboration.
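
The difference between a static sensor and a context-aware alarm can be illustrated with a minimal sketch. This is not the system's actual decision logic, which the text does not describe. The `Detection` fields, the use of closing speed as the "safe approach" signal, and every threshold value are hypothetical assumptions chosen only for illustration.

```python
from dataclasses import dataclass

# All values below are hypothetical illustration values, not from the source.
STATIC_RADIUS_M = 5.0      # fixed radius a static proximity sensor might use
HARD_STOP_RADIUS_M = 1.5   # assumed radius inside which an alarm always fires

# Assumed per-class closing speed (m/s) still treated as a controlled approach.
SAFE_APPROACH_SPEED_MPS = {
    "worker": 0.5,
    "excavator": 1.0,
    "forklift": 1.0,
}


@dataclass
class Detection:
    label: str                # "worker", "excavator", or "forklift"
    distance_m: float         # estimated distance to the monitored machine
    closing_speed_mps: float  # positive when the object is approaching


def static_alarm(det: Detection) -> bool:
    """Static-sensor behaviour: alarm on anything inside a fixed radius."""
    return det.distance_m < STATIC_RADIUS_M


def context_aware_alarm(det: Detection) -> bool:
    """Alarm only when the approach does not look like safe collaboration."""
    if det.distance_m < HARD_STOP_RADIUS_M:
        return True   # too close regardless of context
    if det.distance_m >= STATIC_RADIUS_M:
        return False  # outside the monitored zone
    # Inside the zone: a slow, controlled approach is treated as collaboration.
    return det.closing_speed_mps > SAFE_APPROACH_SPEED_MPS[det.label]


slow_worker = Detection("worker", distance_m=3.0, closing_speed_mps=0.2)
fast_forklift = Detection("forklift", distance_m=3.0, closing_speed_mps=2.0)
```

Under these assumptions, a worker walking slowly toward the machine at 3 m triggers the static alarm but not the context-aware one, while a fast-approaching forklift at the same distance triggers both. That is the kind of case in which nuisance alarms would be avoided.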