- Question Bank
- Question #535118DC
Research Question
How can multi-agent sensor fusion between port vehicles, wearable pedestrian devices, and fixed infrastructure reduce false negatives in collision avoidance systems under heavy occlusion at container terminals?
AI Novelty Assessment
High Novelty
This research question explores a largely uncharted area with significant potential for new discoveries.
Detailed Analysis
Related work covers occlusion-aware planning, RSU-vehicle fusion, cooperative perception, and one port-specific truck-pedestrian risk model, but I found little on fully integrated multi-agent fusion across port vehicles, fixed infrastructure, and wearable pedestrian devices specifically for heavily occluded container terminals. The gap is not collision avoidance itself, but the port-specific architecture and validation setup.
Related Academic Papers
10 papers found relevant to this research question. Each paper is scored by how closely it relates to the question.
Seung-oh Son, Juneyoung Park, C. Oh, C. Yeom (2021)
Abstract
This study develops an algorithm to detect the risk of collision between trucks (i.e., yard tractors) and pedestrians (i.e., workers) in the connected environment of the port. The algorithm consists of linear regression-based movable coordinate predictions and vertical distance and angle judgments considering the moving characteristics of objects. Time-to-collision for port workers (TTCP) is developed to reflect the characteristics of the port using the predictive coordinates. This study assumes the connected environment in which yard tractors and workers can share coordinates of each object in real time using the Internet of Things (IoT) network. By utilizing microtraffic simulations, a port network is implemented, and the algorithm is verified using data from simulated workers and yard trucks in the connected environment. The risk detection algorithm is validated using confusion matrix. Validation results show that the true-positive rate (TPR) is 61.5∼98.0%, the false-positive rate (FPR) is 79.6∼85.9%, and the accuracy is 72.2∼88.8%. This result implies that the metric scores improve as the data collection cycle increases. This is expected to be useful for sustainable transportation industry sites, particularly IoT-based safety management plans, designed to ensure the safety of pedestrians from crash risk by heavy vehicles (such as yard tractors).
Why this paper is relevant
Directly models truck-pedestrian collision risk in a connected port environment, though without wearable and infrastructure fusion together.
Shaohong Ding, Yi Xu, Qian Zhang, Jinxin Yu, Teng Sun, Juanli Ni, Shuyue Shi, Xiangcun Kong, Ruoyu Zhu, Liming Wang, Pengwei Wang (2023)
Abstract
Road obstacle detection is an important component of intelligent assisted driving technology. Existing obstacle detection methods ignore the important direction of generalized obstacle detection. This paper proposes an obstacle detection method based on the fusion of roadside units and vehicle mounted cameras and illustrates the feasibility of a combined monocular camera inertial measurement unit (IMU) and roadside unit (RSU) detection method. A generalized obstacle detection method based on vision IMU is combined with a roadside unit obstacle detection method based on a background difference method to achieve generalized obstacle classification while reducing the spatial complexity of the detection area. In the generalized obstacle recognition stage, a VIDAR (Vision-IMU based identification and ranging) -based generalized obstacle recognition method is proposed. The problem of the low accuracy of obstacle information acquisition in the driving environment where generalized obstacles exist is solved. For generalized obstacles that cannot be detected by the roadside unit, VIDAR obstacle detection is performed on the target generalized obstacles through the vehicle terminal camera, and the detection result information is transmitted to the roadside device terminal through the UDP (User Data Protocol) protocol to achieve obstacle recognition and pseudo-obstacle removal, thereby reducing the error recognition rate of generalized obstacles. In this paper, pseudo-obstacles, obstacles with a certain height less than the maximum passing height of the vehicle, and obstacles with a height greater than the maximum passing height of the vehicle are defined as generalized obstacles. Pseudo-obstacles refer to non-height objects that appear to be “patches” on the imaging interface obtained by visual sensors and obstacles with a height less than the maximum passing height of the vehicle. VIDAR is a vision-IMU-based detection and ranging method. IMU is used to obtain the distance and pose of the camera movement, and through the inverse perspective transformation, it can calculate the height of the object in the image. The VIDAR-based obstacle detection method, the roadside unit-based obstacle detection method, YOLOv5 (You Only Look Once version 5), and the method proposed in this paper were applied to outdoor comparison experiments. The results show that the accuracy of the method is improved by 2.3%, 17.4%, and 1.8%, respectively, compared with the other four methods. Compared with the roadside unit obstacle detection method, the speed of obstacle detection is improved by 1.1%. The experimental results show that the method can expand the detection range of road vehicles based on the vehicle obstacle detection method and can quickly and effectively eliminate false obstacle information on the road.
Why this paper is relevant
Demonstrates RSU and vehicle camera fusion for obstacle detection; highly relevant to infrastructure-vehicle fusion under occlusion.
De Jong Yeong, G. Velasco-Hernández, John Barry, Joseph Walsh (2021)
Abstract
With the significant advancement of sensor and communication technology and the reliable application of obstacle detection techniques and algorithms, automated driving is becoming a pivotal technology that can revolutionize the future of transportation and mobility. Sensors are fundamental to the perception of vehicle surroundings in an automated driving system, and the use and performance of multiple integrated sensors can directly determine the safety and feasibility of automated driving vehicles. Sensor calibration is the foundation block of any autonomous system and its constituent sensors and must be performed correctly before sensor fusion and obstacle detection processes may be implemented. This paper evaluates the capabilities and the technical performance of sensors which are commonly employed in autonomous vehicles, primarily focusing on a large selection of vision cameras, LiDAR sensors, and radar sensors and the various conditions in which such sensors may operate in practice. We present an overview of the three primary categories of sensor calibration and review existing open-source calibration packages for multi-sensor calibration and their compatibility with numerous commercial sensors. We also summarize the three main approaches to sensor fusion and review current state-of-the-art multi-sensor fusion techniques and algorithms for object detection in autonomous driving applications. The current paper, therefore, provides an end-to-end review of the hardware and software methods required for sensor fusion object detection. We conclude by highlighting some of the challenges in the sensor fusion field and propose possible future research directions for automated driving systems.
Why this paper is relevant
General review of sensor and sensor-fusion technologies applicable to port collision avoidance design.
Rainer Trauth, Korbinian Moller, Johannes Betz (2023)
Abstract
Autonomous vehicles face numerous challenges to ensure safe operation in unpredictable and hazardous conditions. The autonomous driving environment is characterized by high uncertainty, especially in occluded areas with limited information about the surrounding obstacles. This work aims to provide a trajectory planner to solve these unsafe environments. The work proposes an approach combining a visibility model, contextual environmental information, and behavioral planning algorithms to predict the likelihood of occlusions and collision probabilities. Ultimately, this allows us to estimate the potential harm from collisions with pedestrians in occluded situations. The primary goal of our proposed approach is to minimize the risk of hitting pedestrians and to establish a predefined, adjustable maximum level of harm. We show several practical applications for informing a sampling-based trajectory planner about occluded areas to increase safety. In addition, to respond to possible high-risk situations, we introduce an adjustable threshold that governs the vehicle’s speed when encountering uncertain situations and strategies to maximize the vehicle’s visible area. In implementing our novel methodology, we analyzed several real-world scenarios in a simulation environment. Our results indicate that combining occlusion-aware trajectory planning algorithms and harm estimation significantly influences vehicle driving behavior, especially in risky situations. The code used in this research is publicly available as open-source software and can be accessed at the following link: https://github.com/TUM-AVS/Frenetix-Motion-Planner.
Why this paper is relevant
Addresses occlusion-aware planning and hidden-risk mitigation, supporting the false-negative problem under occlusion.
Shane Gilroy, D. Mullins, Edward Jones, Ashkan Parsi, M. Glavin (2022)
Abstract
Robust detection of vulnerable road users is a safety critical requirement for the deployment of autonomous vehicles in heterogeneous traffic. One of the most complex outstanding challenges is that of partial occlusion where a target object is only partially available to the sensor due to obstruction by another foreground object. A number of leading pedestrian detection benchmarks provide annotation for partial occlusion, however each benchmark varies greatly in their definition of the occurrence and severity of occlusion. Recent research demonstrates that a high degree of subjectivity is used to classify occlusion level in these cases and occlusion is typically categorized into 2 to 3 broad categories such as partially and heavily occluded. This can lead to inaccurate or inconsistent reporting of pedestrian detection model performance depending on which benchmark is used. This research introduces a novel, objective benchmark for partially occluded pedestrian detection to facilitate the objective characterization of pedestrian detection models. Characterization is carried out on seven popular pedestrian detection models for a range of occlusion levels from 0-99%, in order to demonstrate the efficacy and increased analysis capabilities of the proposed characterization method. Results demonstrate that pedestrian detection performance degrades, and the number of false negative detections increase as pedestrian occlusion level increases. Of the seven popular pedestrian detection routines characterized, CenterNet has the greatest overall performance, followed by SSDlite. RetinaNet has the lowest overall detection performance across the range of occlusion levels.
Why this paper is relevant
Analyzes how partial occlusion harms pedestrian detectability; relevant to reducing false negatives in terminals.
Yuan Che, Mun On Wong, Xiaowei Gao, Haoyang Liang, Yun Ye (2026)
Abstract
Autonomous driving improves traffic efficiency but presents safety challenges in complex port environments. This study investigates how environmental factors, traffic factors, and pedestrian characteristics influence interaction safety between autonomous vehicles and pedestrians in ports. Using virtual reality (VR) simulations of typical port scenarios, 33 participants completed pedestrian crossing tasks under varying visibility, vehicle sizes, and time pressure conditions. Results indicate that low-visibility conditions, partial occlusions and larger vehicle sizes significantly increase perceived risk, prompting pedestrians to wait longer and accept larger gaps. Specifically, pedestrians tended to accept larger gaps and waited longer when interacting with large autonomous truck platoons, reflecting heightened caution due to their perceived threat. However, local obstructions also reduce post-encroachment time, compressing safety margins. Individual attributes such as age, gender, and driving experience further shape decision-making, while time pressure undermines compensatory behaviors and increases risk. Based on these findings, safety strategies are proposed, including installing wide-angle cameras at multiple viewpoints, enabling real-time vehicle-infrastructure communication, enhancing port lighting and signage, and strengthening pedestrian safety training. This study offers practical recommendations for improving the safety and deployment of vision-based autonomous systems in port settings.
Why this paper is relevant
Port-specific evidence on pedestrian-AV interactions and visual constraints, but not a full multi-agent fusion stack.
D. Selvaraj, Christian Vitale, T. Panayiotou, P. Kolios, C. Chiasserini, G. Ellinas (2024)
Abstract
Intersection crossing represents one of the most dangerous sections of the road infrastructure and Connected Vehicles (CVs) can serve as a revolutionary solution to the problem. In this work, we present a novel framework that detects preemptively collisions at urban crossroads, exploiting the Multi-access Edge Computing (MEC) platform of 5G networks. At the MEC, an Intersection Manager (IM) collects information from both vehicles and the road infrastructure to create a holistic view of the area of interest. Based on the historical data collected, the IM leverages the capabilities of an encoder-decoder recurrent neural network to predict, with high accuracy, the future vehicles' trajectories. As, however, accuracy is not a sufficient measure of how much we can trust a model, trajectory predictions are additionally associated with a measure of uncertainty towards confident collision forecasting and avoidance. Hence, contrary to any other approach in the state of the art, an uncertainty-aware collision prediction framework is developed that is shown to detect well in advance (and with high reliability) if two vehicles are on a collision course. Subsequently, collision detection triggers a number of alarms that signal the colliding vehicles to brake. Under real-world settings, thanks to the preemptive capabilities of the proposed approach, all the simulated imminent dangers are averted.
Why this paper is relevant
Edge-assisted collision avoidance using infrastructure information; relevant to uncertainty reduction from external sensing.
Sukru Yaren Gelbal, Sheng Zhu, Gokul Arvind Anantharaman, Bilin Aksun Guvenc, Levent Guvenc (2023)
Abstract
Connected vehicle (CV) technology is among the most heavily researched areas in both the academia and industry. The vehicle to vehicle (V2V), vehicle to infrastructure (V2I) and vehicle to pedestrian (V2P) communication capabilities enable critical situational awareness. In some cases, these vehicle communication safety capabilities can overcome the shortcomings of other sensor safety capabilities because of external conditions such as 'No Line of Sight' (NLOS) or very harsh weather conditions. Connected vehicles will help cities and states reduce traffic congestion, improve fuel efficiency and improve the safety of the vehicles and pedestrians. On the road, cars will be able to communicate with one another, automatically transmitting data such as speed, position, and direction, and send alerts to each other if a crash seems imminent. The main focus of this paper is the implementation of Cooperative Collision Avoidance (CCA) for connected vehicles. It leverages the Vehicle to Everything (V2X) communication technology to create a real-time implementable collision avoidance algorithm along with decision-making for a vehicle that communicates with other vehicles. Four distinct collision risk environments are simulated on a cost effective Connected Autonomous Vehicle (CAV) Hardware in the Loop (HIL) simulator to test the overall algorithm in real-time with real electronic control and communication hardware.
Why this paper is relevant
Connected-vehicle collision avoidance highlights NLOS benefits relevant to occluded container-terminal settings.
Jinlong Li, Runsheng Xu, Xinyi Liu, Jin Ma, Zicheng Chi, Jiaqi Ma, Hongkai Yu (2022)
Abstract
Deep learning has been widely used in intelligent vehicle driving perception systems, such as 3D object detection. One promising technique is Cooperative Perception, which leverages Vehicle-to-Vehicle (V2V) communication to share deep learning-based features among vehicles. However, most cooperative perception algorithms assume ideal communication and do not consider the impact of Lossy Communication (LC), which is very common in the real world, on feature sharing. In this paper, we explore the effects of LC on Cooperative Perception and propose a novel approach to mitigate these effects. Our approach includes an LC-aware Repair Network (LCRN) and a V2V Attention Module (V2VAM) with intra-vehicle attention and uncertainty-aware inter-vehicle attention. We demonstrate the effectiveness of our approach on the public OPV2V dataset (a digital-twin simulated dataset) using point cloud-based 3D object detection. Our results show that our approach improves detection performance under lossy V2V communication. Specifically, our proposed method achieves a significant improvement in Average Precision compared to the state-of-the-art cooperative perception algorithms, which proves the capability of our approach to effectively mitigate the negative impact of LC and enhance the interaction between the ego vehicle and other vehicles.
Why this paper is relevant
Shows cooperative perception under lossy V2V communication, relevant to multi-agent fusion reliability.
De Jong Yeong, Krishna Panduru, Joseph Walsh (2025)
Abstract
Autonomous vehicles (AVs) rely heavily on multi-sensor fusion to perceive their environment and make critical, real-time decisions by integrating data from various sensors such as radar, cameras, Lidar, and GPS. However, the complexity of these systems often leads to a lack of transparency, posing challenges in terms of safety, accountability, and public trust. This review investigates the intersection of multi-sensor fusion and explainable artificial intelligence (XAI), aiming to address the challenges of implementing accurate and interpretable AV systems. We systematically review cutting-edge multi-sensor fusion techniques, along with various explainability approaches, in the context of AV systems. While multi-sensor fusion technologies have achieved significant advancement in improving AV perception, the lack of transparency and explainability in autonomous decision-making remains a primary challenge. Our findings underscore the necessity of a balanced approach to integrating XAI and multi-sensor fusion in autonomous driving applications, acknowledging the trade-offs between real-time performance and explainability. The key challenges identified span a range of technical, social, ethical, and regulatory aspects. We conclude by underscoring the importance of developing techniques that ensure real-time explainability, specifically in high-stakes applications, to stakeholders without compromising safety and accuracy, as well as outlining future research directions aimed at bridging the gap between high-performance multi-sensor fusion and trustworthy explainability in autonomous driving systems.
Why this paper is relevant
Survey of multi-sensor fusion and XAI in AVs; useful methodological backdrop but not port-specific.
Generate your own research questions
ChatAcademia helps researchers discover novel research questions with AI-powered analysis.
Get Started Free