Strict Safety Guarantees for RL in Mobile Robotics

Comparative Study of Safety Mechanisms | Shahna et al., arXiv:2601.00610

5
Safety Methods Compared
0.000
Best Eval Violation Rate
CBF
Best Safety Method
200
Training Episodes

Training Violation Rates

Evaluation Violation Rates

Noise Robustness: Violation Rate vs Disturbance

Safety Method Comparison

MethodTrain VREval VRAvg RewardGuarantee Level
Unconstrained0.00590.0000-111.3None
Reward Shaping0.00360.0000-113.1Probabilistic
Lagrangian0.00500.0110-106.6Soft
CBF Filter0.00590.0000-111.3Practical
Shield0.00590.0000-111.3Strict