CV

Differential Trajectory Analysis (DTA)

Discovering uncaptured failure modes in VLM agents operating without textual feedback

The Feedback Paradox

When textual feedback is removed from VLM agents, overall success drops (52.7% to 28.3%), yet named failure categories like action looping and state mismanagement DECREASE. This paradox implies the existence of failure modes not captured by existing taxonomies. DTA resolves this by identifying 3 novel failure modes that account for 49.8% of no-feedback failures.

-24.4pp
Success Rate Drop
52.7% to 28.3%
0.767
Discovery F1 Score
Precision: 1.0, Recall: 0.621
30.9%
Residual Failures Found
133 of 430 F- failures
3
Novel Modes Discovered
49.8% of F- failures

Failure Mode Rates: F+ vs F-

Failure Composition Shift

Feedback Degradation Spectrum

Failure mode rates across feedback availability levels (1.0 = full feedback to 0.0 = none). Known modes decrease monotonically while novel/residual failures increase, confirming mode replacement rather than improvement.

Feature Profiles of Failure Modes

DTA Pipeline Performance

MetricValue
F- failure episodes430
Residual (classifier)133 (30.9%)
Anomalies (Isolation Forest)109 (25.3%)
Union (residual + anomaly)200 (46.5%)
Ground-truth novel failures214
Precision1.000
Recall0.621
F1 Score0.767

Discovered Novel Failure Modes

Hallucinated Feedback

Low action entropy (0.461), moderate repetition (0.346), strongly negative reward (-5.15). Agent takes confident actions from a restricted set as if receiving progress signals, but achieves poor outcomes. F+ rate: 4.4%, F- rate: 15.2%.

Exploratory Drift

High action entropy (0.935), high observation diversity (0.527), near-zero reward (-0.07). Agent explores broadly without converging on any plan. Without feedback to confirm progress, exploration never terminates. F+ rate: 1.5%, F- rate: 21.8%.

Memoryless Reactive Collapse

Moderate entropy (0.694), short episodes (0.129), moderate negative reward (-1.52). Agent abandons multi-step planning entirely, reacting to each observation independently. F+ rate: 1.1%, F- rate: 12.3%.

Failure Mode Rates Table

Failure ModeF+ RateF- RateChangeDirection
Action Looping0.2750.100-0.175DECREASE
State Mismanagement0.2920.065-0.227DECREASE
Early Termination0.2110.188-0.023decrease
Visual/Spatial Failure0.1620.149-0.013decrease
Novel / Uncaptured0.0600.498+0.438INCREASE