JANUS

Joint Adaptive Non-stationary Updating and Scoring for World Models — A framework for joint training, continual adaptation, and causal evaluation of world models in non-stationary environments.

5.09%
Improvement over Naive Baseline
95.6%
Forgetting Reduction (EWC)
0.6015
Normalized Causal Strength
597
Drift Events Detected

Per-Regime Performance

Mean Episodic Return by Regime

Causal Evaluation: ACE by Regime

Normalized Causal Strength (NCS) by Regime

Forgetting Analysis

Detailed Results

Mean Episodic Return per Regime

RegimeJANUSNaiveOracleRandomDrifts
0-1.489 ± 1.145-1.334 ± 0.801-0.712 ± 0.498-8.920 ± 2.145206
1-6.661 ± 3.253-7.688 ± 3.518-0.562 ± 0.429-9.210 ± 2.058138
2-4.943 ± 3.327-4.704 ± 3.171-0.555 ± 0.730-9.471 ± 2.344146
3-2.498 ± 2.052-2.700 ± 2.185-0.590 ± 0.447-6.909 ± 3.272107
Avg-3.898-4.107-0.605-8.627597

Causal Evaluation Metrics

RegimeACE (JANUS)ACE (Naive)ACE (Oracle)NCS (JANUS)NCS (Naive)
07.4317.5858.2080.9050.924
12.5491.5228.6480.2950.176
24.5284.7678.9160.5080.535
34.4124.2096.3200.6980.666
Avg4.7304.5218.0230.6020.575

Catastrophic Forgetting (Regime 0 Performance)

MethodInitial ReturnFinal ReturnForgetting ScoreReduction
JANUS (EWC)-1.489-1.499 ± 0.8660.01095.6%
Naive-1.334-1.568 ± 0.8440.233