SFC-Score

A unified metric framework balancing Sparsity, Fidelity, and Mechanistic Completeness for evaluating interpretability decompositions via weighted harmonic mean.

0.905
Peak SFC-Score (Equal Weights)
0.85
Optimal Sparsity Level
0.874
Pareto Hypervolume
4
Model Configurations

Three-Way Trade-off

SFC-Score vs Sparsity Level (Standard Model)

Individual Axes vs Sparsity

Weight Sensitivity Analysis

Model Configuration Comparison

Detailed Results

SFC-Score by Sparsity Level (Standard Model, Equal Weights)

SparsitySparsity ScoreFidelity ScoreCompletenessSFC-Score
0.500.5000.9950.9800.746
0.600.6000.9900.9600.803
0.700.7000.9800.9400.848
0.800.8000.9600.9100.884
0.850.8500.9400.9300.905
0.900.9000.9000.8700.890
0.950.9500.8200.7800.847
0.990.9900.6500.6000.727

Weight Sensitivity (Selected Profiles)

ProfileWeights (S:F:C)Best SparsitySFC-Score
Equal1:1:10.850.905
Sparsity-Heavy5:1:10.950.917
Fidelity-Heavy1:5:10.700.911
Completeness-Heavy1:1:50.800.892

Model Configurations

ConfigHidden DimCircuit SizePareto HV
Standard6480.874
Large12816~0.85
Dense Circuit6424~0.82
Sparse Circuit644~0.90