How LLM agent performance and calibration vary with user age across 7 countries — a fairness audit with cultural moderator analysis
Critical insights from the cross-national age-disparity analysis of LLM agent performance.
Per-country regression slopes quantifying performance decline per year of age. More negative values indicate steeper age-related drops.
Subgroup-level audit across 7 countries and 3 age bands. Rows flagged where disparate impact ratio falls below 0.80 threshold.
Correlation between cultural/infrastructure variables and the magnitude of age-related performance decline across countries.
Sample size requirements per cell to detect the observed age × country interaction effect at 80% power.
Study design and analytical framework for the cross-national age-disparity audit.