Causal Role of Reasoning Bonds in Long CoT Learning

Why Imitation-Based Distillation Fails to Induce Chain-of-Thought Structure

0.652
Deep-Reasoning ACE
0.549
Self-Reflection ACE
0.449
Self-Exploration ACE
0.032
SFT Weight Error

Average Causal Effects

Learning Curves

Learned Weights vs True Causal Strength

Structural Similarity Index

Key Findings