Comparison of memory management strategies for external memory banks that store per-document modulation parameters in memory-augmented large language models.

| Strategy | Storage (KB) | Compression Ratio | Reconstruction Error | Throughput (docs/s) |
|---|---|---|---|---|
| Full Storage | 256.0 | 1.0x | 0.000 | 201,205 |
| PCA Compression | 256.0 | 1.0x | 0.000 | 265,103 |
| Random Eviction | 128.0 | 2.0x | 0.513 | 3,385 |
| LRU Eviction | 128.0 | 2.0x | 0.500 | 196,032 |
| Quantization | 64.0 | 4.0x | 0.005 | 59,518 |
| Clustering | 29.6 | 8.6x | 1.000 | 202,873 |
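
The compression figures in the table appear to be the Full Storage footprint divided by each strategy's footprint. That reading is an assumption, since the definition is not stated here, but the reported values are consistent with it. A minimal sketch that recomputes the ratios from the storage column (the names `storage_kb` and `compression_ratio` are illustrative, not from the original work):

```python
# Storage footprints in KB, copied from the table above.
storage_kb = {
    "Full Storage": 256.0,
    "PCA Compression": 256.0,
    "Random Eviction": 128.0,
    "LRU Eviction": 128.0,
    "Quantization": 64.0,
    "Clustering": 29.6,
}


def compression_ratio(baseline_kb: float, strategy_kb: float) -> float:
    """Assumed definition: uncompressed footprint over the strategy's footprint."""
    return baseline_kb / strategy_kb


# Assumption: Full Storage is the uncompressed baseline.
baseline = storage_kb["Full Storage"]

# Round to one decimal place to match the precision used in the table.
ratios = {
    name: round(compression_ratio(baseline, kb), 1)
    for name, kb in storage_kb.items()
}

for name, ratio in ratios.items():
    print(f"{name:<16} {ratio:.1f}x")
```

Under this assumption, the recomputed ratios match the table for every row, including the 8.6x reported for Clustering (256.0 / 29.6, rounded to one decimal place).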