Appendix G

EMA Scoring Algorithm

Half-life: t_{1/2} = -ln(2) / ln(1 - α)
Steady state: For constant attention a, score → a
Decay: Without attention, score → 0 exponentially
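
The half-life formula can be checked numerically, and inverted to pick α from a target half-life. The following is a minimal sketch; the function names `half_life` and `alpha_for_half_life` are illustrative and not part of any system described here.

```python
import math


def half_life(alpha: float) -> float:
    """Steps for an unrefreshed score to halve: -ln(2) / ln(1 - alpha)."""
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must lie strictly between 0 and 1")
    return -math.log(2.0) / math.log(1.0 - alpha)


def alpha_for_half_life(steps: float) -> float:
    """Invert the half-life formula: alpha = 1 - 2 ** (-1 / steps)."""
    if steps <= 0:
        raise ValueError("steps must be positive")
    return 1.0 - 2.0 ** (-1.0 / steps)


# Illustrative values only: alpha = 0.1 gives a half-life of roughly 6.6 steps.
print(round(half_life(0.1), 2))
print(round(alpha_for_half_life(20), 4))
```

A larger α shortens the half-life, so the score tracks recent attention more closely but forgets faster. A smaller α does the opposite.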

Mathematical foundation, α parameter selection, and comparison to LRU.

G.1 EMA Update Rule

score_t(p) = α × attention_t(p) + (1 - α) × score_{t-1}(p)

This gives each score a "memory" of past attention: the most recent step has weight α, and every older contribution shrinks by a factor of (1 - α) per step.
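
As a rough sketch, the update rule and its two limiting behaviours (convergence to a constant attention level, and exponential decay toward 0 without attention) could be written as follows. The names `ema_update` and `run`, the initial score of 0, and the value α = 0.1 are assumptions made for illustration.

```python
def ema_update(prev_score: float, attention: float, alpha: float) -> float:
    """One EMA step: alpha * attention_t + (1 - alpha) * score_{t-1}."""
    return alpha * attention + (1.0 - alpha) * prev_score


def run(attentions, alpha: float, initial: float = 0.0) -> float:
    """Fold a sequence of per-step attention values into a final score."""
    score = initial
    for attention in attentions:
        score = ema_update(score, attention, alpha)
    return score


ALPHA = 0.1  # illustrative value, not a recommended setting

# Constant attention a = 0.04: the score approaches 0.04.
steady = run([0.04] * 200, ALPHA)

# No attention at all: a score starting at 0.04 decays toward 0.
decayed = run([0.0] * 200, ALPHA, initial=0.04)
```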

LRU problem: The system prompt at position 5 is evicted after 100 steps, even though it receives 4% attention at every step.
EMA solution: Its score settles at a stable 0.04, which keeps it cached.
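
The scenario above can be mimicked with a toy example. This sketch assumes that eviction removes the entry with the lowest EMA score. The second entry (position 40), its attention values, α = 0.1, and the helper names are all hypothetical.

```python
def update_scores(scores: dict, attention: dict, alpha: float) -> dict:
    """Apply the EMA rule to every cached position; absent attention counts as 0."""
    return {
        pos: alpha * attention.get(pos, 0.0) + (1.0 - alpha) * score
        for pos, score in scores.items()
    }


def evict_lowest(scores: dict) -> int:
    """Return the position with the smallest EMA score (the eviction candidate)."""
    return min(scores, key=scores.get)


ALPHA = 0.1  # illustrative value

# Position 5: system prompt, 4% attention at every step.
# Position 40: hypothetical token attended heavily once, then ignored.
scores = {5: 0.0, 40: 0.0}
scores = update_scores(scores, {5: 0.04, 40: 0.50}, ALPHA)  # step 1
for _ in range(99):                                          # steps 2..100
    scores = update_scores(scores, {5: 0.04}, ALPHA)

victim = evict_lowest(scores)
print(victim, scores)
```

After 100 steps the system prompt's score sits near 0.04 while the ignored token's score has decayed to almost 0, so the ignored token is the eviction candidate. A policy that ranks entries purely by age would pick the system prompt instead.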

Result: +15% hit rate improvement over LRU
