Expert activation depends on input content. The router learns which experts handle which types of inputs, but this means access patterns are:
• Data-dependent — can't predict without seeing input
• Irregular — no fixed pattern to exploit
• Sparse — only K of N experts active
Expert Activation Pattern (8 tokens)
Rows = tokens, Cols = experts. Orange = activated.