Same prompt, one-word perturbation — when do generations fork?

Left: Qwen 0.8B. Right: Qwen 4B. Top = Prompt A, bottom = Prompt B (one-word perturbed). Gray = word appears in both generations (LCS aligned); color intensity = length of the unique run — speckles mean reshuffling, solid blocks mean true divergence.
Qwen 3.5 — 0.8B
coupled: 0 / 0 tokens
Qwen 3.5 — 4B (thinkoff)
coupled: 0 / 0 tokens