Play NIM
—
Show the “winning test” (why xor matters)
Let x = a₁ ⊕ a₂ ⊕ … ⊕ a_m, where ⊕ is bitwise xor. If x = 0, the position is losing for the next player (with perfect play). If x ≠ 0, there is a move that makes the nim-sum zero.
In other words: many “different looking” boards have the same strategic type. This mismatch is one reason a single-frame network that tries to directly associate patterns in the board image with the best move can struggle, compared to a model that can compute or represent the xor-structure.