Why linear recurrent memory works in partially observable reinforcement learning