In this work, we ask and answer the question of what makes classical temporal-difference reinforcement learning with \(\epsilon\)-greedy strategies cooperative. Cooperating in social dilemma situations is vital ...