eLife assessment
This well-written and well-reasoned manuscript describes a behavioral and computational modeling study designed to understand pure exploration, where action selection is driven by pure information seeking and not rewards. Using a novel task, the authors find that a subset of people use information value to drive their selection behavior, consistent with a simple information maximization model of reinforcement learning. The rest of the participants did not exhibit this behavior. This valuable work provides intriguing, yet somewhat incomplete, insights into understanding directed exploration and its computational form.