{"pk":26975,"title":"Strategic exploration in human adaptive control","subtitle":null,"abstract":"How do people explore in order to gain rewards in uncer-tain dynamical systems? Within a reinforcement learningparadigm, control normally involves trading off between ex-ploration (i.e. trying out actions in order to gain more knowl-edge about the system) and exploitation (i.e. using currentknowledge of the system to maximize reward). We study anovel control task in which participants must steer a boat ona grid, aiming to follow a path of high reward whilst learninghow their actions affect the boat’s position. We find that partic-ipants explore strategically yet conservatively, exploring morewhen mistakes are less costly and practicing actions that willbe required later on.","language":"eng","license":{"name":"","short_name":"","text":null,"url":""},"keywords":[{"word":"Reinforcement Learning"},{"word":"Strategic Exploration"},{"word":"Control"},{"word":"Exploration-Exploitation"}],"section":"Talks: Papers","is_remote":true,"remote_url":"https://escholarship.org/uc/item/01w4p8ct","frozenauthors":[{"first_name":"Eric","middle_name":"","last_name":"Schulz","name_suffix":"","institution":"University College London","department":""},{"first_name":"Edgar","middle_name":"D.","last_name":"Klenske","name_suffix":"","institution":"Max Planck Institute for Intelligent Systems,","department":""},{"first_name":"Neil","middle_name":"R.","last_name":"Bramley","name_suffix":"","institution":"New York University","department":""},{"first_name":"Maarten","middle_name":"","last_name":"Speekenbrink","name_suffix":"","institution":"University College London","department":""}],"date_submitted":null,"date_accepted":null,"date_published":"2017-01-01T10:00:00-08:00","render_galley":null,"galleys":[{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/26975/galley/16611/download/"}]}