{"pk":27031,"title":"Mapping the unknown: The spatially correlated multi-armed bandit","subtitle":null,"abstract":"We introduce the spatially correlated multi-armed bandit as a task coupling function learning with the exploration-exploitation trade-off. Participants interacted with bi-variate reward functions on a two-dimensional grid, with the goal of either gaining the largest average score or finding the largest payoff. By providing an opportunity to learn the underlying reward function through spatial correlations, we model to what extent people form beliefs about unexplored payoffs and how that guides search behavior. Participants adapted to assigned payoff conditions, performed better in smooth than in rough environments, and—surprisingly—sometimes performed equally well in short as in long search horizons. Our modeling results indicate a preference for local search options, which when accounted for, still suggests participants were best described as forming local inferences about unexplored regions, combined with a search strategy that directly traded off between exploiting high expected rewards and exploring to reduce uncertainty about the spatial structure of rewards.","language":"eng","license":{"name":"","short_name":"","text":null,"url":""},"keywords":[{"word":"Exploration-exploitation; Multi-armed bandits; Active Learning; Gaussian Processes;"}],"section":"Talks: Papers","is_remote":true,"remote_url":"https://escholarship.org/uc/item/5510q02k","frozenauthors":[{"first_name":"Charley","middle_name":"M.","last_name":"Wu","name_suffix":"","institution":"Max Planck Institute for Human Development","department":""},{"first_name":"Eric","middle_name":"","last_name":"Schulz","name_suffix":"","institution":"University College London","department":""},{"first_name":"Maarten","middle_name":"","last_name":"Speekenbrink","name_suffix":"","institution":"University College London","department":""},{"first_name":"Jonathan","middle_name":"D.","last_name":"Nelson","name_suffix":"","institution":"Max Planck Institute for Human Development","department":""},{"first_name":"Bjorn","middle_name":"","last_name":"Meder","name_suffix":"","institution":"Max Planck Institute for Human Development","department":""}],"date_submitted":null,"date_accepted":null,"date_published":"2017-01-01T18:00:00Z","render_galley":null,"galleys":[{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/27031/galley/16667/download/"}]}