{"pk":30857,"title":"Dynamic Reinforcement Driven Error Propagation Networks with Application to Game Playing","subtitle":null,"abstract":"This paper discusses the problem of the reinforcement driven learning of a response to a time varying sequence. The problem has three parts: the adaptation of internal parameters to model complex mappings; the ability of the architecture to represent time varying input; and the problem of credit assignment with unknown delays between the input, output and reinforcement signals. The method developed in this paper is based on a connectionist network trained using the error propagation algorithm with internal feedback. The network is viewed both as a context dependent predictor of the reinforcement signal and as a means of temporal credit assignment. Several architectures for these networks are discussed and insight into the implementation problems is gained by an application to the game of noughts and crosses.","language":"eng","license":{"name":"","short_name":"","text":null,"url":""},"keywords":[],"section":"Poster Presentations","is_remote":true,"remote_url":"https://escholarship.org/uc/item/4c35p1kp","frozenauthors":[{"first_name":"Tony","middle_name":"","last_name":"Robinson","name_suffix":"","institution":"Cambridge University","department":""},{"first_name":"Frank","middle_name":"","last_name":"Fallside","name_suffix":"","institution":"Cambridge University","department":""}],"date_submitted":null,"date_accepted":null,"date_published":"1989-01-01T18:00:00Z","render_galley":null,"galleys":[{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/30857/galley/20706/download/"}]}