{"pk":24097,"title":"Simplicity Bias in Human-generated data","subtitle":null,"abstract":"Texts available on the Web have been generated by human minds. We observe that simple patterns are over-represented: abcdef is more frequent than arfbxg and 1000 appears more often than 1282. We suggest that word frequency patterns can be predicted by cognitive models based on complexity minimization. Conversely, the observation of word frequencies offers an opportunity to infer particular cognitive mechanisms involved in their generation.","language":"eng","license":{"name":"","short_name":"","text":null,"url":""},"keywords":[{"word":"Computer Science; Other; Complex systems; Language and thought; Other; Semantic memory; Corpus studies; Mathematical modeling"}],"section":"Papers with Poster Presentation","is_remote":true,"remote_url":"https://escholarship.org/uc/item/8244x8kj","frozenauthors":[{"first_name":"Jean-Louis","middle_name":"","last_name":"Dessalles","name_suffix":"","institution":"Institut Polytechnique de Paris","department":""},{"first_name":"Giovanni","middle_name":"","last_name":"Sileno","name_suffix":"","institution":"University of Amsterdam","department":""}],"date_submitted":null,"date_accepted":null,"date_published":"2024-01-01T18:00:00Z","render_galley":null,"galleys":[{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/24097/galley/13691/download/"},{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/24097/galley/21550/download/"}]}