{"pk":49763,"title":"The Black Stories Experiment: Two Groups are Trying to Solve a Riddle Game Behind a Screen, Only One Group Is Alive","subtitle":null,"abstract":"Studying large language models (LLMs) can provide valuable insights into their strengths and limitations. This study explores problem-solving capabilities of GPT-4 by comparing the model's performance in solving Black Stories riddles, to human performance. The study utilized a set of 12 adjusted Black Stories, each tested twice within the human and GPT-4 group. The experiment was conducted through text messaging for a comparable set-up. The primary measure of performance was the number of questions and hints needed to solve the riddle. Results indicated no significant difference between the groups. Qualitative results showed that GPT-4 excelled in precise questioning and creativity but often fixated on details. Humans covered broader topics and adapted the focus quickly but struggled with uncommon details. This research suggests that despite different approaches, GPT-4's performance was comparable to that of humans, demonstrating its potential as a capable participant in these types of problem solving games.","language":"eng","license":{"name":"","short_name":"","text":null,"url":""},"keywords":[{"word":"Artificial Intelligence; Natural Language Processing; Problem Solving; Reasoning; Computer-based experiment"}],"section":"Papers with Poster Presentation","is_remote":true,"remote_url":"https://escholarship.org/uc/item/38m599bg","frozenauthors":[{"first_name":"Yanna","middle_name":"","last_name":"Smid","name_suffix":"","institution":"Leiden University","department":""},{"first_name":"Nikki","middle_name":"","last_name":"Rademaker","name_suffix":"","institution":"Leiden University","department":""},{"first_name":"Linthe","middle_name":"","last_name":"van Rooij","name_suffix":"","institution":"Leiden Institute of Advanced Computer Science","department":""},{"first_name":"Tessa","middle_name":"","last_name":"Verhoef","name_suffix":"","institution":"Leiden University","department":""}],"date_submitted":null,"date_accepted":null,"date_published":"2025-01-01T10:00:00-08:00","render_galley":null,"galleys":[{"label":"PDF","type":"pdf","path":"https://journalpub.escholarship.org/cognitivesciencesociety/article/49763/galley/37725/download/"}]}