DOI: 10.1145/3586183.3606763
research-article
Open access

Generative Agents: Interactive Simulacra of Human Behavior

Authors:
Joon Sung Park (Computer Science Department, Stanford University, United States)
Joseph O'Brien (Computer Science Department, Stanford University, United States)
Carrie Jun Cai
Meredith Ringel Morris
Percy Liang (Computer Science Department, Stanford University, United States)
Michael S. Bernstein (Computer Science Department, Stanford University, United States)
Published: 29 October 2023

Abstract

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents: computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent’s experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty-five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors. For example, starting with only a single user-specified notion that one agent wants to throw a Valentine’s Day party, the agents autonomously spread invitations to the party over the next two days, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time. We demonstrate through ablation that the components of our agent architecture—observation, planning, and reflection—each contribute critically to the believability of agent behavior. By fusing large language models with computational interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.
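The store, synthesize, and retrieve loop described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions: the class and function names, the scoring rule (word overlap plus an exponential recency decay), the example agents, and the `toy_summarize` stand-in for a language-model call are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    text: str                  # natural-language record
    time: int                  # hypothetical simulation step when it was stored
    kind: str = "observation"  # "observation" or "reflection"


@dataclass
class MemoryStream:
    records: list = field(default_factory=list)

    def store(self, text, time, kind="observation"):
        """Append a natural-language record; earlier records are never overwritten."""
        self.records.append(Memory(text, time, kind))

    def reflect(self, now, summarize):
        """Store a higher-level reflection over the most recent records.

        `summarize` stands in for a language-model call (assumption).
        """
        recent = [m.text for m in self.records[-5:]]
        self.store(summarize(recent), now, kind="reflection")

    def retrieve(self, query, now, k=2, decay=0.9):
        """Return the k records with the highest score.

        Score = number of shared words + decay ** age. This rule is an
        assumption made for the sketch, not the paper's retrieval function.
        """
        query_words = set(query.lower().split())

        def score(m):
            overlap = len(query_words & set(m.text.lower().split()))
            recency = decay ** (now - m.time)
            return overlap + recency

        return sorted(self.records, key=score, reverse=True)[:k]


def toy_summarize(texts):
    # Placeholder for a language model: only reports how many memories it saw.
    return f"Reflection over {len(texts)} recent memories"


stream = MemoryStream()
stream.store("Alice wants to throw a Valentine's Day party", time=1)
stream.store("Bob is reading in the library", time=2)
stream.store("Carol received an invitation to the party", time=3)
stream.reflect(now=4, summarize=toy_summarize)

# Memories relevant to the query would then be placed in the planning prompt.
top = stream.retrieve("party invitation", now=5, k=2)
for m in top:
    print(m.kind, "|", m.text)
```

Keeping every record as plain text in an append-only list mirrors the "complete record of the agent's experiences" in the abstract; a real system would replace the word-overlap score and the toy summarizer with model-based components.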

Supplementary Material

ZIP File (3606763.zip)
Supplemental File

References 参考文献

[1]
Gavin Abercrombie, Amanda Cercas Curry, Tanvi Dinkar, and Zeerak Talat. 2023. Mirages: On Anthropomorphism in Dialogue Systems. arxiv:2305.09800 [cs.CL]
加文·阿伯克龙比,阿曼达·塞尔卡斯·库里,坦维·丁卡尔,和泽拉克·塔拉特。2023 年。《海市蜃楼:对话系统中的拟人化》。arxiv:2305.09800 [cs.CL]
[2]
Robert Ackland, Jamsheed Shorish, Paul Thomas, and Lexing Xie. 2013. How dense is a network?http://users.cecs.anu.edu.au/ xlx/teaching/css2013/network-density.html.
罗伯特·阿克兰德,贾姆希德·肖里什,保罗·托马斯,和谢立兴。2013 年。网络的密度有多大?http://users.cecs.anu.edu.au/xlx/teaching/css2013/network-density.html。
[3]
Eytan Adar, Mira Dontcheva, and Gierad Laput. 2014. CommandSpace: Modeling the Relationships between Tasks, Descriptions and Features. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (Honolulu, Hawaii, USA) (UIST ’14). Association for Computing Machinery, New York, NY, USA, 167–176. https://doi.org/10.1145/2642918.2647395
Eytan Adar, Mira Dontcheva, 和 Gierad Laput. 2014. CommandSpace: 建模任务、描述和特征之间的关系. 载于第 27 届年度 ACM 用户界面软件与技术研讨会论文集(美国夏威夷檀香山)(UIST ’14). 计算机协会, 纽约, NY, USA, 167–176. https://doi.org/10.1145/2642918.2647395
[4]
Saleema Amershi, Maya Cakmak, William Bradley Knox, and Todd Kulesza. 2014. Power to the people: The role of humans in interactive machine learning. AI Magazine 35, 4 (2014), 105–120.
Saleema Amershi, Maya Cakmak, William Bradley Knox, 和 Todd Kulesza. 2014. 权力归于人民:人类在交互式机器学习中的角色. AI 杂志 35, 4 (2014), 105–120.
[5]
Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N Bennett, Kori Inkpen, 2019. Guidelines for human-AI interaction. In Proceedings of the 2019 chi conference on human factors in computing systems. 1–13.
Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N Bennett, Kori Inkpen, 2019. 人工智能与人类互动指南。在 2019 年计算机系统人因会议论文集中,1–13。
[6]
John R. Anderson. 1993. Rules of the Mind. Lawrence Erlbaum Associates, Hillsdale, NJ.
约翰·R·安德森. 1993. 心智的规则. 劳伦斯·厄尔鲍姆协会, 希尔斯代尔, 新泽西.
[7]
Electronic Arts. 2009. The Sims 3. Video game.
电子艺界。2009 年。《模拟人生 3》。视频游戏。
[8]
Ruth Aylett. 1999. Narrative in virtual environments—towards emergent narrative. In Narrative Intelligence: Papers from the AAAI Fall Symposium (Technical Report FS-99-01). AAAI Press, 83–86.
鲁思·艾尔特. 1999. 虚拟环境中的叙事——走向涌现叙事. 收录于《叙事智能:来自 AAAI 秋季研讨会的论文》(技术报告 FS-99-01). AAAI 出版社, 83–86.
[9]
Christoph Bartneck and Jodi Forlizzi. 2004. A design-centered framework for social human-robot interaction. In Proceedings of the 13th IEEE International Workshop on Robot and Human Interactive Communication (RO-MAN’04). 591–594. https://doi.org/10.1109/ROMAN.2004.1374827
克里斯托夫·巴特内克和乔迪·福尔齐兹。2004 年。一个以设计为中心的社会人机交互框架。在第十三届 IEEE 国际机器人与人类互动通信研讨会(RO-MAN’04)论文集中。591–594。https://doi.org/10.1109/ROMAN.2004.1374827
[10]
Joseph Bates. 1994. The Role of Emotion in Believable Agents. Commun. ACM 37, 7 (1994), 122–125. https://doi.org/10.1145/176789.176803
约瑟夫·贝茨。1994 年。《情感在可信代理中的作用》。计算机协会通讯 37, 7 (1994), 122–125. https://doi.org/10.1145/176789.176803
[11]
Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique P. d.O. Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, and Susan Zhang. 2019. Dota 2 with Large Scale Deep Reinforcement Learning. arXiv preprint arXiv:1912.06680 (2019).
克里斯托弗·伯纳、格雷格·布罗克曼、布鲁克·陈、维基·张、普热梅斯瓦夫·德比亚克、克里斯蒂·丹尼森、大卫·法希、奎林·费舍尔、沙里克·哈什梅、克里斯·赫斯、拉法尔·尤泽夫维奇、斯科特·格雷、凯瑟琳·奥尔森、雅库布·帕霍基、迈克尔·彼得罗夫、亨里克·P·d.O.·平托、乔纳森·赖曼、蒂姆·萨利曼斯、杰里米·施拉特、乔纳斯·施奈德、斯维蒙·西多尔、伊利亚·苏茨克弗、唐杰、菲利普·沃尔斯基和苏珊·张。2019 年。《使用大规模深度强化学习的 Dota 2》。arXiv 预印本 arXiv:1912.06680(2019 年)。
[12]
Marcel Binz and Eric Schulz. 2023. Using cognitive psychology to understand GPT-3. Proceedings of the National Academy of Sciences 120, 6 (2023), e2218523120.
马塞尔·宾茨和埃里克·舒尔茨。2023 年。利用认知心理学理解 GPT-3。《国家科学院院刊》120 卷,第 6 期(2023 年),e2218523120。
[13]
BioWare. 2007. Mass Effect. Video game.
生物工坊. 2007. 《质量效应》。视频游戏。
[14]
Woody Bledsoe. 1986. I had a dream: AAAI presidential address. AI Magazine 7, 1 (1986), 57–61.
伍迪·布莱德索。1986 年。我有一个梦想:AAAI 总统演讲。《人工智能杂志》7 卷,第 1 期(1986 年),57–61 页。
[15]
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, and et al.2022. On the Opportunities and Risks of Foundation Models. arxiv:2108.07258 [cs.LG]
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli 等. 2022. 关于基础模型的机遇与风险. arxiv:2108.07258 [cs.LG]
[16]
Michael Brenner. 2010. Creating dynamic story plots with continual multiagent planning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence.
迈克尔·布伦纳。2010 年。通过持续的多智能体规划创建动态故事情节。在第 24 届人工智能协会会议论文集中。
[17]
Rodney A. Brooks, Cynthia Breazeal, Marko Marjanovic, Brian Scassellati, and Matthew Williamson. 2000. The Cog Project: Building a Humanoid Robot. In Computation for Metaphors, Analogy, and Agents(Lecture Notes on Artificial Intelligence, 1562), Chrystopher Nehaniv (Ed.). Springer-Verlag, Berlin, 52–87.
罗德尼·A·布鲁克斯,辛西娅·布雷兹尔,马尔科·马尔贾诺维奇,布赖恩·斯卡塞拉提,和马修·威廉姆森。2000 年。《Cog 项目:构建类人机器人》。载于《隐喻、类比与智能体的计算》(人工智能讲义笔记,1562),克里斯托弗·内哈尼夫(编)。施普林格-维尔拉格,柏林,52–87。
[18]
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. arxiv:2005.14165 [cs.CL]
汤姆·B·布朗,本杰明·曼,尼克·赖德,梅拉妮·萨比亚,贾里德·卡普兰,普拉富拉·达里瓦尔,阿尔文·尼拉坎坦,普拉纳夫·夏姆,吉里什·萨斯特里,阿曼达·阿斯克尔,桑迪尼·阿加瓦尔,阿里尔·赫伯特-沃斯,格雷琴·克鲁格,汤姆·亨尼汉,瑞温·查尔德,阿迪提亚·拉梅什,丹尼尔·M·齐格勒,杰弗里·吴,克莱门斯·温特,克里斯托弗·赫塞,马克·陈,埃里克·西格勒,马特乌什·利特温,斯科特·格雷,本杰明·切斯,杰克·克拉克,克里斯托弗·伯纳,萨姆·麦肯德利什,亚历克·拉德福德,伊利亚·苏茨克弗,和达里奥·阿莫代伊。2020 年。《语言模型是少样本学习者》。arxiv:2005.14165 [cs.CL]
[19]
Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, 2023. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712 (2023).
塞巴斯蒂安·布贝克,瓦伦·钱德拉塞卡兰,罗嫩·埃尔丹,约翰内斯·盖尔克,埃里克·霍维茨,埃切·卡马尔,彼得·李,尹·塔特·李,元志·李,斯科特·伦德伯格,2023 年。人工通用智能的火花:关于 gpt-4 的早期实验。arXiv 预印本 arXiv:2303.12712(2023 年)。
[20]
Robin Burkinshaw. 2009. Alice and Kev: The Story of Being Homeless in The Sims 3.
罗宾·伯金肖. 2009. 《爱丽丝与凯夫:在《模拟人生 3》中无家可归的故事》。
[21]
Chris Callison-Burch, Gaurav Singh Tomar, Lara Martin, Daphne Ippolito, Suma Bailis, and David Reitter. 2022. Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 9379–9393. https://aclanthology.org/2022.emnlp-main.637
克里斯·卡利森-伯奇,戈拉夫·辛格·托马尔,拉拉·马丁,达芙妮·伊波利托,苏玛·贝利斯,和大卫·赖特。2022 年。《龙与地下城》作为人工智能的对话挑战。在 2022 年自然语言处理实证方法会议论文集中。计算语言学协会,阿布扎比,阿拉伯联合酋长国,9379–9393。https://aclanthology.org/2022.emnlp-main.637
[22]
Stuart K Card, Thomas P Moran, and Allen Newell. 1980. The keystroke-level model for user performance time with interactive systems. Commun. ACM 23, 7 (1980), 396–410. https://doi.org/10.1145/358886.358895 arXiv:https://doi.org/10.1145/358886.358895
斯图尔特·K·卡德,托马斯·P·莫兰,艾伦·纽厄尔。1980 年。交互系统用户性能时间的击键级模型。《计算机协会通讯》23 卷,第 7 期(1980),396–410。https://doi.org/10.1145/358886.358895 arXiv:https://doi.org/10.1145/358886.358895
[23]
Stuart K Card, Thomas P Moran, and Alan Newell. 1983. The psychology of human-computer interaction. (1983).
斯图尔特·K·卡德,托马斯·P·莫兰,阿兰·纽厄尔。1983 年。人机交互的心理学。(1983 年)。
[24]
Alex Champandard. 2012. Tutorial presentation. In IEEE Conference on Computational Intelligence and Games.
亚历克斯·香潘达德。2012 年。教程演示。在 IEEE 计算智能与游戏会议上。
[25]
Dong kyu Choi, Tolga Konik, Negin Nejati, Chunki Park, and Pat Langley. 2021. A Believable Agent for First-Person Shooter Games. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Vol. 3. 71–73.
东奎崔、托尔加·科尼克、内金·内贾提、春基·朴和帕特·兰格利。2021 年。用于第一人称射击游戏的可信代理。在《美国人工智能协会人工智能与互动数字娱乐会议论文集》第 3 卷。71–73 页。
[26]
Anind K Dey. 2001. Understanding and using context. Personal and ubiquitous computing 5 (2001), 4–7.
Anind K Dey. 2001. 理解和使用上下文。个人与普适计算 5 (2001), 4–7.
[27]
Kevin Dill and L Martin. 2011. A Game AI Approach to Autonomous Control of Virtual Characters. In Proceedings of the Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC’11). Orlando, FL, USA.
凯文·迪尔和 L·马丁。2011 年。基于游戏人工智能的虚拟角色自主控制方法。载于《国际服务/行业培训、模拟与教育会议论文集》(I/ITSEC’11)。美国佛罗里达州奥兰多。
[28]
David Easley and Jon Kleinberg. 2010. Networks, crowds, and markets: Reasoning about a highly connected world. Cambridge university press.
大卫·伊斯利和乔恩·克莱因伯格. 2010. 《网络、众群与市场:关于高度互联世界的推理》. 剑桥大学出版社.
[29]
Arpad E Elo. 1967. The Proposed USCF Rating System, Its Development, Theory, and Applications. Chess Life XXII, 8 (August 1967), 242–247.
阿尔帕德·E·埃洛. 1967. 提议的美国国际象棋联合会评级系统及其发展、理论和应用. 国际象棋生活 XXII, 8 (1967 年 8 月), 242–247.
[30]
Jerry Alan Fails and Dan R Olsen Jr. 2003. Interactive machine learning. In Proceedings of the 8th international conference on Intelligent user interfaces. ACM, 39–45.
杰瑞·艾伦·费尔斯和丹·R·奥尔森 Jr. 2003. 交互式机器学习. 载于第八届国际智能用户界面会议论文集. ACM, 39–45.
[31]
Ethan Fast, William McGrath, Pranav Rajpurkar, and Michael S Bernstein. 2016. Augur: Mining human behaviors from fiction to power interactive systems. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. 237–247.
伊桑·法斯特,威廉·麦克格拉斯,普拉纳夫·拉吉普卡尔,迈克尔·S·伯恩斯坦。2016 年。《Augur:从虚构中挖掘人类行为以增强交互系统》。载于 2016 年计算机系统人因会议(CHI 会议)论文集。237–247。
[32]
Rebecca Fiebrink and Perry R Cook. 2010. The Wekinator: a system for real-time, interactive machine learning in music. In Proceedings of The Eleventh International Society for Music Information Retrieval Conference (ISMIR 2010)(Utrecht), Vol. 3. Citeseer, 2–1.
Rebecca Fiebrink 和 Perry R Cook. 2010. Wekinator:一个用于音乐实时交互式机器学习的系统。在第十一届国际音乐信息检索会议(ISMIR 2010)论文集(乌得勒支),第 3 卷。Citeseer, 2–1.
[33]
Uwe Flick. 2009. An Introduction to Qualitative Research. SAGE.
乌韦·弗里克. 2009. 《定性研究导论》。SAGE。
[34]
James Fogarty, Desney Tan, Ashish Kapoor, and Simon Winder. 2008. CueFlik: Interactive Concept Learning in Image Search. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Florence, Italy) (CHI ’08). Association for Computing Machinery, New York, NY, USA, 29–38. https://doi.org/10.1145/1357054.1357061
詹姆斯·福加提,德斯尼·谭,阿希什·卡普尔,西蒙·温德。2008 年。《CueFlik:图像搜索中的互动概念学习》。载于《人机交互系统会议论文集》(意大利佛罗伦萨)(CHI '08)。计算机协会,纽约,纽约,美国,29–38。https://doi.org/10.1145/1357054.1357061
[35]
Adam Fourney, Richard Mann, and Michael Terry. 2011. Query-feature graphs: bridging user vocabulary and system functionality. In Proceedings of the ACM Symposium on User Interface Software and Technology (UIST) (Santa Barbara, California, USA). ACM.
亚当·福尔尼,理查德·曼,迈克尔·特里。2011 年。查询特征图:连接用户词汇与系统功能。在 ACM 用户界面软件与技术研讨会(UIST)论文集中(加利福尼亚州圣巴巴拉,美国)。ACM。
[36]
Tom Francis. 2010. The Minecraft Experiment, day 1: Chasing Waterfalls. http://www.pcgamer.com/2010/11/20/the-minecraft-experiment-day-1-chasing-waterfalls/
汤姆·弗朗西斯. 2010. 《我的世界实验,第一天:追逐瀑布》。http://www.pcgamer.com/2010/11/20/the-minecraft-experiment-day-1-chasing-waterfalls/
[37]
Jonas Freiknecht and Wolfgang Effelsberg. 2020. Procedural Generation of Interactive Stories using Language Models. In International Conference on the Foundations of Digital Games (FDG ’20). ACM, Bugibba, Malta, 8. https://doi.org/10.1145/3402942.3409599
乔纳斯·弗雷克内希特和沃尔夫冈·埃费尔斯贝格。2020 年。使用语言模型进行互动故事的程序生成。在数字游戏基础国际会议(FDG '20)上。ACM,马耳他布吉巴,8。https://doi.org/10.1145/3402942.3409599
[38]
Tianyu Gao, Adam Fisch, and Danqi Chen. 2020. Making Pre-trained Language Models Better Few-shot Learners. CoRR abs/2012.15723 (2020). arxiv:2012.15723https://arxiv.org/abs/2012.15723
高天宇,亚当·菲施,丹琪·陈。2020 年。提升预训练语言模型的少样本学习能力。CoRR abs/2012.15723(2020)。arxiv:2012.15723 https://arxiv.org/abs/2012.15723
[39]
Perttu Hämäläinen, Mikke Tavast, and Anton Kunnari. 2023. Evaluating Large Language Models in Generating Synthetic HCI Research Data: a Case Study. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. ACM.
Perttu Hämäläinen, Mikke Tavast, 和 Anton Kunnari. 2023. 评估大型语言模型生成合成 HCI 研究数据:案例研究. 载于 2023 年计算系统人因会议(CHI 会议)论文集. ACM.
[40]
Matthew Hausknecht, Prithviraj Ammanabrolu, Marc-Alexandre Cote, and Xinyu Yuan. 2020. Interactive Fiction Games: A Colossal Adventure. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 7903–7910. https://doi.org/10.1609/aaai.v34i05.6297
马修·豪斯克内赫特,普里特维拉杰·阿曼纳布罗卢,马克-亚历山大·科特,和袁新宇。2020 年。《互动小说游戏:一场巨大的冒险》。载于《美国人工智能协会会议论文集》,第 34 卷,7903–7910。https://doi.org/10.1609/aaai.v34i05.6297
[41]
Chris Hecker. 2011. My Liner Notes for Spore. http://chrishecker.com/My_liner_notes_for_spore
克里斯·赫克. 2011. 我对《孢子》的附注. http://chrishecker.com/My_liner_notes_for_spore
[42]
Ralf Herbrich, Tom Minka, and Thore Graepel. 2006. TrueSkill™: A Bayesian Skill Rating System. In Advances in Neural Information Processing Systems, B. Schölkopf, J. Platt, and T. Hoffman (Eds.). Vol. 19. MIT Press. https://proceedings.neurips.cc/paper_files/paper/2006/file/f44ee263952e65b3610b8ba51229d1f9-Paper.pdf
拉尔夫·赫布里希、汤姆·敏卡和托雷·格雷佩尔。2006 年。《TrueSkill™:一种贝叶斯技能评分系统》。载于《神经信息处理系统进展》,B. 施尔科普夫、J. 普拉特和 T. 霍夫曼(编辑)。第 19 卷。麻省理工学院出版社。https://proceedings.neurips.cc/paper_files/paper/2006/file/f44ee263952e65b3610b8ba51229d1f9-Paper.pdf
[43]
Douglas Hofstadter. 1995. Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought. Basic Books.
道格拉斯·霍夫施塔特. 1995. 流动概念与创造性类比:思维基本机制的计算机模型. 基本书籍.
[44]
James D. Hollan, Edwin L. Hutchins, and Louis Weitzman. 1984. STEAMER: An Interactive Inspectable Simulation-Based Training System. AI Magazine 5, 2 (1984), 23–36.
詹姆斯·D·霍兰,埃德温·L·哈钦斯,路易斯·韦茨曼。1984 年。《STEAMER:一种互动可检查的基于模拟的培训系统》。人工智能杂志 5 卷,2 期(1984),23–36。
[45]
Sture Holm. 1979. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics 6, 2 (1979), 65–70. https://doi.org/not specified
斯图尔·霍尔姆。1979 年。一个简单的顺序拒绝多重检验程序。《斯堪的纳维亚统计学杂志》6 卷,2 期(1979),65–70。https://doi.org/未指定
[46]
John J. Horton. 2023. Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?arxiv:2301.07543 [econ.GN]
约翰·J·霍顿. 2023. 大型语言模型作为模拟经济主体:我们能从硅人中学到什么?arxiv:2301.07543 [econ.GN]
[47]
Eric Horvitz. 1999. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems. 159–166.
埃里克·霍维茨. 1999. 混合主动用户界面的原则. 载于计算系统人因学会议 SIGCHI 论文集. 159–166.
[48]
Wenlong Huang, Fei Xia, Ted Xiao, Harris Chan, Jacky Liang, Pete Florence, Andy Zeng, Jonathan Tompson, Igor Mordatch, Yevgen Chebotar, Pierre Sermanet, Noah Brown, Tomas Jackson, Linda Luu, Sergey Levine, Karol Hausman, and Brian Ichter. 2022. Inner Monologue: Embodied Reasoning through Planning with Language Models. arxiv:2207.05608 [cs.RO]
黄文龙, 夏飞, 肖特德, 陈哈里斯, 梁杰基, 弗洛伦斯·皮特, 曾安迪, 汤普森·乔纳森, 莫达奇·伊戈尔, 切博塔尔·叶夫根, 塞尔曼特·皮埃尔, 布朗·诺亚, 杰克逊·托马斯, 陆琳达, 莱文·谢尔盖, 豪斯曼·卡罗尔, 和伊赫特·布赖恩. 2022. 内心独白:通过语言模型进行具身推理的规划. arxiv:2207.05608 [cs.RO]
[49]
Kristen Ibister and Clifford Nass. 2000. Consistency of personality in interactive characters: verbal cues, non-verbal cues, and user characteristics. International Journal of Human-Computer Studies 52, 1 (2000), 65–80.
克里斯滕·伊比斯特和克利福德·纳斯. 2000. 互动角色中的个性一致性:语言线索、非语言线索和用户特征. 国际人机交互研究杂志 52, 1 (2000), 65–80.
[50]
Ellen Jiang, Kristen Olson, Edwin Toh, Alejandra Molina, Aaron Donsbach, Michael Terry, and Carrie J Cai. 2022. PromptMaker: Prompt-Based Prototyping with Large Language Models. In Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI EA ’22). Association for Computing Machinery, New York, NY, USA, Article 35, 8 pages. https://doi.org/10.1145/3491101.3503564
Ellen Jiang, Kristen Olson, Edwin Toh, Alejandra Molina, Aaron Donsbach, Michael Terry, 和 Carrie J Cai. 2022. PromptMaker: 基于提示的大型语言模型原型设计. 载于 2022 年计算机系统人因会议扩展摘要 (新奥尔良, 洛杉矶, 美国) (CHI EA ’22). 计算机协会, 纽约, NY, 美国, 文章 35, 8 页. https://doi.org/10.1145/3491101.3503564
[51]
Bonnie E John and David E Kieras. 1996. The GOMS family of user interface analysis techniques: Comparison and contrast. ACM Transactions on Computer-Human Interaction (TOCHI) 3, 4 (1996), 320–351.
博尼·E·约翰和大卫·E·基拉斯。1996 年。《GOMS 用户界面分析技术家族:比较与对比》。计算机与人类互动学会会刊(TOCHI)3, 4(1996),320–351。
[52]
Randolph M Jones, John E Laird, Paul E Nielsen, Karen J Coulter, Patrick Kenny, and Frank V Koss. 1999. Automated Intelligent Pilots for Combat Flight Simulation. AI Magazine 20, 1 (1999), 27–42.
兰道夫·M·琼斯,约翰·E·莱尔德,保罗·E·尼尔森,卡伦·J·考尔特,帕特里克·肯尼,弗兰克·V·科斯。1999 年。用于战斗飞行模拟的自动智能飞行员。《人工智能杂志》20 卷,第 1 期(1999 年),27–42 页。
[53]
Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, and Matei Zaharia. 2023. Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP. arxiv:2212.14024 [cs.CL]
奥马尔·哈塔布,凯沙夫·桑坦纳姆,李香,戴维·霍尔,珀西·梁,克里斯托弗·波茨,马泰·扎哈里亚。2023。演示-搜索-预测:为知识密集型自然语言处理构建检索和语言模型。arxiv:2212.14024 [cs.CL]
[54]
Bjoern Knafla. 2011. Introduction to Behavior Trees. http://bjoernknafla.com/introduction-to-behavior-trees
比约恩·克纳夫拉. 2011. 行为树简介. http://bjoernknafla.com/introduction-to-behavior-trees
[55]
Ranjay Krishna, Donsuk Lee, Li Fei-Fei, and Michael S. Bernstein. 2022. Socially situated artificial intelligence enables learning from human interaction. Proceedings of the National Academy of Sciences 119, 39 (2022), e2115730119. https://doi.org/10.1073/pnas.2115730119 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2115730119
Ranjay Krishna, Donsuk Lee, Li Fei-Fei, 和 Michael S. Bernstein. 2022. 社会情境下的人工智能促进了从人类互动中学习. 美国国家科学院院刊 119, 39 (2022), e2115730119. https://doi.org/10.1073/pnas.2115730119 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2115730119
[56]
William H Kruskal and WA Wallis. 1952. Use of ranks in one-criterion variance analysis. J. Amer. Statist. Assoc. 47, 260 (1952), 583–621. https://doi.org/10.1080/01621459.1952.10483441
威廉·H·克鲁斯卡尔和 W·A·沃利斯。1952 年。单一标准方差分析中的秩的使用。《美国统计协会杂志》47 卷,260 期(1952 年),583–621。https://doi.org/10.1080/01621459.1952.10483441
[57]
Phaser Labs. 2023. Welcome to Phaser 3. https://phaser.io/phaser3. Accessed on: 2023-04-03.
Phaser Labs. 2023. 欢迎来到 Phaser 3. https://phaser.io/phaser3. 访问日期:2023-04-03.
[58]
John Laird. 2001. It Knows What You’re Going To Do: Adding Anticipation to a Quakebot. In Proceedings of the 2001 Workshop on Intelligent Cinematography and Editing. 63–69.
约翰·莱尔德. 2001. 它知道你将要做什么:为地震机器人添加预期. 载于 2001 年智能电影摄影与编辑研讨会论文集. 63–69.
[59]
John Laird and Michael VanLent. 2001. Human-Level AI’s Killer Application: Interactive Computer Games. AI Magazine 22, 2 (2001), 15. https://doi.org/10.1609/aimag.v22i2.1558
约翰·莱尔德和迈克尔·范伦特。2001 年。《人类水平人工智能的杀手级应用:互动计算机游戏》。人工智能杂志 22, 2 (2001), 15. https://doi.org/10.1609/aimag.v22i2.1558
[60]
John E. Laird. 2000. It Knows What You’re Going To Do: Adding Anticipation to a QUAKEBOT. In Papers from the AAAI 2000 Spring Symposium on Artificial Intelligence and Interactive Entertainment(Technical Report SS-00-02). AAAI Press, 41–50.
约翰·E·莱尔德. 2000. 它知道你将要做什么:为 QUAKEBOT 添加预期。在 2000 年春季人工智能与互动娱乐 AAAI 研讨会论文集(技术报告 SS-00-02)。AAAI 出版社, 41–50.
[61]
John E. Laird. 2012. The Soar Cognitive Architecture. MIT Press.
约翰·E·莱尔德. 2012. 《Soar 认知架构》。麻省理工学院出版社。
[62]
John E. Laird, Christian Lebiere, and Paul S. Rosenbloom. 2017. A Standard Model of the Mind: Toward a Common Computational Framework across Artificial Intelligence, Cognitive Science, Neuroscience, and Robotics. AI Magazine 38, 1 (2017), 13–26.
约翰·E·莱尔德,克里斯蒂安·勒比埃尔,保罗·S·罗森布鲁姆。2017 年。《心智的标准模型:朝着人工智能、认知科学、神经科学和机器人技术的共同计算框架迈进》。人工智能杂志 38, 1 (2017), 13–26。
[63]
Michelle S Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A Landay, and Michael S Bernstein. 2023. Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.
米歇尔·S·拉姆,马子贤,安妮·李,伊泽基尔·弗雷塔斯,王大阔,詹姆斯·A·兰代,和迈克尔·S·伯恩斯坦。2023 年。模型草图:早期机器学习模型设计中的中心概念。《计算机系统人因因素 SIGCHI 会议论文集》。
[64]
Pat Langley, Dongkyu Choi, and Seth Rogers. 2005. Interleaving Learning, Problem Solving, and Execution in the Icarus Architecture. Technical Report. Stanford University, Center for the Study of Language and Information.
帕特·兰格利、崔东奎和塞斯·罗杰斯。2005 年。《在伊卡鲁斯架构中交错学习、问题解决和执行》。技术报告。斯坦福大学语言与信息研究中心。
[65]
Jason Linder, Gierad Laput, Mira Dontcheva, Gregg Wilensky, Walter Chang, Aseem Agarwala, and Eytan Adar. 2013. PixelTone: A Multimodal Interface for Image Editing. In CHI ’13 Extended Abstracts on Human Factors in Computing Systems (Paris, France) (CHI EA ’13). Association for Computing Machinery, New York, NY, USA, 2829–2830. https://doi.org/10.1145/2468356.2479533
杰森·林德,吉拉德·拉普特,米拉·东切娃,格雷格·威伦斯基,沃尔特·张,阿西姆·阿加瓦拉,和埃坦·阿达尔。2013 年。PixelTone:一种用于图像编辑的多模态界面。在 CHI '13 计算系统人因扩展摘要(法国巴黎)(CHI EA '13)。计算机协会,纽约,纽约,美国,2829–2830。https://doi.org/10.1145/2468356.2479533
[66]
Jiachang Liu, Dinghan Shen, Yizhe Zhang, Bill Dolan, Lawrence Carin, and Weizhu Chen. 2021. What Makes Good In-Context Examples for GPT-3?CoRR abs/2101.06804 (2021). arxiv:2101.06804https://arxiv.org/abs/2101.06804
刘家常,沈丁汉,张逸哲,比尔·多兰,劳伦斯·卡林,陈伟柱。2021 年。什么样的上下文示例对 GPT-3 有效?CoRR abs/2101.06804(2021)。arxiv:2101.06804 https://arxiv.org/abs/2101.06804
[67]
Vivian Liu, Han Qiao, and Lydia Chilton. 2022. Opal: Multimodal Image Generation for News Illustration. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology. 1–17.
刘薇薇,乔汉,和莉迪亚·奇尔顿。2022 年。《欧泊:用于新闻插图的多模态图像生成》。载于第 35 届年度 ACM 用户界面软件与技术研讨会论文集。1–17。
[68]
Pattie Maes. 1995. Artificial Life Meets Entertainment: Lifelike Autonomous Agents. Commun. ACM 38, 11 (nov 1995), 108–114. https://doi.org/10.1145/219717.219808
帕蒂·梅斯。1995 年。《人工生命与娱乐相遇:类人自主代理》。计算机协会通讯 38, 11 (1995 年 11 月), 108–114. https://doi.org/10.1145/219717.219808
[69]
Josh McCoy, Michael Mateas, and Noah Wardrip-Fruin. 2009. Comme il Faut: A System for Simulating Social Games Between Autonomous Characters. In Proceedings of the 7th International Conference on Digital Arts and Culture. 87–94.
乔什·麦考伊,迈克尔·马提亚斯,诺亚·沃德里普-弗鲁因。2009 年。《Comme il Faut:一个模拟自主角色之间社交游戏的系统》。载于第七届国际数字艺术与文化会议论文集。87–94。
[70]
Josh McCoy, Mike Treanor, Ben Samuel, Michael Mateas, and Noah Wardrip-Fruin. 2011. Prom Week: Social Physics as Gameplay. In Proceedings of the 6th International Conference on Foundations of Digital Games (FDG’11). ACM, Bordeaux, France, 70–77. https://doi.org/10.1145/2159365.2159377
乔什·麦考伊,迈克·特雷纳,本·塞缪尔,迈克尔·马提亚斯,诺亚·沃德里普-弗鲁因。2011 年。《舞会周:作为游戏的社会物理学》。载于第六届国际数字游戏基础会议论文集(FDG’11)。ACM,法国波尔多,70–77。https://doi.org/10.1145/2159365.2159377
[71]
Josh McCoy, Mike Treanor, Ben Samuel, Anna Reed, Michael Mateas, and Noah Wardrip-Fruin. 2012. Prom Week. In Proceedings of the 7th International Conference on Foundations of Digital Games (FDG’12). ACM, Raleigh, NC, USA, 1–8. https://doi.org/10.1145/2282338.2282340
乔什·麦考伊,迈克·特雷诺,本·塞缪尔,安娜·里德,迈克尔·马提亚斯,诺亚·沃德里普-弗鲁因。2012 年。《舞会周》。载于第七届国际数字游戏基础会议论文集(FDG’12)。ACM,北卡罗来纳州罗利,美国,1–8。https://doi.org/10.1145/2282338.2282340
[72]
Josh McCoy, Mike Treanor, Ben Samuel, Noah Wardrip-Fruin, and Michael Mateas. 2011. Comme il faut: A System for Authoring Playable Social Models. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE’11). AAAI, Stanford, CA, USA, 38–43.
乔什·麦考伊,迈克·特雷诺,本·塞缪尔,诺亚·沃德里普-弗鲁因,迈克尔·马提亚斯。2011 年。《如同应有的:一个用于创作可玩社交模型的系统》。载于《美国人工智能协会人工智能与互动数字娱乐会议论文集》(AIIDE’11)。美国人工智能协会,斯坦福,加利福尼亚州,美国,38–43。
[73]
Marvin Minsky and Seymour Papert. 1970. Draft of a proposal to ARPA for research on artificial intelligence at MIT, 1970–71.
马文·明斯基和西摩·帕普特。1970 年。向 ARPA 提交的关于麻省理工学院人工智能研究的提案草稿,1970-71 年。
[74]
Shohei Miyashita, Xinyu Lian, Xiao Zeng, Takashi Matsubara, and Kuniaki Uehara. 2017. Developing Game AI Agent Behaving Like Human by Mixing Reinforcement Learning and Supervised Learning. In Proceedings of the 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD). Kanazawa, Japan, 153–158. https://doi.org/10.1109/SNPD.2017.8023884
宫下翔平,连新宇,曾晓,松原隆,植原邦明。2017。通过混合强化学习和监督学习开发像人类一样行为的游戏 AI 代理。在第 18 届 IEEE/ACIS 国际软件工程、人工智能、网络及并行/分布式计算会议(SNPD)论文集中。日本金泽,153–158。https://doi.org/10.1109/SNPD.2017.8023884
[75]
Alexander Nareyek. 2007. Game AI is dead. Long live game AI!IEEE Intelligent Systems 22, 1 (2007), 9–11.
亚历山大·纳雷耶克. 2007. 游戏人工智能已死。游戏人工智能万岁!IEEE 智能系统 22, 1 (2007), 9–11.
[76]
Allen Newell. 1990. Unified Theories of Cognition. Harvard University Press, Cambridge, Massachusetts.
艾伦·纽厄尔. 1990. 统一的认知理论. 哈佛大学出版社, 马萨诸塞州剑桥.
[77]
OpenAI. 2022. Introducing ChatGPT. https://openai.com/blog/chatgpt. Accessed on: 2023-04-03.
OpenAI. 2022. 介绍 ChatGPT. https://openai.com/blog/chatgpt. 访问日期:2023-04-03.
[78]
Kyle Orland. 2021. So what is ’the metaverse’, exactly?Ars Technica (7 November 2021). arxiv:2111.04169https://arstechnica.com/gaming/2021/11/so-what-is-the-metaverse-exactly/
凯尔·奥兰德. 2021. 那么“元宇宙”究竟是什么?阿斯技术(2021 年 11 月 7 日)。arxiv:2111.04169 https://arstechnica.com/gaming/2021/11/so-what-is-the-metaverse-exactly/
[79]
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. arxiv:2203.02155 [cs.CL]
欧阳龙,杰夫·吴,徐江,迪奥戈·阿尔梅达,卡罗尔·L·温赖特,帕梅拉·米什金,张冲,桑迪尼·阿加瓦尔,卡塔里娜·斯拉马,亚历克斯·雷,约翰·舒尔曼,雅各布·希尔顿,弗雷泽·凯尔顿,卢克·米勒,马迪·西门斯,阿曼达·阿斯克尔,彼得·韦林德,保罗·克里斯蒂亚诺,简·莱克,瑞安·洛。2022 年。通过人类反馈训练语言模型以遵循指令。arxiv:2203.02155 [cs.CL]
[80]
Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. 2022. Social Simulacra: Creating Populated Prototypes for Social Computing Systems. In In the 35th Annual ACM Symposium on User Interface Software and Technology (UIST ’22) (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3526113.3545616
Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, 和 Michael S. Bernstein. 2022. 社会模拟物:为社会计算系统创建人口原型. 载于第 35 届年度 ACM 用户界面软件与技术研讨会(UIST ’22)(美国俄勒冈州本德)(UIST ’22)。计算机协会,纽约,纽约,美国。https://doi.org/10.1145/3526113.3545616
[81]
Richard W. Pew and Ann S. Mavor (Eds.). 1998. Modeling Human and Organizational Behavior: Applications to Military Simulations. National Academy Press, Washington, D.C.
理查德·W·皮尤和安·S·马沃(编辑)。1998 年。《人类和组织行为建模:对军事模拟的应用》。国家科学院出版社,华盛顿特区。
[82]
Roberto Pillosu. 2009. Coordinating Agents with Behavior Trees: Synchronizing Multiple Agents in CryEngine 2. https://aiarchitect.wordpress.com/2009/10/19/coordinating-agents-with-behavior-trees-synchronizing-multiple-agents-in-cryengine-2/
罗伯托·皮洛苏。2009 年。使用行为树协调代理:在 CryEngine 2 中同步多个代理。https://aiarchitect.wordpress.com/2009/10/19/coordinating-agents-with-behavior-trees-synchronizing-multiple-agents-in-cryengine-2/
[83]
Prolific. 2022. Prolific: Quickly Find Research Participants You Can Trust. https://www.prolific.co/
Prolific. 2022. Prolific: 快速找到您可以信任的研究参与者。https://www.prolific.co/
[84]
Byron Reeves and Clifford Nass. 1996. The media equation: How people treat computers, television, and new media like real people and places. Cambridge University Press.
拜伦·里夫斯和克利福德·纳斯。1996 年。《媒体方程:人们如何将计算机、电视和新媒体视为真实的人和地方》。剑桥大学出版社。
[85]
Mark O. Riedl. 2012. Interactive narrative: A novel application of artificial intelligence for computer games. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI’12). 2160–2165.
马克·O·里德尔. 2012. 互动叙事:人工智能在计算机游戏中的新应用. 载于第二十六届人工智能协会会议论文集 (AAAI’12). 2160–2165.
[86]
Mark O. Riedl and R. Michael Young. 2005. An Objective Character Believability Evaluation Procedure for Multi-Agent Story Generation Systems. In Proceedings of the 5th International Working Conference on Intelligent Virtual Agents (IVA’05). Kos, Greece, 58–70. https://doi.org/10.1007/11550617_5
马克·O·里德尔和 R·迈克尔·杨。2005 年。用于多智能体故事生成系统的客观角色可信度评估程序。在第五届国际智能虚拟代理工作会议(IVA’05)论文集中。希腊科斯,58–70。https://doi.org/10.1007/11550617_5
[87]
David Rolf. 2015. The Fight for $15: The Right Wage for a Working America. The New Press.
大卫·罗尔夫. 2015. 《为 15 美元而战:为美国工人争取合理工资》。新出版社。
[88]
Xin Rong, Shiyan Yan, Stephen Oney, Mira Dontcheva, and Eytan Adar. 2016. Codemend: Assisting interactive programming with bimodal embedding. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology. 247–258.
辛荣,严诗燕,斯蒂芬·奥尼,米拉·东切瓦,和埃坦·阿达尔。2016 年。Codemend:通过双模嵌入辅助交互式编程。在第 29 届用户界面软件与技术年会论文集中。247–258。
[89]
Ben Shneiderman. 2022. Human-centered AI. Oxford University Press.
[90]
Ben Shneiderman and Pattie Maes. 1997. Direct manipulation vs. interface agents. interactions 4, 6 (1997), 42–61.
[91]
Ho Chit Siu, Jaime Peña, Edenna Chen, Yutai Zhou, Victor Lopez, Kyle Palko, Kimberlee Chang, and Ross Allen. 2021. Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi. In Advances in Neural Information Processing Systems, M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan (Eds.). Vol. 34. Curran Associates, Inc., 16183–16195. https://proceedings.neurips.cc/paper_files/paper/2021/file/86e8f7ab32cfd12577bc2619bc635690-Paper.pdf
[92]
Taylor Sorensen, Joshua Robinson, Christopher Rytting, Alexander Shaw, Kyle Rogers, Alexia Delorey, Mahmoud Khalil, Nancy Fulda, and David Wingate. 2022. An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.acl-long.60
[93]
William Swartout, Jonathan Gratch, Randall Hill, Eduard Hovy, Stacy Marsella, Jeff Rickel, and David Traum. 2006. Toward virtual humans. AI Magazine 27, 1 (2006).
[94]
Milind Tambe, W Lewis Johnson, Randolph M Jones, Frank Koss, John E Laird, Paul S Rosenbloom, and Karl Schwamb. 1995. Intelligent agents for interactive simulation environments. AI Magazine 16, 1 (1995), 15.
[95]
David R. Thomas. 2006. A General Inductive Approach for Analyzing Qualitative Evaluation Data. American Journal of Evaluation 27, 2 (2006), 237–246. https://doi.org/10.1177/1098214005283748
[96]
Frank Thomas and Ollie Johnston. 1981. Disney Animation: The Illusion of Life. Abbeville Press, New York.
[97]
Ilshat Umarov, Mikhail Mozgovoy, and Patrick C. Rogers. 2012. Believable and Effective AI Agents in Virtual Worlds: Current State and Future Perspectives. International Journal of Gaming and Computer-Mediated Simulations 4, 2 (2012), 37–59.
[98]
Graham Upton and Ian Cook. 2006. A Dictionary of Statistics (2 ed.). Oxford University Press, Oxford, United Kingdom.
[99]
Oriol Vinyals, Igor Babuschkin, Wojciech M. Czarnecki, et al. 2019. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575 (2019), 350–354. https://doi.org/10.1038/s41586-019-1724-z
[100]
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. 2023. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arxiv:2201.11903 [cs.CL]
[101]
Mark Weiser. 1991. The computer for the 21st century. Scientific American 265, 3 (1991), 94–104. https://doi.org/10.1038/scientificamerican0991-94
[102]
Joseph Weizenbaum. 1966. ELIZA—a computer program for the study of natural language communication between man and machine. Commun. ACM 9, 1 (1966), 36–45.
[103]
Terry Winograd. 1971. Procedures as a Representation for Data in a Computer Program for Understanding Natural Language. (1971).
[104]
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, and Paul Christiano. 2021. Recursively Summarizing Books with Human Feedback. arxiv:2109.10862 [cs.CL]
[105]
Tongshuang Wu, Ellen Jiang, Aaron Donsbach, Jeff Gray, Alejandra Molina, Michael Terry, and Carrie J Cai. 2022. PromptChainer: Chaining Large Language Model Prompts through Visual Programming. In CHI EA ’22: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems.
[106]
Tongshuang Wu, Michael Terry, and Carrie J Cai. 2022. AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts. In CHI ’22: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems.
[107]
Qian Yang, Aaron Steinfeld, Carolyn Rosé, and John Zimmerman. 2020. Re-examining whether, why, and how human-AI interaction is uniquely difficult to design. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1–13.
[108]
Georgios N. Yannakakis. 2012. Game AI revisited. In Proceedings of the 9th Conference on Computing Frontiers. ACM, Cagliari, Italy, 285–292. https://doi.org/10.1145/2212908.2212950
[109]
Robert Zubek. 2002. Towards implementation of social interaction. In AAAI Spring Symposium on Artificial Intelligence and Interactive Entertainment. AAAI Press. https://www.aaai.org/Papers/Symposia/Spring/2002/SS-02-01/SS02-01-003.pdf

Information & Contributors

Information

Published In

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
October 2023
1825 pages
ISBN: 9798400701320
DOI: 10.1145/3586183
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 October 2023

Badges

  • Best Paper

Author Tags

  1. Human-AI interaction
  2. agents
  3. generative AI
  4. large language models

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

UIST '23

Acceptance Rates

Overall Acceptance Rate 561 of 2,567 submissions, 22%

Article Metrics

  • Downloads (Last 12 months): 21,707
  • Downloads (Last 6 weeks): 3,809
Reflects downloads up to 16 Oct 2024
References

References

[1]
Gavin Abercrombie, Amanda Cercas Curry, Tanvi Dinkar, and Zeerak Talat. 2023. Mirages: On Anthropomorphism in Dialogue Systems. arxiv:2305.09800 [cs.CL]
[2]
Robert Ackland, Jamsheed Shorish, Paul Thomas, and Lexing Xie. 2013. How dense is a network?http://users.cecs.anu.edu.au/ xlx/teaching/css2013/network-density.html.
[3]
Eytan Adar, Mira Dontcheva, and Gierad Laput. 2014. CommandSpace: Modeling the Relationships between Tasks, Descriptions and Features. In Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology (Honolulu, Hawaii, USA) (UIST ’14). Association for Computing Machinery, New York, NY, USA, 167–176. https://doi.org/10.1145/2642918.2647395
[4]
Saleema Amershi, Maya Cakmak, William Bradley Knox, and Todd Kulesza. 2014. Power to the people: The role of humans in interactive machine learning. AI Magazine 35, 4 (2014), 105–120.
[5]
Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N Bennett, Kori Inkpen, 2019. Guidelines for human-AI interaction. In Proceedings of the 2019 chi conference on human factors in computing systems. 1–13.
[6]
John R. Anderson. 1993. Rules of the Mind. Lawrence Erlbaum Associates, Hillsdale, NJ.
[7]
Electronic Arts. 2009. The Sims 3. Video game.
[8]
Ruth Aylett. 1999. Narrative in virtual environments—towards emergent narrative. In Narrative Intelligence: Papers from the AAAI Fall Symposium (Technical Report FS-99-01). AAAI Press, 83–86.
[9]
Christoph Bartneck and Jodi Forlizzi. 2004. A design-centered framework for social human-robot interaction. In Proceedings of the 13th IEEE International Workshop on Robot and Human Interactive Communication (RO-MAN’04). 591–594. https://doi.org/10.1109/ROMAN.2004.1374827
[10]
Joseph Bates. 1994. The Role of Emotion in Believable Agents. Commun. ACM 37, 7 (1994), 122–125. https://doi.org/10.1145/176789.176803
[11]
Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique P. d.O. Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, and Susan Zhang. 2019. Dota 2 with Large Scale Deep Reinforcement Learning. arXiv preprint arXiv:1912.06680 (2019).
[12]
Marcel Binz and Eric Schulz. 2023. Using cognitive psychology to understand GPT-3. Proceedings of the National Academy of Sciences 120, 6 (2023), e2218523120.
[13]
BioWare. 2007. Mass Effect. Video game.
[14]
Woody Bledsoe. 1986. I had a dream: AAAI presidential address. AI Magazine 7, 1 (1986), 57–61.
[15]
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, and et al.2022. On the Opportunities and Risks of Foundation Models. arxiv:2108.07258 [cs.LG]
[16]
Michael Brenner. 2010. Creating dynamic story plots with continual multiagent planning. In Proceedings of the 24th AAAI Conference on Artificial Intelligence.
[17]
Rodney A. Brooks, Cynthia Breazeal, Marko Marjanovic, Brian Scassellati, and Matthew Williamson. 2000. The Cog Project: Building a Humanoid Robot. In Computation for Metaphors, Analogy, and Agents(Lecture Notes on Artificial Intelligence, 1562), Chrystopher Nehaniv (Ed.). Springer-Verlag, Berlin, 52–87.
[18]
Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. arxiv:2005.14165 [cs.CL]
[19]
Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, 2023. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712 (2023).
[20]
Robin Burkinshaw. 2009. Alice and Kev: The Story of Being Homeless in The Sims 3.
[21]
Chris Callison-Burch, Gaurav Singh Tomar, Lara Martin, Daphne Ippolito, Suma Bailis, and David Reitter. 2022. Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 9379–9393. https://aclanthology.org/2022.emnlp-main.637
[22]
Stuart K Card, Thomas P Moran, and Allen Newell. 1980. The keystroke-level model for user performance time with interactive systems. Commun. ACM 23, 7 (1980), 396–410. https://doi.org/10.1145/358886.358895 arXiv:https://doi.org/10.1145/358886.358895
[23]
Stuart K Card, Thomas P Moran, and Alan Newell. 1983. The psychology of human-computer interaction. (1983).
[24]
Alex Champandard. 2012. Tutorial presentation. In IEEE Conference on Computational Intelligence and Games.
[25]
Dong kyu Choi, Tolga Konik, Negin Nejati, Chunki Park, and Pat Langley. 2021. A Believable Agent for First-Person Shooter Games. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Vol. 3. 71–73.
[26]
Anind K Dey. 2001. Understanding and using context. Personal and ubiquitous computing 5 (2001), 4–7.
[27]
Kevin Dill and L Martin. 2011. A Game AI Approach to Autonomous Control of Virtual Characters. In Proceedings of the Interservice/Industry Training, Simulation, and Education Conference (I/ITSEC’11). Orlando, FL, USA.
[28]
David Easley and Jon Kleinberg. 2010. Networks, crowds, and markets: Reasoning about a highly connected world. Cambridge university press.
[29]
Arpad E Elo. 1967. The Proposed USCF Rating System, Its Development, Theory, and Applications. Chess Life XXII, 8 (August 1967), 242–247.
[30]
Jerry Alan Fails and Dan R Olsen Jr. 2003. Interactive machine learning. In Proceedings of the 8th international conference on Intelligent user interfaces. ACM, 39–45.
[31]
Ethan Fast, William McGrath, Pranav Rajpurkar, and Michael S Bernstein. 2016. Augur: Mining human behaviors from fiction to power interactive systems. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. 237–247.
[32]
Rebecca Fiebrink and Perry R Cook. 2010. The Wekinator: a system for real-time, interactive machine learning in music. In Proceedings of The Eleventh International Society for Music Information Retrieval Conference (ISMIR 2010)(Utrecht), Vol. 3. Citeseer, 2–1.
[33]
Uwe Flick. 2009. An Introduction to Qualitative Research. SAGE.
[34]
James Fogarty, Desney Tan, Ashish Kapoor, and Simon Winder. 2008. CueFlik: Interactive Concept Learning in Image Search. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Florence, Italy) (CHI ’08). Association for Computing Machinery, New York, NY, USA, 29–38. https://doi.org/10.1145/1357054.1357061
[35]
Adam Fourney, Richard Mann, and Michael Terry. 2011. Query-feature graphs: bridging user vocabulary and system functionality. In Proceedings of the ACM Symposium on User Interface Software and Technology (UIST) (Santa Barbara, California, USA). ACM.
[36]
Tom Francis. 2010. The Minecraft Experiment, day 1: Chasing Waterfalls. http://www.pcgamer.com/2010/11/20/the-minecraft-experiment-day-1-chasing-waterfalls/
[37]
Jonas Freiknecht and Wolfgang Effelsberg. 2020. Procedural Generation of Interactive Stories using Language Models. In International Conference on the Foundations of Digital Games (FDG ’20). ACM, Bugibba, Malta, 8. https://doi.org/10.1145/3402942.3409599
[38]
Tianyu Gao, Adam Fisch, and Danqi Chen. 2020. Making Pre-trained Language Models Better Few-shot Learners. CoRR abs/2012.15723 (2020). arxiv:2012.15723https://arxiv.org/abs/2012.15723
[39]
Perttu Hämäläinen, Mikke Tavast, and Anton Kunnari. 2023. Evaluating Large Language Models in Generating Synthetic HCI Research Data: a Case Study. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. ACM.
[40]
Matthew Hausknecht, Prithviraj Ammanabrolu, Marc-Alexandre Cote, and Xinyu Yuan. 2020. Interactive Fiction Games: A Colossal Adventure. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 7903–7910. https://doi.org/10.1609/aaai.v34i05.6297
[41]
Chris Hecker. 2011. My Liner Notes for Spore. http://chrishecker.com/My_liner_notes_for_spore
[42]
Ralf Herbrich, Tom Minka, and Thore Graepel. 2006. TrueSkill™: A Bayesian Skill Rating System. In Advances in Neural Information Processing Systems, B. Schölkopf, J. Platt, and T. Hoffman (Eds.). Vol. 19. MIT Press. https://proceedings.neurips.cc/paper_files/paper/2006/file/f44ee263952e65b3610b8ba51229d1f9-Paper.pdf
[43]
Douglas Hofstadter. 1995. Fluid concepts and creative analogies: computer models of the fundamental mechanisms of thought. Basic Books.
[44]
James D. Hollan, Edwin L. Hutchins, and Louis Weitzman. 1984. STEAMER: An Interactive Inspectable Simulation-Based Training System. AI Magazine 5, 2 (1984), 23–36.
[45]
Sture Holm. 1979. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics 6, 2 (1979), 65–70. https://doi.org/not specified
[46]
John J. Horton. 2023. Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?arxiv:2301.07543 [econ.GN]
[47]
Eric Horvitz. 1999. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems. 159–166.
[48]
Wenlong Huang, Fei Xia, Ted Xiao, Harris Chan, Jacky Liang, Pete Florence, Andy Zeng, Jonathan Tompson, Igor Mordatch, Yevgen Chebotar, Pierre Sermanet, Noah Brown, Tomas Jackson, Linda Luu, Sergey Levine, Karol Hausman, and Brian Ichter. 2022. Inner Monologue: Embodied Reasoning through Planning with Language Models. arxiv:2207.05608 [cs.RO]
[49]
Kristen Ibister and Clifford Nass. 2000. Consistency of personality in interactive characters: verbal cues, non-verbal cues, and user characteristics. International Journal of Human-Computer Studies 52, 1 (2000), 65–80.
[50]
Ellen Jiang, Kristen Olson, Edwin Toh, Alejandra Molina, Aaron Donsbach, Michael Terry, and Carrie J Cai. 2022. PromptMaker: Prompt-Based Prototyping with Large Language Models. In Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI EA ’22). Association for Computing Machinery, New York, NY, USA, Article 35, 8 pages. https://doi.org/10.1145/3491101.3503564
[51]
Bonnie E John and David E Kieras. 1996. The GOMS family of user interface analysis techniques: Comparison and contrast. ACM Transactions on Computer-Human Interaction (TOCHI) 3, 4 (1996), 320–351.
[52]
Randolph M Jones, John E Laird, Paul E Nielsen, Karen J Coulter, Patrick Kenny, and Frank V Koss. 1999. Automated Intelligent Pilots for Combat Flight Simulation. AI Magazine 20, 1 (1999), 27–42.
[53]
Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, and Matei Zaharia. 2023. Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP. arxiv:2212.14024 [cs.CL]
[54]
Bjoern Knafla. 2011. Introduction to Behavior Trees. http://bjoernknafla.com/introduction-to-behavior-trees
[55]
Ranjay Krishna, Donsuk Lee, Li Fei-Fei, and Michael S. Bernstein. 2022. Socially situated artificial intelligence enables learning from human interaction. Proceedings of the National Academy of Sciences 119, 39 (2022), e2115730119. https://doi.org/10.1073/pnas.2115730119 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2115730119
[56]
William H Kruskal and WA Wallis. 1952. Use of ranks in one-criterion variance analysis. J. Amer. Statist. Assoc. 47, 260 (1952), 583–621. https://doi.org/10.1080/01621459.1952.10483441
[57]
Phaser Labs. 2023. Welcome to Phaser 3. https://phaser.io/phaser3. Accessed on: 2023-04-03.
[58]
John Laird. 2001. It Knows What You’re Going To Do: Adding Anticipation to a Quakebot. In Proceedings of the 2001 Workshop on Intelligent Cinematography and Editing. 63–69.
[59]
John Laird and Michael VanLent. 2001. Human-Level AI’s Killer Application: Interactive Computer Games. AI Magazine 22, 2 (2001), 15. https://doi.org/10.1609/aimag.v22i2.1558
[60]
John E. Laird. 2000. It Knows What You’re Going To Do: Adding Anticipation to a QUAKEBOT. In Papers from the AAAI 2000 Spring Symposium on Artificial Intelligence and Interactive Entertainment(Technical Report SS-00-02). AAAI Press, 41–50.
[61]
John E. Laird. 2012. The Soar Cognitive Architecture. MIT Press.
[62]
John E. Laird, Christian Lebiere, and Paul S. Rosenbloom. 2017. A Standard Model of the Mind: Toward a Common Computational Framework across Artificial Intelligence, Cognitive Science, Neuroscience, and Robotics. AI Magazine 38, 1 (2017), 13–26.
[63]
Michelle S Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A Landay, and Michael S Bernstein. 2023. Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.
[64]
Pat Langley, Dongkyu Choi, and Seth Rogers. 2005. Interleaving Learning, Problem Solving, and Execution in the Icarus Architecture. Technical Report. Stanford University, Center for the Study of Language and Information.
[65]
Jason Linder, Gierad Laput, Mira Dontcheva, Gregg Wilensky, Walter Chang, Aseem Agarwala, and Eytan Adar. 2013. PixelTone: A Multimodal Interface for Image Editing. In CHI ’13 Extended Abstracts on Human Factors in Computing Systems (Paris, France) (CHI EA ’13). Association for Computing Machinery, New York, NY, USA, 2829–2830. https://doi.org/10.1145/2468356.2479533
[66]
Jiachang Liu, Dinghan Shen, Yizhe Zhang, Bill Dolan, Lawrence Carin, and Weizhu Chen. 2021. What Makes Good In-Context Examples for GPT-3?CoRR abs/2101.06804 (2021). arxiv:2101.06804https://arxiv.org/abs/2101.06804
[67]
Vivian Liu, Han Qiao, and Lydia Chilton. 2022. Opal: Multimodal Image Generation for News Illustration. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology. 1–17.
[68]
Pattie Maes. 1995. Artificial Life Meets Entertainment: Lifelike Autonomous Agents. Commun. ACM 38, 11 (nov 1995), 108–114. https://doi.org/10.1145/219717.219808
[69]
Josh McCoy, Michael Mateas, and Noah Wardrip-Fruin. 2009. Comme il Faut: A System for Simulating Social Games Between Autonomous Characters. In Proceedings of the 7th International Conference on Digital Arts and Culture. 87–94.
[70]
Josh McCoy, Mike Treanor, Ben Samuel, Michael Mateas, and Noah Wardrip-Fruin. 2011. Prom Week: Social Physics as Gameplay. In Proceedings of the 6th International Conference on Foundations of Digital Games (FDG’11). ACM, Bordeaux, France, 70–77. https://doi.org/10.1145/2159365.2159377
[71]
Josh McCoy, Mike Treanor, Ben Samuel, Anna Reed, Michael Mateas, and Noah Wardrip-Fruin. 2012. Prom Week. In Proceedings of the 7th International Conference on Foundations of Digital Games (FDG’12). ACM, Raleigh, NC, USA, 1–8. https://doi.org/10.1145/2282338.2282340
[72]
Josh McCoy, Mike Treanor, Ben Samuel, Noah Wardrip-Fruin, and Michael Mateas. 2011. Comme il faut: A System for Authoring Playable Social Models. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE’11). AAAI, Stanford, CA, USA, 38–43.
[73]
Marvin Minsky and Seymour Papert. 1970. Draft of a proposal to ARPA for research on artificial intelligence at MIT, 1970–71.
[74]
Shohei Miyashita, Xinyu Lian, Xiao Zeng, Takashi Matsubara, and Kuniaki Uehara. 2017. Developing Game AI Agent Behaving Like Human by Mixing Reinforcement Learning and Supervised Learning. In Proceedings of the 18th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD). Kanazawa, Japan, 153–158. https://doi.org/10.1109/SNPD.2017.8023884
[75]
Alexander Nareyek. 2007. Game AI is dead. Long live game AI!IEEE Intelligent Systems 22, 1 (2007), 9–11.
[76]
Allen Newell. 1990. Unified Theories of Cognition. Harvard University Press, Cambridge, Massachusetts.
[77]
OpenAI. 2022. Introducing ChatGPT. https://openai.com/blog/chatgpt. Accessed on: 2023-04-03.
[78]
Kyle Orland. 2021. So what is ’the metaverse’, exactly?Ars Technica (7 November 2021). arxiv:2111.04169https://arstechnica.com/gaming/2021/11/so-what-is-the-metaverse-exactly/
[79]
Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul Christiano, Jan Leike, and Ryan Lowe. 2022. Training language models to follow instructions with human feedback. arxiv:2203.02155 [cs.CL]
[80]
Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, and Michael S. Bernstein. 2022. Social Simulacra: Creating Populated Prototypes for Social Computing Systems. In In the 35th Annual ACM Symposium on User Interface Software and Technology (UIST ’22) (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3526113.3545616
[81]
Richard W. Pew and Ann S. Mavor (Eds.). 1998. Modeling Human and Organizational Behavior: Applications to Military Simulations. National Academy Press, Washington, D.C.
[82]
Roberto Pillosu. 2009. Coordinating Agents with Behavior Trees: Synchronizing Multiple Agents in CryEngine 2. https://aiarchitect.wordpress.com/2009/10/19/coordinating-agents-with-behavior-trees-synchronizing-multiple-agents-in-cryengine-2/
[83]
Prolific. 2022. Prolific: Quickly Find Research Participants You Can Trust. https://www.prolific.co/
[84]
Byron Reeves and Clifford Nass. 1996. The media equation: How people treat computers, television, and new media like real people and places. Cambridge University Press.
[85]
Mark O. Riedl. 2012. Interactive narrative: A novel application of artificial intelligence for computer games. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI’12). 2160–2165.
[86]
Mark O. Riedl and R. Michael Young. 2005. An Objective Character Believability Evaluation Procedure for Multi-Agent Story Generation Systems. In Proceedings of the 5th International Working Conference on Intelligent Virtual Agents (IVA’05). Kos, Greece, 58–70. https://doi.org/10.1007/11550617_5
[87]
David Rolf. 2015. The Fight for $15: The Right Wage for a Working America. The New Press.
[88]
Xin Rong, Shiyan Yan, Stephen Oney, Mira Dontcheva, and Eytan Adar. 2016. Codemend: Assisting interactive programming with bimodal embedding. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology. 247–258.
[89]
Ben Shneiderman. 2022. Human-centered AI. Oxford University Press.
[90]
Ben Shneiderman and Pattie Maes. 1997. Direct manipulation vs. interface agents. interactions 4, 6 (1997), 42–61.
[91]
Ho Chit Siu, Jaime Peña, Edenna Chen, Yutai Zhou, Victor Lopez, Kyle Palko, Kimberlee Chang, and Ross Allen. 2021. Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi. In Advances in Neural Information Processing Systems, M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, and J. Wortman Vaughan (Eds.). Vol. 34. Curran Associates, Inc., 16183–16195. https://proceedings.neurips.cc/paper_files/paper/2021/file/86e8f7ab32cfd12577bc2619bc635690-Paper.pdf
[92]
Taylor Sorensen, Joshua Robinson, Christopher Rytting, Alexander Shaw, Kyle Rogers, Alexia Delorey, Mahmoud Khalil, Nancy Fulda, and David Wingate. 2022. An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.acl-long.60
[93]
William Swartout, Jonathan Gratch, Randall Hill, Eduard Hovy, Stacy Marsella, Jeff Rickel, and David Traum. 2006. Toward virtual humans. AI Magazine 27, 1 (2006).
[94]
Milind Tambe, W Lewis Johnson, Randolph M Jones, Frank Koss, John E Laird, Paul S Rosenbloom, and Karl Schwamb. 1995. Intelligent agents for interactive simulation environments. AI Magazine 16, 1 (1995), 15.
[95]
David R. Thomas. 2006. A General Inductive Approach for Analyzing Qualitative Evaluation Data. American Journal of Evaluation 27, 2 (2006), 237–246. https://doi.org/10.1177/1098214005283748
[96]
Frank Thomas and Ollie Johnston. 1981. Disney Animation: The Illusion of Life. Abbeville Press, New York.
[97]
Ilshat Umarov, Mikhail Mozgovoy, and Patrick C. Rogers. 2012. Believable and Effective AI Agents in Virtual Worlds: Current State and Future Perspectives. International Journal of Gaming and Computer-Mediated Simulations 4, 2 (2012), 37–59.
[98]
Graham Upton and Ian Cook. 2006. A Dictionary of Statistics (2 ed.). Oxford University Press, Oxford, United Kingdom.
[99]
Oriol Vinyals, Igor Babuschkin, Wojciech M. Czarnecki, et al. 2019. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575 (2019), 350–354. https://doi.org/10.1038/s41586-019-1724-z
[100]
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. 2023. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. arXiv:2201.11903 [cs.CL]
[101]
Mark Weiser. 1991. The computer for the 21st century. Scientific American 265, 3 (1991), 94–104. https://doi.org/10.1038/scientificamerican0991-94
[102]
Joseph Weizenbaum. 1966. ELIZA—a computer program for the study of natural language communication between man and machine. Commun. ACM 9, 1 (1966), 36–45.
[103]
Terry Winograd. 1971. Procedures as a Representation for Data in a Computer Program for Understanding Natural Language. (1971).
[104]
Jeff Wu, Long Ouyang, Daniel M. Ziegler, Nisan Stiennon, Ryan Lowe, Jan Leike, and Paul Christiano. 2021. Recursively Summarizing Books with Human Feedback. arXiv:2109.10862 [cs.CL]
[105]
Tongshuang Wu, Ellen Jiang, Aaron Donsbach, Jeff Gray, Alejandra Molina, Michael Terry, and Carrie J Cai. 2022. PromptChainer: Chaining Large Language Model Prompts through Visual Programming. In CHI EA ’22: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems.
[106]
Tongshuang Wu, Michael Terry, and Carrie J Cai. 2022. AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts. In CHI ’22: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems.
[107]
Qian Yang, Aaron Steinfeld, Carolyn Rosé, and John Zimmerman. 2020. Re-examining whether, why, and how human-AI interaction is uniquely difficult to design. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1–13.
[108]
Georgios N. Yannakakis. 2012. Game AI revisited. In Proceedings of the 9th Conference on Computing Frontiers. ACM, Cagliari, Italy, 285–292. https://doi.org/10.1145/2212908.2212950
[109]
Robert Zubek. 2002. Towards implementation of social interaction. In AAAI Spring Symposium on Artificial Intelligence and Interactive Entertainment. AAAI Press. https://www.aaai.org/Papers/Symposia/Spring/2002/SS-02-01/SS02-01-003.pdf