research-article

Open access

Generative Agents: Interactive Simulacra of Human Behavior

Authors:

Joon Sung Park
Computer Science Department, Stanford University, United States
https://orcid.org/0000-0001-5036-4409

Joseph O'Brien
Computer Science Department, Stanford University, United States
https://orcid.org/0009-0004-0781-926X

Carrie Jun Cai
Google, United States
https://orcid.org/0000-0001-9421-7128

Meredith Ringel Morris
Google Research, United States
https://orcid.org/0000-0003-1436-9223

Percy Liang
Computer Science Department, Stanford University, United States
https://orcid.org/0000-0002-0458-6139

Michael S. Bernstein
Computer Science Department, Stanford University, United States
https://orcid.org/0000-0001-8020-9434

UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

Article No.: 2, Pages 1 - 22

https://doi.org/10.1145/3586183.3606763

Published: 29 October 2023

Abstract

Believable proxies of human behavior can empower interactive applications ranging from immersive environments to rehearsal spaces for interpersonal communication to prototyping tools. In this paper, we introduce generative agents: computational software agents that simulate believable human behavior. Generative agents wake up, cook breakfast, and head to work; artists paint, while authors write; they form opinions, notice each other, and initiate conversations; they remember and reflect on days past as they plan the next day. To enable generative agents, we describe an architecture that extends a large language model to store a complete record of the agent’s experiences using natural language, synthesize those memories over time into higher-level reflections, and retrieve them dynamically to plan behavior. We instantiate generative agents to populate an interactive sandbox environment inspired by The Sims, where end users can interact with a small town of twenty-five agents using natural language. In an evaluation, these generative agents produce believable individual and emergent social behaviors. For example, starting with only a single user-specified notion that one agent wants to throw a Valentine’s Day party, the agents autonomously spread invitations to the party over the next two days, make new acquaintances, ask each other out on dates to the party, and coordinate to show up for the party together at the right time. We demonstrate through ablation that the components of our agent architecture—observation, planning, and reflection—each contribute critically to the believability of agent behavior. By fusing large language models with computational interactive agents, this work introduces architectural and interaction patterns for enabling believable simulations of human behavior.
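
The store, synthesize, and retrieve loop described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' implementation: the `Memory` and `MemoryStream` names, the integer `step` timestamp, the word-overlap relevance score, and the caller-supplied `summarize` function that stands in for a language model call.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    text: str                  # natural-language record of one experience
    step: int                  # hypothetical simulation timestamp
    kind: str = "observation"  # "observation" or "reflection"


@dataclass
class MemoryStream:
    records: list = field(default_factory=list)

    def store(self, text, step, kind="observation"):
        """Append one experience to the agent's record."""
        self.records.append(Memory(text, step, kind))

    def retrieve(self, query, k=3):
        """Return the k memories sharing the most words with the query.

        Toy scoring (an assumption): word overlap, ties broken by recency.
        """
        query_words = set(query.lower().split())

        def score(memory):
            overlap = len(query_words & set(memory.text.lower().split()))
            return (overlap, memory.step)

        return sorted(self.records, key=score, reverse=True)[:k]

    def reflect(self, step, summarize):
        """Synthesize stored observations into one higher-level memory.

        `summarize` is a placeholder for a language model call.
        """
        observations = [m.text for m in self.records if m.kind == "observation"]
        self.store(summarize(observations), step, kind="reflection")
```

A planner built on this sketch would call `retrieve` with a description of the current situation and pass the returned memories to the language model as context. A real system would likely need a richer relevance measure than word overlap.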

Supplementary Material

ZIP File (3606763.zip), Supplemental File, 12.85 MB
