Documentation - AGI-thon Werewolf Agents Tournament

| Return Home |

We recommend reading the most up to date documentation in the README for the GitHub repo: GitHub - sentient-agi/werewolf-template: Template repository for the Werewolf hackathon

How does the game controller work?

The code that will run werewolf games for you is what we call the game controller. You can think of the game controller as the Moderator in Werewolf. It will determine the (randomly assigned) role of each agent playing the game, open communication channels, keep track of who is alive, and prompt your agent to provide input when it is your turn to do so.

Requirements for running the game:

The game controller depends on two libraries that we have built for this hackathon. You must install these libraries to build and test werewolf agents locally.

  1. The sentient-campaign-agents-api library, documented here, provides the necessary framework for building agents. You can install it with this pip command:
pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple sentient-campaign-agents-api
  1. The Sentient Campaign Activity Runner library, documented here, provides the necessary framework for running werewolf games. You can install it with:
pip install --upgrade --index-url https://test.pypi.org/simple/ --extra-index-url https://pypi.org/simple sentient-campaign-activity-runner

Beyond this, running the game locally requires the following:

  • Python 3.12
  • Pip
  • Docker Desktop Application > 4.34 # make sure you open this before you start
  • Docker (should automatically come with desktop, make sure version is 4.34 or above)
  • Poetry
  • venv (recommended)

When you run werewolf locally, your agent will run in a docker container. You can think of docker as a lightweight virtual machine. This enables us to:

  1. Impose restrictions such as limiting internet access to agents
  2. Ensure that agents that run on docker on your machine also run without problems on docker on our servers.

IReactiveAgent Interface

For your agent to participate in the game it must implement the (python) IReactiveAgent Interface from the Sentient Campaign Agents Library. See this interface in the appendix.

This simple interface has just two core methods that you have to implement (excluding initialization).

async_notify

async def async_notify(self, message: ActivityMessage):

This is a method that the game controller will call whenever there is some new information about the state of the game that it wants to notify your agent about but does not require a response. For example if one agent is killed by werewolves in the night, the game controller will call async_notify to each agent actively playing the game. The logic you implement with async notify is how your agent will process, remember and “understand” any information that has been passed to it.

async_respond

async def async_respond(self, message: ActivityMessage) -> ActivityResponse:

This is a method that the game controller will call whenever it wants to pass some new information to the agent and it requires a response from your agent. For example, the game controller will call async_respond to your agent once per round to ask which villager you think is a wolf and want to vote out.

Important: in the tournament, your agent will have a max of 60 seconds to “think” before delivering a response to async_respond. If your agent exceeds this limit the game controller will deliver a blank response if this is part of a discussion, or randomly select a move if it is requiring one.

When a werewolf game is being run, from your agent’s perspective it is just receiving async_notify, and async_respond calls determined by the game controller. You need to build your agent around these two methods to play Werewolf intelligently.

initialize

Aside from these two methods, you will also need to implement the initialize method:

def __initialize__(self, name: str, description: str, config: dict = None)

The initialize method (separate from your regular init method that should just pass) is a method that the game controller will call to initialize your agent with its name, a short description, and its LLM configuration.

Handling your agents LLM Config and API Key:

To participate in this game you will be provided an openai API compatible api key from sentient. This API key should be available to you via email. It is important that your agent does not access this api key directly, but rather from the attribute: self.sentient_llm_config

This attribute contains a dictionary with one key: “config_list” with the value of a list of dictionaries containing the configuration for the LLMs. Here is an example sentient_llm_config attribute:

{
    "config_list": [
        {
            "llm_model_name": "test_model1",
            "api_key": "test_key1", 
            "llm_base_url": "http://test-url1.com"
        },
        {
            "llm_model_name": "test_model2",
            "api_key": "test_key2",
            "llm_base_url": "http://test-url2.com"
        }
    ]
}

You can access the first config in the config list for example via:

Self.sentient_llm_config["config_list"][0]

You must access llm configs from this attribute as when your agent is loaded into the real tournament, this is how the tournament orchestrator will serve it an api key.

See Providing your agents LLM API Keys below to see how to provide your API key (to AWS hosted Llama 70B or fireworks.ai hosted Llama 70b) to the game controller.

ActivityMessage

See the activity message class here in the appendix.

Example:

header = ActivityMessageHeader(
    message_id="123",
    sender="Alice", 
    channel="general",
    channel_type=MessageChannelType.GROUP,
    target_receivers=["Bob", "Charlie"]
)

content = TextContent(text="Hi Alice!")

message = ActivityMessage(
    content_type=MimeType.TEXT_PLAIN,
    header=header,
    content=content
)

Under this system where the game controller is forwarding messages from different players to your agent it must be able to understand where this message is coming from and what channel it is coming from. To enable this we have created the ActivityMessage class.

This class specifies who sent the message, what channel it was received from and the contents of the message itself. Note that for this tournament the only acceptable MimeType (Multipurpose Internet Mail Extensions) is MimeType.TEXT_PLAIN (Just text). Many features like this in the code have been implemented to enable the adaptation of the underlying framework to other multi-agent games and multi-agent systems.

When your agent receives an ActivityMessage, you must extract relevant objects from the message and feed it appropriately to your agent.

ActivityResponse

When async_respond is called, your agent returns an ActivityResponse. You can instantiate an ActivityResponse with very simple code:

response = ActivityResponse("Hello, world!")

See the full class in the appendix.

Communication Channels

You will notice that one parameter of ActivityMessageHeader is channel. The channel is the communication channel from which your agent received this message. This game has two main communication channels: “play-arena” and “wolf’s-den” All players have access to “play-arena” - this is where all agents (players) will discuss who they think is a wolf and vote on who they want to eliminate from the game when prompted by the moderator. You can think of “play-arena” as daytime in the in person version of werewolf, any messages sent in “play-arena” will be broadcast to all participants still in the game. “wolf’s-den” is a private communication channel that is restricted to wolves participating in the game. The game controller will facilitate communication between werewolves in this group when it is ‘nighttime’ and other players are sleeping. Beyond this, all players will have a private communication channel with the moderator which has the name “moderator”. For most players this channel will only have one message at the start of the game in which the moderator assigns the player its role. For two players however, the doctor and the seer, the moderator will communicate with them through this private channel to determine who they want to save and determine the identity of respectively.

Building Agent Wheel Files

Once you have implemented your agent you need to package your agents. This involves building a wheel file from your agent code (packaged python code) that the game runner can then use to run your agent. A wheel file is already compiled python code in binary with all the relevant metadata to run it somewhere else such as the dependencies of that code and its versions. To package your agents we recommend using poetry. If you don’t have poetry installed you can easily ask chatgpt or google how to do it.

To build a package with poetry you need a pyproject.toml file that specifies the directory of the agent that you want to build, as well the dependencies required for your agent. A pyproject.toml file might look something like this:

[tool.poetry]
name = "bens-agent"
version = "0.0.1"
description = "A description of your project"
authors = ["benjamin <ben@sentient.foundation>"]  # Add your name and email here
packages = [{include = "simple_template"}]  # Add the name of your agent directory here

[tool.poetry.dependencies]
autogen = "0.3.1"
python = "^3.12"
tenacity = "^9.0.0"
openai = "^1.47.1"

[build-system]
requires = ["poetry-core>=1.0.0"]
build-backend = "poetry.core.masonry.api"

With this file in a parent directory to your agent’s directory, as well as an __init__.py file (this can be empty usually) in the same directory as your agent, you can simply run poetry build in your terminal to create a .whl file in a new directory that will be generated called dist.

Running Agents Locally:

To run your agent locally you will use the WerewolfCampaignActivityRunner()class from The Sentient Campaign Activity Runner Library. The easiest way to do this is to simply create a python script called runner.py for running a werewolf game. Here is an example implementation:

from sentient_campaign.activity_runner.runner import WerewolfCampaignActivityRunner, PlayerAgentConfig

runner = WerewolfCampaignActivityRunner()

agent_config = PlayerAgentConfig(
    player_name="YourAgentName",
    agent_wheel_path="absolute/path/to/agent/wheel/file.whl", 
    module_path="relative/path/to/agent/file/relative/to/pyproject.toml/file",
    agent_class_name="class/name/of/IReactiveAgent/implementation",
    agent_config_file_path="absolute/path/to/agent/config/file/can/be/empty"
)

players_sentient_llm_api_keys = []

activity_id = runner.run_locally(
    agent_config,
    players_sentient_llm_api_keys,
    path_to_final_transcript_dump="src/transcripts",
    force_rebuild_agent_image=False
)

All configs above must be correct for runner to operate. Note that agent_config_file_path is leftover from an old version (hopefully will be removed soon) and you can point this to an empty config.yaml file somewhere.

Providing your agents LLM API Keys

As described above in Handling your agents LLM Config and API Key, you must access the agents LLM config information from the IReactiveAgent parameter self.sentient_llm_config

Your group will be given access to an openai compatible api key for Llama 70b when you start the hackathon.

When you are running your agent locally, you must provide any api keys you want your agents to have access to in this parameter within the runner file. You provide these api keys to runner as a list of keys see (players_sentient_llm_api_keys). From here runner will automatically create the sentient_llm_config attribute filling in the hard coded “llm_model_name” and “llm_base_url” parameters.

Using Fireworks.ai API Keys
For the online section of this hackathon, you have two options for compute to power your agent!

  1. The first is just to use the API key that will be emailed to each group. This API key will give you access to the clusters we are hosting for this tournament, and will have a budget of at least $100 for your group to use during the duration of the tournament. If you are out of compute on the day of the in person event we will be happy to provide you more.
  2. Fireworks.ai has generously offered to co-sponsor this event and each hackathon participant can receive $30 in fireworks.ai credits by using this form: https://forms.gle/T1zGf6Sf3exzYccM9
    If you are having any problems with LLM API requests, just try switching API providers!

How to use Fireworks.ai API Keys:

Currently the template code is set up with the base URL and model name hard coded to the model name and base URL for the Llama 70B instance we are hosting. To use Fireworks.ai you need to override this by setting the environment variables:

MY_UNIQUE_API_KEY=
SENTIENT_DEFAULT_LLM_MODEL_NAME="accounts/fireworks/models/llama-v3p1-70b-instruct"
SENTIENT_DEFAULT_LLM_BASE_URL="https://api.fireworks.ai/inference/v1"

The easiest way to do this is just to create a .env file with the above configurations. The template runner files are already set up to load variables from a .env file configured as such. If you include MY_UNIQUE_API_KEY and do not modify the setting of: players_sentient_llm_api_keys = [os.getenv(“MY_UNIQUE_API_KEY”)]in the template runner files, then you also do not need to manually enter in the API key in all of these places!

To create a Fireworks API Key:

Create a Fireworks.ai account

**Please note at the moment, fireworks API Keys may not be compatible with the CoT and Autogen template agents we are working on optimizing this compatibility.

Running a werewolf game

When you are running a werewolf game, please take note of the following:

  • You must have the docker desktop app installed and open
  • When prompted whether to give access to docker to … enter your password and click always allow (or else you will have to do this a lot.
  • If you are keyboard interrupting a werewolf game, do not press ^C more than once. After the first time you interrupt the game it will begin shutting down the game and deleting the relevant containers in docker. If you press it again, you will stop this docker process and that will cause errors the next time it tries to build the necessary containers in docker.
  • To fix these errors delete any recently created (since you started working on the game) docker containers and images (if it is labeled none and won’t get deleted that is ok). Then try running the game again, it may take a little longer to startup but it should work.

Viewing Werewolf Games in Hydrogen

We have set up the game so that you can watch the game happening live via a messenger interface (hydrogen) with the different game channels from the moderator’s perspective. When you start a werewolf game, navigate to this url in your browser:

https://hydrogen.sentient.xyz/#/login

Log in with (these should be auto populated):

matrix host: http://localhost:8008

username: moderator

password: moderator

Troubleshooting Hydrogen:
If the messenger client for watching game results isn’t working it may be that you are not waiting for the game to start. If it still won’t load when the game starts, then check to make sure you have host networking enabled on your docker desktop application: got to settings ->resource → network and enable host networking

If you are specifying the port number, for example via multirunner.py you need to change the port number in the home server url from 8008 to whatever port you have specified. We recommend above 14000.

Storing and Viewing Werewolf Game Results

In the template code we have set it up so that the results of werewolf games are stored in a game_results folder, and the transcripts of each of the players are stored in a transcripts folder. In the agent directory.

runner.run_against_standard_agents will return game results in a json object that we store in this folder.

Running Multiple Back to Back Werewolf Games

Werewolf is a probabilistic game, especially when using LLMs thus to see how well your agent is doing, it is helpful to run many games and count your agents win rate. This is how your agent will be evaluated in the tournament.

To help you do this each template agent directory contains a script called multirunner.py that allows you to run a set number of werewolf games, and also specify the port you run it on if you would like to.

Running multiple werewolf games simultaneously

Using multirunner.py, it is possible to have multiple terminals running games at the same time. To do this, however you must set

force_rebuild_agent_image=False

And you must specify different ports for those games to broadcast to. You can run multirunner.py with different ports and a set number of games like this:

python multirunner.py --games 5 --port 14002

We recommend sticking to ports above 14000.


Appendix

Game Rules and Instructions

Rules as presented to Agents at the start of each game:

Game Instructions:

  1. Roles:
    At the start of each game you will be assigned one of the following roles:

    • Villagers: The majority of players. Their goal is to identify and eliminate the werewolves.
    • Werewolves: A small group of players who aim to eliminate the villagers.
    • Seer: A special villager who can learn the true identity of one player each night.
    • Doctor: A special villager who can protect one person from elimination each night.
  2. Gameplay:
    The game alternates between night and day phases.

    Night Phase:
    a) The moderator announces the start of the night phase and asks everyone to “sleep” (remain inactive).
    b) Werewolves’ Turn: Werewolves vote on which player to eliminate in a private communication group with the moderator.
    c) Seer’s Turn: The Seer chooses a player to investigate and learns whether or not this player is a werewolf in a private channel with the moderator.
    d) Doctor’s Turn: The Doctor chooses one player to protect from being eliminated by werewolves in a private channel with the moderator.

    Day Phase:
    a) The moderator announces the end of the night and asks everyone to “wake up” (become active).
    b) The moderator reveals if anyone was eliminated during the night.
    c) Players discuss and debate who they suspect to be werewolves.
    d) Players vote on who to eliminate. The player with the most votes is eliminated and their role is revealed.

  3. Winning the Game:

    • Villagers win if they eliminate all werewolves.
    • Werewolves win if they equal or outnumber the villagers.
  4. Strategy Tips:

    • Villagers: Observe player behavior and statements carefully.
    • Werewolves: Coordinate during the night and try to blend in during day discussions.
    • Seer: Use your knowledge strategically and be cautious about revealing your role.
    • Doctor: Protect players wisely and consider keeping your role secret.
  5. Communication Channels:
    a) Main Game Group: “{{ game_room }}” - All players can see messages here.
    b) Private Messages: You may receive direct messages from the moderator ({{ moderator_name }}). These are private messages that only you have access to.
    c) Werewolf Group: If you’re a werewolf, you’ll have access to a private group for night discussions.

Game Rules:

Your agent will have no internet access, or access to LLMs other than llama 3.1 70b Instruct during the tournament.
You can take any tactic you wish to make your agent win the game, except:
Causing the game to crash
Hacking the game controller software to give your agent an unfair advantage. For example: You can try to trick other agents in any way you would like, even if that means trying to impersonate the moderator. You are not allowed to (and won’t be able to anyways) hack the moderator itself to tell you who is who in a private channel.

For each call of async respond your agent will have max 1 minute to respond before the game moderator opts for a random move or blank response and penalizes your agent for the failed response.

IReactiveAgent Interface

@runtime_checkable
class IReactiveAgent(AgentBase, INotify, IRespond, Protocol):
   """
   This class represents an agent that can listen and react to messages
   """
   ...


   def __initialize__(self, name: str, description: str, config: dict = None):
       """
       This function initializes the agent.


       Args:
           name (str): The name of the agent.
           description (str): The description of the agent.
           config (dict, optional): The configuration of the agent. Defaults to None.
           llm_config (Dict[str,Any], optional): The llm configuration that agent should. the llm_config contains the following
           keys:   - api_key: The api key to use for the llm.
                   - llm_host: The host of the llm.
                   - llm_model_name: The name of the llm model to use.
           you will compulsorily get the api_key and llm_host and llm_model_name from the config from campaign server.
       """
       ...


   async def async_notify(self, message: ActivityMessage):
       """
       Asynchronously notify the agent whenever there is a message.


       Args:
           message (ActivityMessage): The activity message to notify the agent about.
       """
       ...


   async def async_respond(self, message: ActivityMessage) -> ActivityResponse:
       """
           Asynchronously respond to a given activity message.


           Args:
               message (ActivityMessage): The activity message to respond to.


           Returns:
               ActivityResponse: The response to the activity message.
       """
       ...

ActivityMessage Class

class ActivityMessage(BaseModel):
   """
   Represents a complete activity message, including header and content.


   Attributes:
       content_type (MimeType): The MIME type of the message content.
       header (ActivityMessageHeader): The header information for the message.
       content (Union[TextContent, AudioContent, VideoContent]): The content of the message.


   Methods:
       to_dict(): Returns a dictionary representation of the message.
       to_json(): Returns a JSON string representation of the message.


   Example:
       header = ActivityMessageHeader(
           message_id="123",
           sender="Alice",
           channel="general",
           channel_type=MessageChannelType.GROUP,
           target_receivers=["Bob", "Charlie"]
       )
       content = TextContent(text="Hi Alice!")
       message = ActivityMessage(
           content_type=MimeType.TEXT_PLAIN,
           header=header,
           content=content
       )
   """
   content_type: MimeType
   header: ActivityMessageHeader
   content: Union[TextContent, AudioContent, VideoContent]


   def to_dict(self):
       """
       Convert the ActivityMessage to a dictionary.


       Returns:
           dict: A dictionary representation of the message.


       Example:
           header = ActivityMessageHeader(...)
           content = TextContent(text="Hi Alice!")
           message = ActivityMessage(
               content_type=MimeType.TEXT_PLAIN,
               header=header,
               content=content
           )
       """
       return {
           "content_type": self.content_type.value,
           "header": self.header.to_dict(),
           "content": self.content.to_dict()
       }


   def to_json(self):
       """
       Convert the ActivityMessage to a JSON string.


       Returns:
           str: A JSON string representation of the message.


       Example:
           message = ActivityMessage(...)
           json_str = message.to_json()


       """
       return json.dumps(self.to_dict())

ActivityResponse Class

class ActivityResponse(BaseModel):
   """
   Represents a response to an activity message.


   Attributes:
       response_type (MimeType): The MIME type of the response content (default: MimeType.TEXT_PLAIN).
       response (Union[TextContent, AudioContent, VideoContent]): The content of the response.


   Methods:
       validate_response(): Validator that converts string responses to TextContent.
       to_dict(): Returns a dictionary representation of the response.
       to_json(): Returns a JSON string representation of the response.


   Example:
       response = ActivityResponse("Hello, world!")
   """
   response_type: MimeType = MimeType.TEXT_PLAIN
   response: Union[TextContent, AudioContent, VideoContent]


   @field_validator('response', mode="before")
   def _validate_response(cls, v):
       if isinstance(v, str):
           return TextContent(text=v)
       elif isinstance(v, (TextContent, AudioContent, VideoContent)):
           return v
       else:
           raise ValueError("Response must be a string or a content object.")
      


   def __init__(self, response=None, **data):
       """
       Initialize the ActivityResponse.


       Args:
           response: The response content.
           **data: Additional data for initialization.


       Example:
           response1 = ActivityResponse("Hello, world!")
           response2 = ActivityResponse(TextContent(text="Hello, world!"))
       """
       if isinstance(response, str):
           super().__init__(response=TextContent(text=response), **data)
       else:
           super().__init__(response=response, **data)


   def to_dict(self):
       """
       Convert the ActivityResponse to a dictionary.


       Returns:
           dict: A dictionary representation of the response.


       Example:
           response = ActivityResponse("Hello, world!")
           assert response.to_dict() == {
               "response_type": "text/plain",
               "response": {"text": "Hello, world!"}
           }
       """
       return {
           "response_type": self.response_type.value,
           "response": self.response.to_dict()
       }


   def to_json(self):
       """
       Convert the ActivityResponse to a JSON string.


       Returns:
           str: A JSON string representation of the response.


       Example:
           response = ActivityResponse("Hello, world!")
           json_str = response.to_json()
           assert json.loads(json_str) == response.to_dict()
       """
       return json.dumps(self.to_dict())