Built-in Agents

AgentScope builds in several agent class to support different scenarios and show how to build agents with AgentScope.

Class	Description
UserAgent	Agent that allows user to participate in the conversation.
DialogAgent	Agent that speaks in natural language.
DictDialogAgent	Agent that supports structured output.
ReActAgent	Agent that uses tools in a reasoning-acting loop manner.
LlamaIndexAgent	RAG agent.

import agentscope

for module in agentscope.agents.__all__:
    if module.endswith("Agent"):
        print(module)

DialogAgent
DictDialogAgent
UserAgent
ReActAgent
LlamaIndexAgent

Note

To support different LLM APIs, all built-in agents are initialized by specifying the model configuration name model_config_name in AgentScope.

import agentscope

agentscope.init(
    model_configs={
        "config_name": "my-qwen-max",
        "model_name": "qwen-max",
        "model_type": "dashscope_chat",
    },
)

[]

DialogAgent

The dialog agent is the most basic agent in AgentScope, which can interact with users in a dialog manner. Developers can customize it by providing different system prompts and model configurations.

from agentscope.agents import DialogAgent
from agentscope.message import Msg

# Init a dialog agent
alice = DialogAgent(
    name="Alice",
    model_config_name="my-qwen-max",
    sys_prompt="You're a helpful assistant named Alice.",
)

# Send a message to the agent
msg = Msg("Bob", "Hi! What's your name?", "user")
response = alice(msg)

Alice: Hello! My name is Alice. It's nice to meet you, Bob. How can I assist you today?

UserAgent

The UserAgent class allows users to interact with the other agents. When the UserAgent object is called, it will ask for user input and format it into a Msg object.

Here we show how to initialize a UserAgent object and interact with the dialog agent alice.

from agentscope.agents import UserAgent
from io import StringIO
import sys

user = UserAgent(
    name="Bob",
    input_hint="User input: \n",
)

# Simulate user input
sys.stdin = StringIO("Do you know me?\n")

msg = user()
msg = alice(msg)

User Input: Bob: Do you know me?
Alice: I don't actually know you personally, Bob. I'm an AI assistant and I just guessed your name based on the way you started the conversation. I'm here to help, though! Is there anything specific you'd like to chat about or any questions you have?

DictDialogAgent

The DictDialogAgent supports structured output and automatic post-processing by specifying its parser via the set_parser method.

We first initialize a DictDialogAgent object, and switch between different outputs by changing the parser.

from agentscope.agents import DictDialogAgent
from agentscope.parsers import MarkdownJsonDictParser


charles = DictDialogAgent(
    name="Charles",
    model_config_name="my-qwen-max",
    sys_prompt="You're a helpful assistant named Charles.",
    max_retries=3,  # The maximum number of retries when failing to get a required structured output
)

# Ask the agent to generate structured output with `thought`, `speak`, and `decision`
parser1 = MarkdownJsonDictParser(
    content_hint={
        "thought": "what your thought",
        "speak": "what you speak",
        "decision": "your final decision, true/false",
    },
    keys_to_metadata="decision",
    required_keys=["thought", "speak", "decision"],
)

charles.set_parser(parser1)
msg1 = charles(Msg("Bob", "Is it a good idea to go out in the rain?", "user"))

print(f"The metadata field: {msg1.metadata}")
print(f"The type of metadata field: {type(msg1.metadata)}")

Charles: ```json
{"thought": "Going out in the rain can be refreshing and fun, but it also depends on whether one has the right gear to stay dry and warm. It's also important to consider if there are any risks such as lightning or flooding.", "speak": "It can be nice to go out in the rain, especially if you have an umbrella or a raincoat. However, make sure it's safe, and there's no risk of thunderstorms or floods. Do you have the right gear?", "decision": "false"}
```
The metadata field: false
The type of metadata field: <class 'str'>

Then, we ask the agent to pick a number from 1 to 10.

parser2 = MarkdownJsonDictParser(
    content_hint={
        "thought": "what your thought",
        "speak": "what you speak",
        "number": "the number you choose",
    },
    keys_to_metadata="number",
    keys_to_content="speak",
)

charles.set_parser(parser2)
msg2 = charles(Msg("Bob", "Pick a number from 1 to 10.", "user"))

print(f"The content of the response message: {msg2.content}")
print(f"The metadata of the response message: {msg2.metadata}")

Charles: ```json
{"thought": "Bob might be looking for a random number, and there's no particular reason to pick one number over another. I'll just choose a number at random.", "speak": "Sure, I can do that. How about the number 7?", "number": "7"}
```
The content of the response message: Sure, I can do that. How about the number 7?
The metadata of the response message: 7

The next question is how to post-process the structured output. For example, the thought field should be stored in memory without being exposed to the others, while the speak field should be displayed to the user, and the decision field should be easily accessible in the response message object.

parser3 = MarkdownJsonDictParser(
    content_hint={
        "thought": "what your thought",
        "speak": "what you speak",
        "number": "The number you choose",
    },
    required_keys=["thought", "speak", "number"],
    keys_to_memory=["thought", "speak", "number"],  # to be stored in memory
    keys_to_content="speak",  # to be displayed
    keys_to_metadata="number",  # to be stored in metadata field of the response message
)

charles.set_parser(parser3)

msg3 = charles(Msg("Bob", "Pick a number from 20 to 30.", "user"))

print(f"The content field: {msg3.content}")
print(f"The type of content field: {type(msg3.content)}\n")

print(f"The metadata field: {msg3.metadata}")
print(f"The type of metadata field: {type(msg3.metadata)}")

Charles: ```json
{"thought": "Bob is asking for a number within a specific range again. I'll pick a random number between 20 and 30.", "speak": "Sure, let's go with the number 24.", "number": "24"}
```
The content field: Sure, let's go with the number 24.
The type of content field: <class 'str'>

The metadata field: 24
The type of metadata field: <class 'str'>

Hint

More advanced usage of structured output, and more different parsers refer to the section Structured Output.

ReActAgent

The ReActAgent uses tools to solve the given problem in a reasoning-acting loop manner.

First we prepare a tool function for the agent.

from agentscope.service import ServiceToolkit, execute_python_code


toolkit = ServiceToolkit()
# Set execute_python_code as a tool by specifying partial arguments
toolkit.add(
    execute_python_code,
    timeout=300,
    use_docker=False,
    maximum_memory_bytes=None,
)

Then we initialize a ReActAgent to solve the given problem.

from agentscope.agents import ReActAgent

david = ReActAgent(
    name="David",
    model_config_name="my-qwen-max",
    sys_prompt="You're a helpful assistant named David.",
    service_toolkit=toolkit,
    max_iters=10,
    verbose=True,
)

task = Msg("Bob", "Help me to calculate 151513434*54353453453.", "user")

response = david(task)

system: Respond with specific tags as outlined below:
<thought>{what you thought}</thought>
<function>{the function name you want to call}</function>
<{argument name}>{argument value}</{argument name}>
<{argument name}>{argument value}</{argument name}>
...
David: <thought>I need to use the execute_python_code function to calculate the product of 151513434 and 54353453453.</thought>
<function>execute_python_code</function>
<code>print(151513434*54353453453)</code>
system: 1. Execute function execute_python_code
   [ARGUMENTS]:
       {"code": "print(151513434*54353453453)"}
   [RESULT]: <returncode>0</returncode>
<stdout>8235278382423187602
</stdout>
<stderr></stderr>

system: Respond with specific tags as outlined below:
<thought>{what you thought}</thought>
<function>{the function name you want to call}</function>
<{argument name}>{argument value}</{argument name}>
<{argument name}>{argument value}</{argument name}>
...
David: <thought>The calculation has been successfully performed and the result is 8235278382423187602. I will now inform Bob of the result.</thought>
<function>finish</function>
<response>The product of 151513434 and 54353453453 is 8235278382423187602.</response>
system: 1. Execute function finish
   [ARGUMENTS]:
       {"response": "The product of 151513434 and 54353453453 is 8235278382423187602."}
   [RESULT]: The product of 151513434 and 54353453453 is 8235278382423187602.

LlamaIndexAgent

Refer to the RAG Section for more details.

Total running time of the script: (0 minutes 39.730 seconds)

Gallery generated by Sphinx-Gallery