How to train one agent and don't update another one in multi-agent reinforcement learning setup?

Question

Lingfeng Tao on 30 Jul 2021

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/889522-how-to-train-one-agent-and-don-t-update-another-one-in-multi-agent-reinforcement-learning-setup

Answered: Hornett on 25 Sep 2024

Hi, I have a multi-agent reinforcment learning environment with two agents. One agent is pretrained (agentA) and the perofrmance is good. I need to train another one (agentB) to cooperate with agentA. I'm wondering if I can only train agentB and keep agentA unchanged so its performance will not be influenced.

Thanks!

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Hornett on 25 Sep 2024

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/889522-how-to-train-one-agent-and-don-t-update-another-one-in-multi-agent-reinforcement-learning-setup#answer_1522475

You can train AgentB in a multi-agent reinforcement learning environment while keeping AgentA's policy unchanged. This approach allows AgentB to learn how to effectively cooperate with the pretrained AgentA without affecting AgentA's performance. Here are concise steps and considerations for implementing this strategy:

Fixed Policy for AgentA:

Keep AgentA's policy unchanged during AgentB's training sessions.

Training AgentB:

Update only AgentB's policy based on its received observations and rewards, aiming to improve cooperation with AgentA.

Environment Setup:

Ensure the environment supports interactions between both agents and provides appropriate observations and rewards to each.

Key Considerations:

- Observability: AgentB needs to observe relevant aspects of the environment and AgentA's actions or state.

- Reward Structure: Design AgentB's rewards to encourage effective cooperation with AgentA.

- Exploration: Employ exploration strategies for AgentB to discover cooperative strategies.

Evaluation:

- Regularly assess both AgentB's individual performance and the overall system performance to ensure effective cooperation.

By focusing on these aspects, you can train AgentB to cooperate with a fixed-policy AgentA, enhancing the multi-agent system's performance without compromising the pretrained agent's behavior.

Hope it helps!

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

How to train one agent and don't update another one in multi-agent reinforcement learning setup?

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

How to train one agent and don't update another one in multi-agent reinforcement learning setup?

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments