feat: enabled saving and evaluation for moderator #271

JXZhou0224 · 2025-01-02T08:20:32Z

Closes #

📑 Description

added two new features for moderator:

saving

Now LLMAgents will store their profile in a AgentProfile on redis.
Later, LLMAgent can be loaded from existing AgentProfiles
and moderator can access AgentProfile to generate an EpisodeLog

evaluation

created evaluators for multiagent assessment (current Evaluators in sotopia only supports two agents)
moderator can now evaluate an episode on demand, currently there is only one evaluator available, new evaluators will be added later.

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed
Branch name follows type/descript (e.g. feature/add-llm-agents)
Ready for code review

ℹ Additional Information

examples/experimental/sotopia_original_replica/origin.toml

sotopia/experimental/agents/logs.py

XuhuiZhou · 2025-01-06T16:53:45Z

sotopia/experimental/agents/evaluators.py

@@ -0,0 +1,25 @@
+from abc import ABC, abstractmethod


why not using the existing evaluators

The existing evaluators base class can only take in message history. However, there might be cases where we need other information in EpisodeLog to generate an evaluation. So I made a new class of evaluator for future use.

feat: enabled saving and evaluation for moderator (#271)

JXZhou and others added 5 commits January 1, 2025 19:40

feat: enable saving AgentProfile and chat history on redis

e40b94d

feat: enable saving AgentProfile and chat history on redis

4086c49

fix: AgentProfile can now be loaded by EpisodeLog correctly

396db1a

feat: enable evaluation of EpisodeLog

448293f

[autofix.ci] apply automated fixes

a3abfee

JXZhou0224 marked this pull request as ready for review January 2, 2025 08:24

XuhuiZhou self-requested a review January 6, 2025 16:16

XuhuiZhou requested changes Jan 6, 2025

View reviewed changes

JXZhou and others added 3 commits January 7, 2025 17:49

fix: use EpisodeLog and AgentProfile from sotopia directly

bf40479

Merge branch 'sotopia-lab:demo' into demo

038ffa3

fix: use EpisodeLog and AgentProfile from sotopia directly

aad61a0

JXZhou0224 requested a review from XuhuiZhou January 7, 2025 10:03

XuhuiZhou changed the base branch from demo to feature/multiparty January 7, 2025 19:34

XuhuiZhou approved these changes Jan 7, 2025

View reviewed changes

XuhuiZhou merged commit 66da649 into sotopia-lab:feature/multiparty Jan 7, 2025
1 check passed

JXZhou0224 added a commit that referenced this pull request Jan 18, 2025

Merge pull request #272 from sotopia-lab/feature/multiparty

ea14fda

feat: enabled saving and evaluation for moderator (#271)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: enabled saving and evaluation for moderator #271

feat: enabled saving and evaluation for moderator #271

JXZhou0224 commented Jan 2, 2025 •

edited

Loading

XuhuiZhou Jan 6, 2025

JXZhou0224 Jan 7, 2025

feat: enabled saving and evaluation for moderator #271

feat: enabled saving and evaluation for moderator #271

Conversation

JXZhou0224 commented Jan 2, 2025 • edited Loading

📑 Description

saving

evaluation

✅ Checks

ℹ Additional Information

XuhuiZhou Jan 6, 2025

Choose a reason for hiding this comment

JXZhou0224 Jan 7, 2025

Choose a reason for hiding this comment

JXZhou0224 commented Jan 2, 2025 •

edited

Loading