Getting Started
Before you run any benchmarks, you’ll need to set up your environment. General Prerequisites:- MemMachine Backend: Both tools require that your MemMachine backend be installed and configured. If you need help with this, you can check out our QuickStart Guide.
-
Start the Backend: Once everything is set up, start MemMachine with this command:
cfg.yml file has been copied into your locomo directory (/memmachine/evaluation/locomo/) and renamed to locomo_config.yaml.
Running the Benchmark
Ready to go? Follow these simple steps. A. All commands should be run from their respective tool directory (defaultlocomo/episodic_memory/).
B. The path to your data file, locomo10.json, should be updated to match its location. By default, you can find it in /memmachine/evaluation/locomo/.
C. Once you have performed step 1 below, you can repeat the benchmark run by performing steps 2-4. Once are you finished performing the benchmark, run step 5.
Note: Although the process is simple, the commands for each type of Tool Set may differ. Please refer to the Episodic Memory Tool Set for the exact commands to run for each step.
1
Ingest a Conversation
First, let’s add conversation data to MemMachine. This only needs to be done once per test run.
2
Search the Conversation
Let’s search through the data you just added.
3
Evaluate the Responses
Next, run a LoCoMo evaluation against the search results.
4
Generate Your Final Score
Once the evaluation is complete, you can generate the final scores.The output will be a table in your shell showing the mean scores for each category and an overall score, like the example below:
5
Clean Up Your Data
When you’re finished, you may want to delete the test data.

