Skip to content

SynthPanel · open source · backed by SynthBench

Run a synthetic panel. Know what it's worth.

When you can't run real user research, a synthetic panel is a reasonable stand-in. It won't give you the truth, but it will give you a direction. The catch is knowing how much to read into any given result, which is the problem SynthBench works on. SynthPanel is the open-source harness you run. SynthBench is the benchmarking service that tells you how well it holds up.

How they fit

Each one makes
the other better.

You can use either on its own. Together they form a loop: SynthBench scores runs against its test datasets, and the setups that score well shape how SynthPanel runs elsewhere.

01 — BENCHMARK

SynthBench scores the runs

SynthPanel runs against SynthBench's test datasets, questions where the real human answers are already known, so each model and configuration gets a clear score on how closely it tracks real people.

02 — RECOMMEND

SynthPanel learns

Whatever holds up best on those datasets becomes a recommendation inside SynthPanel, so the agent running your panel starts from a setup that's already been measured, not a guess.

03 — RUN

You run on what scored well

Your own panel runs on those vetted setups. As SynthBench adds datasets and tests new models, the recommendations underneath you keep improving.

↻ It compounds over time. SynthBench gives you a measurement instead of a guess, SynthPanel is what you run day to day, and as the datasets grow, the recommendations behind every panel get a little more reliable.

What's inside

What each
one does.

SynthPanel is the harness you run. SynthBench is the service that measures how representative it is.

synthpanel

An agent-first API

SynthPanel is built for an agent to operate. The API handles the orchestration, batching, and map-reduce, so the agent can stay focused on the research itself.

synthpanel

Persona packs & populations

Ready-made batches of personas to draw from, plus controls to choose who's in your panel, so you're sampling the group you care about rather than a generic crowd.

synthpanel

Bias surfaced up front

SynthPanel shows the known biases before you read too much into a result, and points the agent toward tuning that has tested well, like ensemble blends.

synthbench

Representativeness testing

Runs against curated datasets where the real human answers are known, measuring how closely synthetic answers follow actual people, including the demographic subgroups that aggregate numbers tend to hide.

synthbench

Cross-model bias

How bias shifts from one model to the next, which groups a large set of agents represents well, and the cases where nondeterminism helps rather than hurts.

synthbench

Public findings

SynthBench's results are public. They're worth reading on their own, and they feed back into the recommendations everyone running SynthPanel sees.

Source: SynthPanel on GitHub · SynthBench on GitHub · PyPI