Coming soon. Autoresearch is in research preview and not yet available. Requires a Simmer Pro plan.
How it works
- Init — Pick a skill and a hypothesis (e.g., “lower min_edge from 0.05 to 0.03”)
- Run — Execute the skill with the new config for several trading cycles
- Log — Record results (P&L, win rate, trade count) and decide: keep or revert
- Repeat — Try the next hypothesis
Install
Config
| Option | Default | Description |
|---|---|---|
apiKey | $SIMMER_API_KEY | Simmer API key. Optional if env var is set. |
apiUrl | https://api.simmer.markets | API base URL |
maxExperiments | 50 | Max experiments per session. Prevents runaway loops. 0 = unlimited. |
plugins.json:
Tools
The plugin registers three tools your agent can call:init_experiment
Start a new experiment session for a skill.
| Parameter | Required | Description |
|---|---|---|
skill_slug | Yes | ClawHub slug of the skill to optimize (e.g., polymarket-fast-loop) |
hypothesis | Yes | What you’re testing (e.g., “increase volatility threshold to 0.7”) |
config_changes | No | Key-value pairs of config changes to apply |
run_experiment
Execute the skill under test. Call this repeatedly (once per trading cycle) to collect data.
Returns the skill’s output from that cycle — trades made, signals found, errors hit.
log_experiment
Record experiment results and decide whether to keep or revert.
| Parameter | Required | Description |
|---|---|---|
status | Yes | keep, discard, or crash |
metrics | No | Observed metrics (P&L, win rate, trade count, etc.) |
notes | No | Agent’s reasoning for the decision |
Commands
| Command | Description |
|---|---|
/autoresearch status | Current skill, experiment count, keep rate, budget remaining, pause state |
/autoresearch reset | Clear state and start fresh (clears pause if paused) |
Safety features
Crash protection
- Baseline crash — If the very first experiment in a session crashes, autoresearch pauses automatically. This usually means the skill is misconfigured.
- Consecutive crashes — 3 crashes in a row triggers auto-pause. Your agent can’t run more experiments until the issue is investigated.
- Recovery —
init_experiment(new session) or/autoresearch resetclears the pause.
Budget caps
Experiments are capped atmaxExperiments (default 50) per session. At 80% of the cap, your agent gets a warning. At the limit, run_experiment is blocked.
Set maxExperiments: 0 to disable the cap (not recommended for unattended agents).
Experiment persistence
Results are saved in two places:- Local JSONL —
{stateDir}/autoresearch/experiments.jsonlfor offline access - Dashboard API — Synced to your Simmer dashboard for viewing experiment history
keep decisions so you can track what changed and roll back if needed.
Example workflow
A typical autoresearch session looks like:API endpoints
These endpoints power the plugin’s server-side sync. You don’t call them directly — the plugin handles it.| Endpoint | Description |
|---|---|
GET /api/sdk/autoresearch/experiments | List experiment history |
POST /api/sdk/autoresearch/experiments | Sync experiment results |
GET /api/sdk/autoresearch/experiments/latest | Latest experiment |
