
Autoresearch - Turn Prompt Tuning Into Measured Experiments
Run eval loops on any prompt or skill, mutate one variable at a time, and keep only what measurably improves output.
About
Most AI skills and prompts work... sometimes. They nail it on one run, drift on the next, and you end up rewriting by gut feel until something sticks. That's not optimization. That's guessing with extra steps.
Autoresearch turns prompt improvement into an actual experiment. It takes any skill or prompt you've built, runs it repeatedly against real test inputs, scores every output with binary pass/fail evals (not vibes), and then mutates one variable at a time - keeping only changes that measurably improve results.
Think of it as A/B testing for your AI workflows, except the AI runs the tests, tracks the scores, and makes the edits for you.
What makes this different:
- Binary evals, not taste. You define what good means as yes/no checks.
- One change at a time. Every mutation is isolated.
- Live dashboard. Watch the optimization happen in real time.
- Full research log. Every experiment is logged.
- It actually stops. Built-in convergence detection.
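The loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the skill's actual interface: `run_prompt`, the eval list, and the mutation functions are all hypothetical stand-ins.

```python
# Sketch of an autoresearch-style optimization loop (illustrative only).

def run_prompt(prompt: str, test_input: str) -> str:
    # Stand-in for a real model call; here it just echoes deterministically.
    return f"{prompt}::{test_input}"

# Binary evals: each check returns strictly True or False, never a vibe score.
EVALS = [
    lambda out: len(out) > 0,   # produced anything at all
    lambda out: "::" in out,    # followed the expected output format
]

def score(prompt: str, inputs: list[str]) -> float:
    """Pass rate across all (test input, eval) pairs."""
    results = [ev(run_prompt(prompt, x)) for x in inputs for ev in EVALS]
    return sum(results) / len(results)

def optimize(prompt: str, inputs: list[str], mutations, patience: int = 3):
    """Apply one mutation at a time; keep a change only if the score improves."""
    best, best_score, stale = prompt, score(prompt, inputs), 0
    for mutate in mutations:
        candidate = mutate(best)      # exactly one variable changed per round
        s = score(candidate, inputs)
        if s > best_score:
            best, best_score, stale = candidate, s, 0
        else:
            stale += 1
        if stale >= patience:         # convergence: stop when improvement stalls
            break
    return best, best_score
```

Every candidate is scored against the same inputs and the same yes/no checks, so a kept mutation reflects a measured improvement in pass rate rather than a subjective preference.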
Core Capabilities
- prompt optimization
- binary eval design
- autonomous experiment loops
- A/B-style mutation logging
- live dashboard artifact generation
Customer ratings
5.0 average · 1 review
- 5.0 — Verified customer, Mar 30, 2026
Version History
This skill is actively maintained.
March 30, 2026
Initial release. Full autoresearch methodology with binary evals, mutation loops, live dashboard, and complete worked example.
One-time purchase
$9
Details
- Type: Skill
- Category: Productivity
- Price: $9
- Version: 1
- License: One-time purchase
Works With
Works with OpenClaw, Claude Projects, Custom GPTs, Cursor and other instruction-friendly AI tools.