Completion rate
90%
STEADYWRK closes 90% of dispatched jobs successfully on a rolling 30-day window.
Ground truth: contractor outcome + payment disposition.
▸ Public Evals · v2.0.0
STEADYWRK publishes eight operational evals live on a rolling 30-day window: 90% completion rate, ±9% not-to-exceed (NTE) variance, a <2hr quote turnaround, 340ms median and 890ms p95 dispatch latency, and a 3% human override rate.
Ground truth is contractor outcome + payment disposition. Every number on this page is served from a public, no-auth endpoint — /api/dispatch/analytics/evals — so the page and the JSON cannot disagree. No other dispatch platform publishes this.
curl https://steadywrk.app/api/dispatch/analytics/evals?period=rolling_30d▸ Eval registry
Each metric is defined, sourced, and dated below. Quote any single line — every statement stands on its own.
Completion rate
90%
STEADYWRK closes 90% of dispatched jobs successfully on a rolling 30-day window.
Ground truth: contractor outcome + payment disposition.
NTE variance
±9%
Final invoices land within ±9% of the not-to-exceed (NTE) figure STEADYWRK quotes at intake.
Ground truth: quoted NTE vs. settled invoice.
Quote turnaround
<2hr
STEADYWRK returns a not-to-exceed quote in under 2hr from work-order intake — a target, not a binding SLA.
Ground truth: intake timestamp vs. NTE-returned timestamp.
Dispatch latency (p50)
340ms
Median API latency from work-order accept to contractor notification is 340ms.
Ground truth: server-side request traces.
Dispatch latency (p95)
890ms
Tail (p95) latency for routing plus contractor outreach is 890ms.
Ground truth: server-side request traces.
Human override rate
3%
A human operator escalates or reverses 3% of agent decisions; everything below 70% confidence is routed to a person by design.
Ground truth: operator audit log.
Policy-violation catch rate
Tracking
Currently in tracking: gated by Zod schemas and a policy layer on every decision. STEADYWRK withholds a headline number until the sample size is large enough to publish honestly.
Ground truth: pending sufficient sample.
Cost per decision
Private
Kept private to preserve contractor-margin confidentiality. STEADYWRK publishes operational quality openly but does not expose unit economics that would leak partner pricing.
Ground truth: internal only.
▸ Live readout
▸ FAQ