Skip to main content

// case studies

Operators running the spine, not the squeegee.

The teams below replaced spreadsheets, dashboards, and standing meetings with one append-only log. Every result here is a fictional composite of patterns we've seen across 100+ deployments — every metric the kind we measure on day one.

6 case studies 6 industries Verified by operator

// browse

Filter by industry.

// featured

Solace — 4.3× decision throughput in 90 days.

A growth team of six, three paid channels, and one kill switch. Below: the numbers Solace's Head of Paid Media tracks now that the decision_log replaced the Tuesday sync.
SOLACEApparel · DTC · US + EU
4.3×decision throughput, 90 days

Solace runs paid social across Meta, TikTok, and Google with a 6-person growth team. Before Magistry, decisions queued behind weekly syncs. Their planner-judge-executor loop now runs every 6 hours, every channel, with the kill switch held by their Head of Paid Media.

The decision_log is the killer feature. Every move is a row we can audit and reverse. We replaced four spreadsheets, three Looker dashboards, and the awkward Tuesday standup with one append-only log. My CFO sleeps better.

Jacob Dorian
Head of Paid Media

// the deltas

Decisions / day

228
+330%

Avg ROAS, paid

3.4×
+0.9×

Time to action

<6h
-72h

Operator hours / wk

12h
-28h
Read the full Solace case study

// more operators

Five more stores. Five more shapes of autonomy.

Each card below is one team, one cycle pattern, and one number that moved. Click through for the full breakdown — executors, gates, evidence.
Linen House

Home goods · EU

82%

of CS auto-resolved

From 4 CS agents to 1 — same response time, half the noise.

The CS Specialist now handles refund-eligible orders end-to-end. Reply Judge gates everything below 0.78 brand-voice fidelity for human review. The remaining 18% is the work humans should actually be doing.

Read case study
NORTHWOOD

Outdoor · US + CA

12days

connect → first live cycle

Connected on a Monday. First live decision row on a Friday two weeks later.

Northwood ran 14 days of dry-run-only proof across 2,800 SKUs, then flipped Catalog Specialist to live. 1,200 dead SKUs vaulted in week three. Margin policy now enforced in the type system, not a Notion doc.

Read case study
Reef & Range

Apparel · AU + NZ

213%

new-customer acquisition lift

Campaign Specialist drove first-quarter NCA above plan by triple-digits.

Anomaly responder writes NEGATIVE_TERMS, scales winners, and pauses losers across Google + Meta with per-action rate limits. Self-calibrating thresholds anchored to Reef's own distribution — not someone else's playbook.

Read case study
Stoke Goods

Pet · US

40%

CS volume dropped in week 1

The remaining queue is the work humans should be doing.

Stoke's CS Specialist resolves shipping queries, refund-eligible cases, and policy-bounded exchanges autonomously. Reply Judge surfaces the empathy-gated cases to humans — and gives the team a brand-voice anchor everyone references now.

Read case study
BREVARD

Wellness · US + UK

0

rogue writes in 6 months live

Kill switch held by Head of Retention. Never tripped. Always tested.

Brevard's monthly drill: trip the kill switch, watch every executor stand down inside 90 seconds, verify the decision_log chain signature, flip back on. The audit trail is now part of their compliance pack — pre-built for them.

Read case study
OKAY WILD

Outdoor · US

28SKUs

from research → live in 9 days

Researcher surfaces winning products before the trend hits Reels.

Six discovery lanes — Meta Ads Library, AliExpress trend graph, competitor bestsellers, reverse-image supplier match, LLM copy, autopricing — feed the draft pipeline. Trademark + uniqueness filters run before anything enters the catalog.

Read case study

// the pattern

Six teams. One shape.

Different industries, different stacks, different team sizes — but every team here followed the same arc.
01

Connect, dry-run.

Read-only Shopify token in. First dry-run cycle inside an hour. The agents draft what they would have done — for 14 days, zero writes.

02

Flip, gated.

Operator flips Phase 2 with kill switch held. Per-action rate limits capped to 20% of suggested write volume for the first 7 days. Judge floor enforced.

03

Ramp, reversible.

Volume ramp tracked in audit plane. Every action stays reversible by row. The team's role shifts from doing the work to setting the policy that does the work.

// your story next

See your own case study in 90 days.

The fastest path to a numbers-driven story is to connect a read-only token. We'll show you exactly what the agents would have done over your last 30 days — before anything writes.

Dry-run by default · Append-only logs · One-click rollback