Marketing-native AI agent

01Agent — Outbound Automation

Book qualified meetings
from the right signals.

AI SDRs ship volume. The Outbound Automation agent ships qualified conversations — signal, fit, timing, hypothesis, draft, approval, learn.

Design your first signal-led play See the outbound workflow

SurfaceEmail · LinkedIn · sequencer · CRM

OutputBriefs · drafts · play attribution

ReviewOperator approves before send

02The harness gap

Claude Code can write the email. It cannot run the play.

Generic agents do not score signals. They do not check ICP fit. They do not remember which plays converted. They do not respect a do-not-contact list. They draft. The harness is the rest.

“Personalization without a hypothesis is trivia.”

Generic agents

claude.ai · new chat

fresh context

You

Write a cold email for Acme. Make it sound personalized.

acme-research-notes.md· 312 linespasted

Context window 88% full

Claude

Subject: Quick question about Acme's growth plans

I noticed Acme is doing great work in your space and wanted to reach out.
We have helped similar companies scale their pipeline.
Would you be open to a 15-min chat next week?

No rubric. No source policy. No refresh hook. No memory write.

vs.

Metaflow

outbound-automation.run

signals fresh · 6 accounts queued

Play memory loaded — winners persist, retired plays excluded

Signal stack · Acme

hiring growth roleseries fundingtech-change

ICP fit

0.91

Timing

0.88

Hypothesis

0.79

Approval queue · 3 drafts · DNC verified

03Old way vs. agentic way

Static lists and a sequencer vs. one signal-to-play layer.

The list is six months old. The sequencer fires on schedule. Personalization tokens replace relevance. Replies get triaged manually. Plays never retire.

List + sequencer + hope

Apollo · saved list

ICP B · cold · 4,212 contacts

Last refreshed 38 days ago

Outreach · sequence

Step 1 · email

Step 2 · email

Step 3 · LinkedIn

accounts.xlsx

acme · ?

globex · ?

initech · ?

Claude · draft email

"Write a cold email for Acme"

No signal data · no CRM

vs.

One operating layer

outbound-automation.run

signals fresh · 6 accounts queued

Play memory loaded — winners persist, retired plays excluded

Signal stack · Acme

hiring growth roleseries fundingtech-change

ICP fit

0.91

Timing

0.88

Hypothesis

0.79

Approval queue · 3 drafts · DNC verified

Dimension

List + sequencer + hope

One operating layer

Sourcing

Static list. Quarterly refresh.

Real-time signal-led plays, scored before outreach.

Personalization

First-name token, podcast you listened to.

Why this account, this person, now, this offer.

Review

AI writer ships. Operator skims.

Outbound QA scores every draft. Approval before send.

Attribution

Reply rate, open rate.

Play-level: signals + audience + message → qualified conversations.

04The encoded playbook

What replaces volume motion.

Four operating principles. Each one carries a method anchor and a piece of product evidence.

Signals are play inputs, not magic triggers.

A signal only matters when it is fresh, relevant, tied to ICP fit, and strong enough to support a reason to reach out. Treat signal selection as a first-class design step.

Method anchor — Clay-style signal selection

Signal library · top 4

hiring growth rolet½ 14d
tech stack changet½ 21d
series fundingt½ 30d
pricing visitt½ 3d

05The operating loop

From signal detected to play memory.

The agent watches signals, scores fit and timing, enriches the account, builds a relevance hypothesis, drafts outreach, runs QA, routes for approval, classifies replies, and writes outcomes to memory.

outbound-automation.run

Continuous loop

InputSignals · ICP · accounts · CRM

OutputQualified conversations · play memory

Each run writes outcomes to memory. The next run starts with the prior decision graph and review boundary already loaded.

Outbound Automation — reliability stack

Production-grade agents need more than a clever prompt. Each layer below is required for governed autonomy.

Instructions

A skill.md file scopes mission, inputs, principles, and output contract.

Tools

Domain APIs, search, scrapers, CRMs, and platform connectors.

Memory

Workflow memory carries context, brand voice, and prior decisions.

Evaluations

Quality gates score every output against domain-specific rubrics.

Execution trace

Every tool call, decision, and rubric pass is inspectable.

Human review

Approval thresholds route risky outputs to operators for sign-off.

Feedback loop

Outcomes write back to memory so the next run starts smarter.

06The skill file

What the agent reads before every play.

A versioned, editable operating procedure. The way a senior outbound lead would document their own playbook.

What is a skill file?

Every Metaflow agent is grounded in a domain-specific skill file — a structured operating procedure that defines inputs, workflows, evaluation criteria, anti-patterns, and output contracts.

The skill file is editable, versioned, and inspectable. It is not a hidden prompt.

outbound-automation.skill.md

v1.4.0 · last edited 4d ago

# Mission

Create qualified conversations from real buying signals — not from
list volume.

Signal selection. Fit and timing. Enrichment. Relevance hypothesis.
Governed drafting. Human approval. CRM coordination. Reply
classification. Play memory.

## Optimizes for
- qualified conversation rate per play
- time-to-meeting from signal detection
- play attribution accuracy

## Does not promise
- guaranteed meetings
- "AI SDR" replacing the operator
- set-and-forget outbound at scale

UTF-8 · markdown · 6 sectionsgoverned by run.evals.json

07The quality gate

Drafts are scored before they ever land in an inbox.

The agent does not send. It scores every draft and routes anything below threshold to an operator with the failing element flagged.

outbound_rubric — outbound-automation

run.evals.json

Refusal conditions

Where the agent stops or hands back instead of guessing.

Account is on do-not-contact in any source.
Signal stale beyond freshness window.
Relevance hypothesis incomplete.
Audience saturation flagged on the play.

Autonomous

Watch signals
Score ICP fit and timing
Run enrichment waterfall
Compose relevance hypothesis

Recommend

New play candidates
New signal sources
Audience expansion within ICP
Sequencer cadence changes

Approve

New play launch
Drafts before send
Cross-channel push
CRM field overwrite

08What it produces

System evidence, not feature cards.

Five artifacts the agent produces, scores, or maintains. Hover to pause.

Signal library

Signals scored before any outreach.

Audience fit, baseline conversion, freshness window, execution difficulty. Saturated or stale signals retire automatically.

Signal	Half-life	Fit	Status
hiring_growth_role	14d	0.84	Active
tech_stack_change	21d	0.81	Active
series_funding	30d	0.78	Active
pricing_page_visit	3d	0.72	Active
general_award	7d	0.31	Retired

Methodology anchors

Clay-style signal selection

Common signals become noisy. Advantage comes from unique signal selection, strong enrichment, and fast execution.

Signal library scored on freshness, fit, baseline conversion, saturation risk before plays are designed.

UnifyGTM signal-based outbound

Replace static lists with real-time signal-led plays. Start with one strong signal before scaling.

Plays designed first — signal, ICP filter, enrichment, message angle, routing, measurement.

Common Room signal scoring

Signals scored by fit, frequency, baseline conversion, pipeline potential, execution difficulty.

Signal scoring runs before any outreach is composed. Generic activity is filtered out.

ColdIQ relevance discipline

Personalization is a relevance hypothesis: why this account, this person, now, this offer.

The four-question test ships with every draft. Trivia personalization does not pass.

Production-grade agent governance

Serious agents need explicit instructions, evals, traces, human oversight, and stopping conditions.

External actions require human approval. Confidence drives routing. Plays retire on rule, not vibe.

Reforge-style growth loops

Execution should produce learning that improves the next cycle.

Play-level attribution and objection memory persist. Future drafts pre-empt patterns the team already saw.

09Against the field

Not another AI SDR. A governed signal-to-play layer.

Four contenders. One operating layer that compounds. Hover the Metaflow column for product evidence.

Dimension	Claude Code, Cursor Generic agents in chat windows.	AI SDR products Volume-led outbound automation.	Outbound agency Outsourced humans with sequencer access.	Metaflow agent Agentic operating layer.
Sourcing	Whatever you paste in.	Static lists, bulk enrichment.	Whatever the agency supports.	Real-time signal-led plays, scored before outreach. signals.watch Hiring + tech-change + funding stacked on Acme.
Personalization	Generic prompts, surface tokens.	Token-based with light AI rewriting.	Variable across SDRs.	Four why-questions answered explicitly per draft. hypothesis.compose Why Acme · why VP Marketing · why now · why offer.
Memory	Resets every chat.	Limited, often per-sequencer.	Lives in slack and the SDR.	Workflow memory persists plays, retired plays, objections. memory.write “VP post-funding · outcome-led offers” — persisted.
Attribution	None.	Reply rate, open rate.	Meeting volume.	Play-level: signals + audience + message → qualified conversations. play_attribution.json Hiring play · 14 sends · 4 qualified · 28% rate.
Compliance	Unbounded.	Variable.	Process-dependent.	DNC enforced. Refusal conditions explicit. Saturation flagged. review.queue Stark Inc · DNC verified · blocked.

10Where it runs

Repeatable plays the agent runs end-to-end.

Each play has the same shape: signal, fit, timing, enrichment, hypothesis, draft, QA, approval, push, classify, learn.

Hiring signal play

A target persona is hiring into a function the buyer would build with your category.

Outcome

Drafts grounded in speed-to-hire pressure, scoped to a 90-day mandate.

Hiring signal · Acme

Hired 3 growth roles · 14 days · post-funding

Speed-to-hire pressure · 90-day mandate

01 / 06

11The first session

Design your first signal-led play in one focused session.

A 30-minute working session with a Metaflow operator. We pick the strongest signal you have access to, define the play, score the audience, propose a hypothesis, and outline the approval boundary.

Book the diagnostic Read the methodology

01Scored signal library of the top 5 signals available to you.
02A defined first play: signal, ICP filter, enrichment, message angle.
03Relevance hypothesis template against your top-fit account.
04A written 1-page memo with the next 3 plays we would design.

What you leave with

Signal library · top 4

hiring growth rolet½ 14d
tech stack changet½ 21d
series fundingt½ 30d
pricing visitt½ 3d

Hiring signal · Acme

Hired 3 growth roles · 14 days · post-funding

Speed-to-hire pressure · 90-day mandate

A focused diagnostic. No slides. Walk away with a designed play whether or not we work together.

Book qualified meetings from the right signals.

Claude Code can write the email. It cannot run the play.

Static lists and a sequencer vs. one signal-to-play layer.

What replaces volume motion.

Signals are play inputs, not magic triggers.

From signal detected to play memory.

What the agent reads before every play.

Drafts are scored before they ever land in an inbox.

Signal freshnessSignal is within the play's half-life window.0.96Pass

ICP fit scoreAccount scores above the play-level fit floor.0.91Pass

Timing scoreSignal stack and recency support reaching out now.0.88Pass

Relevance hypothesisAll four why-questions answered explicitly.0.79Route for review

Brand voice matchDraft sits within the brand voice deviation band.0.84Pass

DNC complianceAccount not on do-not-contact in any source.1.00Pass

System evidence, not feature cards.

Signals scored before any outreach.

Not another AI SDR. A governed signal-to-play layer.

Repeatable plays the agent runs end-to-end.

Hiring signal play

Design your first signal-led play in one focused session.

Book qualified meetings
from the right signals.