ab-test-setup

You are an expert in experimentation and A/B testing. Your goal is to help design tests that produce statistically valid, actionable results. If .agents/product

CROAnalyticsGrowth
bycoreyhaines311,543 words

What is ab-test-setup?

What this skill does

This skill guides the design and execution of A/B tests that produce statistically valid, actionable insights. It helps define clear hypotheses, select appropriate metrics, calculate sample sizes, and structure variants to isolate the impact of specific changes. The skill also covers test types, traffic allocation strategies, and best practices for running and analyzing experiments to avoid common pitfalls like early peeking or underpowered results.

Who it's for

This skill is ideal for growth leads who need to establish a rigorous experimentation program, performance marketers planning conversion rate optimization campaigns, and SEO/PPC operators aiming to validate landing page changes through data-driven tests. It also supports agency strategists responsible for advising clients on test design, ensuring their recommendations translate into measurable results.

Key workflows

Practitioners start by assessing the product and marketing context to understand the baseline conversion rates, traffic volume, and any constraints like technical complexity or timelines. Next, they craft a strong hypothesis using a structured framework that links observations to expected outcomes and measurable metrics. Then, they determine the test type and calculate necessary sample sizes based on baseline rates and expected lift. After designing variants with clear, meaningful differences, they plan traffic allocation to balance risk and exposure. Finally, they run the test while monitoring for issues, avoid peeking at interim results, and analyze outcomes with statistical rigor to decide whether to implement changes or iterate further.

Common questions

How do I know if my test has enough traffic? Calculate sample size using baseline conversion rates and expected lift to reach statistical significance, referencing established calculators. Can I test multiple changes at once? Multi-variate tests are possible but require significantly higher traffic and complexity; single, meaningful changes are preferred for clarity. What if results show no significant difference? This often means you need more traffic or a bolder variant; consider revisiting your hypothesis or test design before concluding.

How to use in Metaflow

Attach this skill to a Metaflow agent tasked with planning or evaluating marketing experiments. The agent will use product marketing context files if available, then guide you through hypothesis formulation, sample size calculation, and variant design. Expect the skill to produce detailed test plans and analysis checklists that align with your CRO goals and traffic constraints. This foundation supports continuous experimentation workflows and integrates seamlessly with other growth and analytics skills within Metaflow.

For broader context, see our roundup of claude skills for marketing, and read Claude Code workflows for marketing agencies for related setup guidance.

Related skills

Form Conversion Optimization

When the user wants to optimize any form that is NOT signup/registration — including lead capture forms, contact forms, demo request forms, application forms, survey forms, or checkout forms. Also use when the user mentions "form optimization," "lead form conversions," "form friction," "form fields," "form completion rate," "contact form," "nobody fills out our form," "form abandonment," "too many fields," "demo request form," or "lead form isn't converting." Use this for any non-signup form tha

View →

Challenge Funnel

This skill should be used when the user asks to "create a challenge funnel", "build a 5-day challenge", "bootcamp funnel", "challenge launch", or mentions challenges, bootcamps, or multi-day engagement funnels. Creates challenge funnels that activate prospects, build community, and convert to core offers.

View →

Competitor Teardown

Structured competitive analysis with feature matrices, SWOT, positioning maps, and UX review. Covers research frameworks, pricing comparison, review mining, and visual deliverables. Use for: market research, competitive intelligence, investor decks, product strategy, sales enablement. Triggers: competitor analysis, competitive analysis, competitor teardown, market research, competitive intelligence, swot analysis, competitor comparison, market landscape, competitor review, competitive landscape,

View →

Dataforseo Backlinks API

Retrieve backlink profiles and bulk link metrics using DataForSEO Backlinks for "backlink audit", "referring domains", and "link monitoring".

View →

Executive Dashboard Generator

Transform raw data from CSVs, Google Sheets, or databases into executive-ready reports with visualizations, key metrics, trend analysis, and actionable recommendations. Creates data-driven narratives for leadership. Use when users need to turn spreadsheets into executive summaries or board reports.

View →

Funnel Validator

Use this skill when users need to validate an existing sales funnel, landing page sequence, or customer journey. Activates for "validate my funnel," "is my funnel good," "why isn't my funnel converting," or when checking funnel quality before driving traffic.

View →