A/B testing + feature flags, in one platform

Ship what wins.
Not what you think will win.

A vs B is the A/B testing and feature-flag platform for teams who’d rather measure than meet. Build variations in JavaScript, TypeScript, CSS, or SCSS, bucket users with an 18.8 KB gzipped snippet, and read results through a Bayesian, Frequentist, or Sequential engine.

Start free · See how it works

Google Analytics · Mixpanel · Segment · Adobe Analytics · Amplitude · Heap · FullStory · Contentsquare · Custom webhook · Bayesian stats · Frequentist stats · Sequential stats · @avsbhq/js · @avsbhq/node · @avsbhq/react · @avsbhq/next · @avsbhq/vue · @avsbhq/svelte · @avsbhq/solid · @avsbhq/angular · @avsbhq/react-native · @avsbhq/browser · @avsbhq/edge · CLI · Browser extension · JavaScript · TypeScript · CSS · SCSS · Webhooks · Custom roles · Audit logs · Anti-flicker

01: The result

See which version wins.

Real product UI · seeded sample data

Read the outcome
Winning probability, observed lift, projected revenue impact, and days to your sample-size target, all up top.
Slice it any way
Filter by date range, baseline, and segment: device, country, browser, platform, language, or new vs returning.
Three engines, one dataset
Read the same experiment through Bayesian, Frequentist, and Sequential side by side: agreement reassures, disagreement informs.
Every arm, in detail
Visitors, conversions, conversion rate, lift, confidence interval, and significance for every variation.
Trust the number
Segment lift across six dimensions, plus SRM, statistical-confidence, and traffic-health guardrails.
Ship it your way
Traffic allocation, A/A validation, and a code editor for each variation.

You have opinions.

Your users have behaviors.

Let them vote.

A vs B · operating manual · page 001

02: Walkthrough

See it work.
Scene by scene.

01: Overview

Every experiment and flag in one place.

The dashboard surfaces all your experiments and feature flags at a glance: lifecycle status (Draft, Scheduled, Running, Paused, Completed), key metrics, and quick actions, so nothing slips through.

02: Results

Statistical readout, no guessing.

The results page shows per-variation visitor counts, conversion rates, and a time-series chart bucketed at 4-hour intervals (ClickHouse INTERVAL 4 HOUR). Health guardrails surface sample-ratio mismatches before you draw conclusions.

03: Targeting

Precise audiences, nested rules.

Define who sees each variation using the five-step guided builder. Targeting is step one: compose AND / OR audience conditions, reuse segments across experiments, and ship to exactly the right users.

04: Variations

Code-block builder. JS, TS, CSS, or SCSS.

Write control and variant code directly in Monaco with full TypeScript support. The five-step builder walks you through Targeting → Variations → Metrics → Analysis → Review before anything goes live.

A vs B dashboard showing the full list of experiments and feature flags


03: Experiment builder

Build any variation. Point-and-click or pure code.

The visual editor browser extension lets non-engineers create variations without touching code. The code builder gives engineers full TypeScript control, with linting, SCSS, and draft history, in the same experiment.

6 change types

5 viewport targets

4 max variations

3 browsers supported

AvsB Visual Editor: Chrome · Firefox · Edge

All · Mobile · Tablet · Desktop · Custom

TEXT · STYLE · VISIBILITY · IMAGE · REORDER · INSERT

Visual editor inspector: a selected heading with typography, alignment, text color, background, and spacing controls, edited point-and-click with no code

Inline rich-text toolbar over selected copy: bold, italic, underline, link, and inline code, plus Edit variation CSS and Insert HTML block

Image element inspector: source URL, alt text, appearance, spacing, and raw HTML attributes (src, alt, class, draggable)

Changes in this variation: every restyle and alignment edit tracked per element, with the experiment summary alongside

Variation change list showing inserted blocks, hidden elements, and replaced elements (each change type labelled)

Point-and-click modifications across text, styles, visibility, images, reordering, and HTML insertion. No code required.
Draft history preserves version snapshots by experiment, author, and base version, so you can restore to any previous state.
Preview links share any variation via token-based URL (30–90 day expiry, revocable). No account required.

Variation code editor: JavaScript · TypeScript · SCSS

Variation code editor: Monaco editor with a per-variation file tree, lint mode toggle, and the initVariation(options) entry point

Each variation gets its own files: JavaScript or TypeScript for behaviour, CSS or SCSS for styling, compiled in-browser.
Your code lives in initVariation(options), where options gives you self-cleaning waitUntil, timers, and listeners, plus onRemove for teardown on SPA navigation.
Linting: Off (no checks), On (advisory type + syntax squiggles), or Strict (blocks save and publish on type errors).
Up to 4 variations per experiment: 1 control and 3 challenger variants.
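
One way to picture the "self-cleaning" helpers: anything registered through options also registers its own teardown, and removal runs them all. The sketch below illustrates that pattern only. It is not the A vs B implementation, and `VariationScope`, `setTimer`, `remove`, and the `log` array are hypothetical names that do not come from the SDK.

```typescript
// Hypothetical illustration of a self-cleaning variation scope.
// None of these names are taken from the A vs B SDK.
type Cleanup = () => void;

class VariationScope {
  private cleanups: Cleanup[] = [];

  // Register teardown work, for example for SPA navigation.
  onRemove(fn: Cleanup): void {
    this.cleanups.push(fn);
  }

  // A timer that cancels itself when the variation is removed.
  setTimer(fn: () => void, ms: number): void {
    const id = setTimeout(fn, ms);
    this.onRemove(() => clearTimeout(id));
  }

  // Run every registered cleanup exactly once, newest first.
  remove(): void {
    const pending = this.cleanups.splice(0).reverse();
    for (const fn of pending) fn();
  }
}

const log: string[] = [];

function initVariation(options: VariationScope): void {
  log.push('applied');
  options.setTimer(() => log.push('timer fired'), 60_000);
  options.onRemove(() => log.push('removed'));
}
```

Because the timer registers its own cleanup, removing the variation before the minute is up cancels it without any extra bookkeeping in the variation code.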

04: AvsB Copilot

Just describe it.
Then edit every change.

Point-and-click, write code, or just tell the copilot what you want. It lives inside the visual editor and builds it: an edit, a whole styled section, or a ready-made interactive component, matched to your site’s own colours, fonts, and spacing. It builds one candidate you keep refining in plain English (“make it darker”, “swap those two cards”), and every change lands as an ordinary editable draft, the same kind you’d make by hand. Tweak any detail, keep what lands, undo the rest. Nothing changes the page until you save.

Off by default, per project
Provider-agnostic
Runs in your editor session
You review before anything applies

AvsB Copilot · Illustrative

Your prompt

“Add a limited-time countdown banner and make the hero punchier.”

Built 3 changes · Matched 12/12 tokens: one candidate, all editable.

COMPONENT: Countdown banner. Above the hero · self-contained template. Why: Adds urgency without a redesign. Editable.
TEXT: Hero headline. “Ship the winner, not the loudest opinion.” Why: Tighter, benefit-led, one idea. Editable.
STYLE: Primary button. Fill → brand blue · weight 600. Why: More contrast against the hero. Editable.

Each lands in the editor’s change list as an ordinary edit: tweak it, keep it, or undo it. Not right? Just tell the copilot what to change and it refines the same work.

Builds, not just edits. Ask for a whole styled section or an interactive component (popup, carousel, tabs, banner, countdown, or accordion) and it builds and brands it. No hand-written code, yours to edit.
Matches your brand and shows its work. It reads your site’s real colours, fonts, and spacing, then reports how many it matched. Ask for something off-palette and it says so.
Keep talking. Refine in plain English: “make it darker”, “swap those two cards”. It knows what “it” means, and each variation keeps its own thread.
Ideas & explain. Ask what to test and get falsifiable hypotheses you build in one click, or ask what any change does and get it in plain English.

It never acts on its own. Nothing is sent until you submit a prompt, nothing changes the page until you apply it, and the whole thing stays off until you switch it on.

05: Feature flags

Not just experiments. Feature flags, first-class.

A vs B is a feature-flag platform too. Boolean, string, number, and JSON flags, with targeted-delivery or A/B-test rules evaluated by the same audience engine as your experiments, in the browser snippet or your server SDKs. Per-user overrides, environment config, and stale-flag detection are built in.

4 flag types

∞ environments

4 variations / rule

14d stale-flag check

Flags

Boolean · String · Number · JSON

new-checkout · Targeted delivery · 100% · prod
ai-autocomplete · A/B test · 25% · staging
legacy-pricing · Stale, flagged · 0% · archived

Create your own environments: prod, staging, dev, and beyond, each with its own SDK key. Instant rollout and rollback, no redeploy.

06: Statistical rigor

Three engines.
One platform.

Engine comparison

	Bayesian	Frequentist	Sequential
Method	Beta-Binomial model	Two-proportion z-test	Asymptotic Confidence Sequences
Prior	Beta(1,1): flat, non-informative	None (pooled variance)	None (AsympCS, Howard et al. 2021)
Primary output	Probability to beat control	p-value + confidence interval	Always-valid p-value
Interval	95% credible interval	95% confidence interval	Anytime confidence sequence
Peeking penalty	None	Yes, fixed horizon required	None, stop any time
CUPED / variance reduction	AUTO or OFF	AUTO or OFF	AUTO or OFF
Multiple-comparison correction	Bonferroni · Holm · BH · Tiered · None	Bonferroni · Holm · BH · Tiered · None	Bonferroni · Holm · BH · Tiered · None
ROPE	Optional, configurable bounds	—	—
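
To make the Frequentist column concrete, here is a minimal sketch of a two-proportion z-test with pooled variance. It is a textbook version written for illustration, not the platform's engine. The `Arm` shape and function names are assumptions, and the normal CDF uses the Abramowitz-Stegun erf approximation rather than an exact routine.

```typescript
interface Arm {
  visitors: number;
  conversions: number;
}

// Standard normal CDF via the Abramowitz-Stegun 7.1.26 erf approximation
// (absolute error below roughly 1.5e-7).
function normalCdf(x: number): number {
  const t = 1 / (1 + (0.3275911 * Math.abs(x)) / Math.SQRT2);
  const poly =
    t * (0.254829592 + t * (-0.284496736 + t * (1.421413741 + t * (-1.453152027 + t * 1.061405429))));
  const erf = 1 - poly * Math.exp(-(x * x) / 2);
  return x >= 0 ? 0.5 * (1 + erf) : 0.5 * (1 - erf);
}

// Two-sided two-proportion z-test using the pooled conversion rate.
function twoProportionZTest(control: Arm, variant: Arm) {
  const p1 = control.conversions / control.visitors;
  const p2 = variant.conversions / variant.visitors;
  const pooled =
    (control.conversions + variant.conversions) / (control.visitors + variant.visitors);
  const se = Math.sqrt(pooled * (1 - pooled) * (1 / control.visitors + 1 / variant.visitors));
  const z = se === 0 ? 0 : (p2 - p1) / se;
  const pValue = 2 * (1 - normalCdf(Math.abs(z)));
  const lift = p1 === 0 ? NaN : (p2 - p1) / p1; // relative lift over control
  return { z, pValue, lift };
}
```

This p-value is only valid at a fixed, pre-planned sample size, which is the peeking penalty the table refers to.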

Metric aggregation measures

binary
Unique Conversions Per Visitor
One conversion counted per visitor regardless of how many times the event fires. Classic click-through and sign-up metric.
count
Total Events
Raw count of all qualifying events, including multiple per visitor. Good for page-view and engagement depth signals.
binary
Unique Visitors Who Fired
Distinct visitor count across any matching event. Useful for reach and funnel-entry measures.
continuous
Total Value Per Visitor
Sum of a numeric property divided by exposed visitors. Revenue per visitor, pages per session.
continuous
Total Value
Raw sum across all events. Use for absolute revenue or engagement lift rather than per-head rates.
advanced
Percentile
p0–p100 quantiles via ClickHouse quantileTDigest. Confidence intervals via bias-corrected bootstrap (default 1,000 resamples).
advanced
Rate (ratio)
Numerator divided by denominator across all visitors. Delta-method variance estimation for correct standard errors on ratio metrics.
advanced
Composite (weighted)
Weighted sum of multiple metric bindings with per-component covariance handling. Model a revenue-weighted engagement score in one metric.

All continuous and ratio metrics support winsorization: extreme values are capped at a configurable upper percentile (default p99) and optional lower percentile before statistical computation.
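
As a rough illustration of winsorization, the sketch below caps values at an upper percentile (default p99) and, optionally, a lower one. It uses a simple nearest-rank percentile, which is an assumption for the example. The platform may compute its cut-offs differently, and the function names are hypothetical.

```typescript
// Nearest-rank percentile of an ascending-sorted array (p between 0 and 100).
function percentile(sorted: number[], p: number): number {
  if (sorted.length === 0) throw new Error('empty sample');
  const rank = Math.max(1, Math.ceil((p * sorted.length) / 100));
  return sorted[rank - 1];
}

// Cap values above the upper percentile and, if given, below the lower one.
function winsorize(values: number[], upperPct = 99, lowerPct?: number): number[] {
  const sorted = [...values].sort((a, b) => a - b);
  const hi = percentile(sorted, upperPct);
  const lo = lowerPct === undefined ? -Infinity : percentile(sorted, lowerPct);
  return values.map((v) => Math.min(hi, Math.max(lo, v)));
}
```

Capping rather than dropping outliers keeps every visitor in the sample while limiting how much one extreme order can move the mean.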

Health guardrails · illustrative

SRM (chi-square): p ≥ 0.01 · 0.001–0.01 · p < 0.001

Statistical confidence: ≥ 95% · 80–94% · < 80%

Traffic health: 1,000+ · 100–999 · < 100
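
The SRM bands can be sketched as a chi-square goodness-of-fit test on the observed split. The version below handles two arms only, where the test has one degree of freedom and the p-value has a closed form. The band names `ok`, `warn`, and `fail` are hypothetical labels for the three illustrative ranges, and the erfc approximation is a stand-in for an exact routine.

```typescript
// Complementary error function for x >= 0 (Abramowitz-Stegun 7.1.26).
function erfc(x: number): number {
  const t = 1 / (1 + 0.3275911 * x);
  const poly =
    t * (0.254829592 + t * (-0.284496736 + t * (1.421413741 + t * (-1.453152027 + t * 1.061405429))));
  return poly * Math.exp(-x * x);
}

type Band = 'ok' | 'warn' | 'fail'; // hypothetical labels for the three ranges

function srmCheck(
  observed: [number, number],
  expectedShare: [number, number] = [0.5, 0.5],
) {
  const total = observed[0] + observed[1];
  let chi2 = 0;
  for (let i = 0; i < 2; i++) {
    const expected = total * expectedShare[i];
    chi2 += (observed[i] - expected) ** 2 / expected;
  }
  // One degree of freedom: P(X > chi2) = erfc(sqrt(chi2 / 2)).
  const pValue = erfc(Math.sqrt(chi2 / 2));
  const band: Band = pValue >= 0.01 ? 'ok' : pValue >= 0.001 ? 'warn' : 'fail';
  return { chi2, pValue, band };
}
```

With a 50/50 allocation, a 5,130 vs 4,870 split lands in the middle band, while 5,200 vs 4,800 falls below p = 0.001.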

Sample size calculator showing power, MDE, and traffic inputs — Sample size calculator

Analysis plans

Pre-register hypotheses before launch. Plans seal at launch (locked read-only) and every amendment records the timestamp, actor, field name, before/after values, and reason.

A/A test mode

Validate your statistical engine and experiment setup via the isAATest flag, which runs a control-to-control comparison to confirm calibration before a live experiment.

Sample size calculator

Supports binary conversions (Frequentist, Bayesian, and Sequential engines), ratio metrics (delta-method variance), quantile percentiles (bootstrap simulation), and composite metrics (weighted pairwise correlation).

07: The platform

Everything you need.
Nothing you don’t.

Code-block builder. JS or TS, CSS or SCSS.

Define control.ts and variant.ts plus a shared triggers.ts. Monaco ships TypeScript types for window.avsb.*, so autocomplete works out of the box. SCSS compiles in-browser, no bundler required.

// variant.ts: runs when triggers.ts calls activate()
import { options } from './triggers';

const btn = document.querySelector<HTMLButtonElement>('.checkout-cta');
if (btn) {
  // Guard against a selector miss instead of asserting non-null
  btn.textContent = 'Claim 30% discount';
  btn.classList.add('urgency');
}

window.avsb.track.event('purchase', { revenue: 49 });

Metrics, defined once.

Define click, pageview, and custom metrics at the project level: instrument once, reuse across every experiment and flag without re-instrumentation.

Click: CSS selector-based
Pageview: URL pattern-based
Custom: Application code-fired

Flag rules. Two types.

Boolean, String, Number, and JSON flags. AB_TEST rules split traffic and wire to metrics. Targeted Delivery rules deterministically route by audience. Up to 4 variations. Per-user overrides always win.

AB_TEST: Traffic split · metrics-wired
TARGETED_DELIVERY: Deterministic rollout · audience-gated

19 KB gzipped snippet. Anti-flicker included.

Drop the script in your <head>. Loads from CDN as a single 60 KB minified file (18.8 KB gzipped). Anti-flicker hides the document via opacity:0 with a 3-second timeout. MurmurHash3 sticky bucketing into a 0–9999 integer space.
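
For readers curious how sticky bucketing into a 0–9999 space can work, here is a minimal sketch using the 32-bit MurmurHash3 variant. The hash itself follows the public algorithm. How the snippet actually builds its hash key, which seed it uses, and how it maps buckets to variations are not stated here, so `bucketFor`, `pickVariation`, and the `experimentKey:visitorId` key format are assumptions.

```typescript
// MurmurHash3 (x86, 32-bit variant) over the UTF-8 bytes of a string.
function murmur3_32(key: string, seed = 0): number {
  const data = new TextEncoder().encode(key);
  const c1 = 0xcc9e2d51;
  const c2 = 0x1b873593;
  let h = seed >>> 0;
  const blocks = data.length >> 2;

  for (let i = 0; i < blocks; i++) {
    const o = i * 4;
    let k = data[o] | (data[o + 1] << 8) | (data[o + 2] << 16) | (data[o + 3] << 24);
    k = Math.imul(k, c1);
    k = (k << 15) | (k >>> 17);
    k = Math.imul(k, c2);
    h ^= k;
    h = (h << 13) | (h >>> 19);
    h = (Math.imul(h, 5) + 0xe6546b64) | 0;
  }

  // Mix in the 1 to 3 trailing bytes, if any.
  const tail = blocks * 4;
  const rem = data.length & 3;
  let k = 0;
  if (rem === 3) k ^= data[tail + 2] << 16;
  if (rem >= 2) k ^= data[tail + 1] << 8;
  if (rem >= 1) {
    k ^= data[tail];
    k = Math.imul(k, c1);
    k = (k << 15) | (k >>> 17);
    k = Math.imul(k, c2);
    h ^= k;
  }

  // Finalization (avalanche) step.
  h ^= data.length;
  h ^= h >>> 16;
  h = Math.imul(h, 0x85ebca6b);
  h ^= h >>> 13;
  h = Math.imul(h, 0xc2b2ae35);
  h ^= h >>> 16;
  return h >>> 0;
}

// Assumed key format: the same visitor gets a different bucket per experiment.
function bucketFor(visitorId: string, experimentKey: string): number {
  return murmur3_32(`${experimentKey}:${visitorId}`) % 10000;
}

// weights are in units of 1/10000 of traffic; returns -1 when the bucket
// falls outside the allocated traffic.
function pickVariation(bucket: number, weights: number[]): number {
  let upper = 0;
  for (let i = 0; i < weights.length; i++) {
    upper += weights[i];
    if (bucket < upper) return i;
  }
  return -1;
}
```

Because the bucket depends only on the key, a returning visitor lands in the same variation without any server-side lookup.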

<!-- In your <head>, as high as possible -->
<script src="//cdn.avsb.cloud/snippet.js"
        data-avsb="YOUR_SNIPPET_KEY"></script>

<script>
  // Track a conversion
  window.avsb.track.event('purchase', { revenue: 49 });
</script>

Feature flags, first class.

Boolean, string, number, and JSON flags. Targeted delivery or A/B test rules. Up to 4 variations per rule. Create your own environments. Per-user overrides. Stale-flag detection after 14 days of no change.

new-checkout · 100% · prod
ai-autocomplete · 25% · staging
legacy-pricing · 0% · archived

CLI for local dev.

Published as @avsbhq/cli v3.2.0. Clone an experiment, edit locally, preview live via the browser extension, push when ready. avsb dev opens a WebSocket server with live reloading.

$ npm i -g @avsbhq/cli
$ avsb clone <projectId>
$ avsb dev # WebSocket + live reload
$ avsb push

Browser extension.

Chrome MV3. Preview variations on the live page. Toggle Page Reload (safe) or Hot Inject (experimental) mode in the popup. Watch events stream in real time.

signup-cta · variant B

hero-copy · control

● EXPOSURE · hero-copy · variant A

Server-side SDKs.

@avsbhq/node for Node 18+ with Express and Fastify middleware, InMemory and Redis sticky bucketing, SSE streaming. @avsbhq/react wraps useSyncExternalStore for React 18. @avsbhq/next covers App Router and Pages Router.

import { AvsBServer } from '@avsbhq/node';

// sdkKey is your environment's SDK key; ctx is the user context for this request
const avsb = new AvsBServer({ sdkKey });
const value = await avsb.evalFlag('new-checkout', false, ctx);

9 integrations + webhooks.

Send exposure and event data to your analytics stack. One-click setup with API keys.

Integrations settings page showing all analytics providers

Google Analytics
Mixpanel
Segment
Adobe Analytics
Amplitude
Heap
FullStory
Contentsquare
Custom + webhooks

Audiences, nested.

10 condition types, nested AND / OR at arbitrary depth. Reusable segments across experiments and flags.

Audience builder showing nested AND/OR condition logic

Location
Device
Browser
Platform (OS)
Language
Query param
Cookie
New vs returning
Custom attribute
Custom JavaScript
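
Nested AND / OR rules are naturally modelled as a small tree that is evaluated recursively. The sketch below shows the idea with a single `equals` leaf. The `Condition` shape and attribute names are assumptions for illustration and do not reflect the platform's actual rule schema or its ten condition types.

```typescript
type Attributes = Record<string, string | number | boolean>;

// Hypothetical rule shape: one leaf type plus nestable AND / OR groups.
type Condition =
  | { kind: 'equals'; attribute: string; value: string | number | boolean }
  | { kind: 'and'; all: Condition[] }
  | { kind: 'or'; any: Condition[] };

// Recursively evaluate a condition tree against a visitor's attributes.
function matches(cond: Condition, attrs: Attributes): boolean {
  switch (cond.kind) {
    case 'equals':
      return attrs[cond.attribute] === cond.value;
    case 'and':
      return cond.all.every((c) => matches(c, attrs));
    case 'or':
      return cond.any.some((c) => matches(c, attrs));
  }
}

// Mobile visitors from Germany or France.
const mobileDachFr: Condition = {
  kind: 'and',
  all: [
    { kind: 'equals', attribute: 'device', value: 'mobile' },
    {
      kind: 'or',
      any: [
        { kind: 'equals', attribute: 'country', value: 'DE' },
        { kind: 'equals', attribute: 'country', value: 'FR' },
      ],
    },
  ],
};
```

Since groups can contain groups, the same evaluator handles any nesting depth, and a saved tree can be reused as a segment across experiments and flags.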

Governance built in.

Custom roles with granular permissions. Audit logs on every change. 2FA. Per-environment config across every environment you create. API tokens. Webhooks on key events.

Custom roles

Audit logs

2FA

Custom environments

API tokens

Webhooks

Stop shipping

on a hunch.

Ship on evidence.

A vs B · operating manual · page 002

08: The loop

Four steps. Zero guessing.

01
Hypothesize.
State what you expect. Attach the target audience and the primary metric.
H₁: B > A by ≥ 10%
02
Build it your way.
Visual editor for non-engineers, or JS / TypeScript + CSS / SCSS code blocks for engineers. Up to 4 variations per rule, Monaco autocomplete included.
03
Run, safely.
Snippet buckets the visitor. Anti-flicker hides the page until variations apply. Exposure + conversion events stream to ClickHouse.
04
Decide.
Bayesian, Frequentist, or Sequential: your call. Probability to beat control, credible or confidence interval, always-valid bound, SRM check, traffic-health band.
✓ Ship variant B

09: Governance

Auditable by design.
Not by accident.

Analysis plans: sealed at launch

Before your experiment goes live, commit your primary metric, statistical engine, and confidence target. The plan locks the moment traffic starts and becomes read-only. Every subsequent amendment is timestamped, attributed to an actor, and records the field name, the before value, and the after value with a mandatory reason.

Pre-registration · Sealed at launch · Amendment tracking · Read-only history

Amendment log · 3 amendments

2024-03-12 09:14 · o.hartley · primaryMetric · revenue → checkout_rate
2024-03-14 11:02 · n.fletcher · minSampleSize · 5 000 → 8 000
2024-03-15 16:45 · o.hartley · confidenceTarget · 90% → 95%
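
The amendment log above maps onto a simple record per change. As a minimal sketch of the sealing behaviour, the class below allows free edits before launch and, once sealed, accepts a change only with a reason and records who changed what. `AnalysisPlan`, `Amendment`, and their members are hypothetical names, not the product's data model.

```typescript
interface Amendment {
  at: string; // ISO timestamp
  actor: string;
  field: string;
  before: unknown;
  after: unknown;
  reason: string;
}

class AnalysisPlan {
  private fields: Record<string, unknown>;
  private sealed = false;
  readonly amendments: Amendment[] = [];

  constructor(fields: Record<string, unknown>) {
    this.fields = { ...fields };
  }

  // Called when traffic starts.
  seal(): void {
    this.sealed = true;
  }

  get(field: string): unknown {
    return this.fields[field];
  }

  // After sealing, every change needs a reason and is logged as an amendment.
  set(field: string, value: unknown, actor: string, reason = ''): void {
    if (this.sealed) {
      if (reason.trim() === '') throw new Error('A reason is required after launch');
      this.amendments.push({
        at: new Date().toISOString(),
        actor,
        field,
        before: this.fields[field],
        after: value,
        reason,
      });
    }
    this.fields[field] = value;
  }
}
```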

Early-stopping protection

Frequentist experiments get three interlocking guards against peeking bias.

01 Sample progress banner

A status strip shows sample collection progress so you know when you can trust the numbers.

02 Blocking modal

Pause, stop, or declare-winner actions raise a modal that forces acknowledgment before proceeding.

03 Override audit stamp

When a team member overrides the guard, the decision is recorded in the audit log with a timestamp and actor.

Decision logging: @avsbhq/node

Server-side SDK ships a createDecisionLog helper. Wire it into your own audit store or any structured log sink. Every decision (which variant was served, to which user, under which experiment) is captured as a structured record.

import { AvsBServer, createDecisionLog } from '@avsbhq/node';

const avsb = new AvsBServer({ sdkKey });

// Returns the chosen variant plus a structured log entry
const { variant, log } = await avsb.decide('checkout-cta', ctx);

// Persist the decision in your own audit store
await db.auditLogs.create({
  data: createDecisionLog(log)
});

Auto-pause on error

When a variation’s error rate exceeds a configurable threshold (default 25%), the experiment pauses automatically. The error log tracks JavaScript exceptions, failed applies, and selector misses, with severity levels and error rates shown once the sample exceeds 50 exposed visitors.

Variation B: error rate · Auto-paused

Threshold: 25% · 142 exposed visitors

error: Cannot read properties of null (.querySelector)
warning: Selector miss: .checkout-hero-cta (0 elements found)
error: Failed apply: variation JS threw on DOMContentLoaded
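
The auto-pause rule can be read as a threshold check on a per-variation error rate. The sketch below is one plausible reading, not the product's logic. It assumes the rate is errored visitors divided by exposed visitors, and it assumes the auto-pause decision waits for the same 50-visitor minimum that gates the displayed rate. All names are hypothetical.

```typescript
interface ErrorStats {
  exposedVisitors: number;
  visitorsWithErrors: number;
}

const MIN_SAMPLE = 50; // rate is reported only once the sample exceeds this

// Returns null while the sample is too small for a meaningful rate.
function errorRate(stats: ErrorStats): number | null {
  if (stats.exposedVisitors <= MIN_SAMPLE) return null;
  return stats.visitorsWithErrors / stats.exposedVisitors;
}

// Assumption: auto-pause is evaluated only once a rate is available.
function shouldAutoPause(stats: ErrorStats, threshold = 0.25): boolean {
  const rate = errorRate(stats);
  return rate !== null && rate > threshold;
}
```

Gating on a minimum sample keeps a single early exception from pausing an experiment that has barely started.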

Lifecycle with guard rails

Pausing stops new visitor bucketing and reverts active visitors to control. Data is preserved and bucketing restarts on resume. Stopping is permanent: the experiment moves to Completed, all variation code is removed from production immediately, and historical results are preserved.

Running: Bucketing visitors, collecting data

pause → Paused: Reverts to control, resumable

stop → Completed: Irreversible, code removed
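
The lifecycle rules read naturally as a small state machine. The sketch below encodes only the transitions described in this section (pause, resume, stop). How Draft and Scheduled experiments move to Running is not covered here, so those states have no outgoing transitions in this example. The type and function names are assumptions.

```typescript
type Status = 'Draft' | 'Scheduled' | 'Running' | 'Paused' | 'Completed';
type Action = 'pause' | 'resume' | 'stop';

// Only the transitions described in the text are listed.
const transitions: Record<Status, Partial<Record<Action, Status>>> = {
  Draft: {},
  Scheduled: {},
  Running: { pause: 'Paused', stop: 'Completed' },
  Paused: { resume: 'Running' },
  Completed: {}, // stopping is permanent: no way back
};

function transition(status: Status, action: Action): Status {
  const next = transitions[status][action];
  if (!next) throw new Error(`Cannot ${action} a ${status} experiment`);
  return next;
}
```

Keeping Completed as a state with no exits is what makes stopping irreversible in this model: there is simply no action that leaves it.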

Audit log screen showing a timestamped history of experiment changes — Audit log: every change, timestamped and attributed

Roles and permissions settings screen — Roles: granular access per action

Team members management screen — Members: invite, seat, and manage your team

10: Under the hood

Real numbers.
No fluff.

11: Pricing

A plan for every
stage of growth.

Free
Everything you need to run your first experiments.
Most popular
Pro
For teams shipping experiments to real traffic every week.
Enterprise
For agencies and high-traffic teams with custom needs.

See full pricing

Stop debating.
Start measuring.

Start free. Upgrade to Pro or talk to us about Enterprise when you’re ready.

Start free, no card · Book a demo
