Sample Ratio Mismatch

Math of Sample Ratio Mismatch (SRM)

Sample Ratio Mismatch (SRM) is a diagnostic test used in Randomized Controlled Trials (RCTs) to detect if the actual distribution of participants between variants (e.g., Control and Treatment) significantly deviates from the intended design. It uses a Pearson’s Chi-square Goodness-of-Fit test.

1. Setup

Variants: $k$ experimental groups (e.g., $k=2$ for Control and Treatment).
Observed Counts ( $O_i$ ): The actual number of users assigned to variant $i$ .
Total Sample Size ( $N$ ): The sum of all observed counts, $N = \sum_{i=1}^k O_i$ .
Target Allocation ( $p_i$ ): The intended probability of assignment for variant $i$ (e.g., $0.5$ for a 50/50 split).

2. Expected Counts ( $E_i$ )

The number of users we expected to see in each variant if the randomization worked perfectly: $E_i = N \times p_i$

3. Chi-square Statistic ( $\chi^2$ )

We calculate the cumulative squared deviation between observed and expected counts, normalized by the expected counts: $\chi^2 = \sum_{i=1}^k \frac{(O_i - E_i)^2}{E_i}$

4. Hypothesis Testing

Null Hypothesis ( $H_0$ ): The observed counts follow the target distribution (no mismatch).
Degrees of Freedom ( $df$ ): $df = k - 1$ .
P-value: The probability of observing a $\chi^2$ statistic as extreme as the one calculated, assuming $H_0$ is true. It is derived from the Chi-square distribution: $p\text{-value} = P(\chi^2_{df} > \chi^2_{calculated})$
Decision: If $p\text{-value} < \alpha$ (where $\alpha$ is a conservative threshold like $0.001$ ), we reject $H_0$ and flag an SRM.

Example (50/50 split)

If you intended to split 1,000 users evenly but observed 450 in Control and 550 in Treatment:

$E_{control} = 1000 \times 0.5 = 500$
$E_{treatment} = 1000 \times 0.5 = 500$
$\chi^2 = \frac{(450-500)^2}{500} + \frac{(550-500)^2}{500} = \frac{2500}{500} + \frac{2500}{500} = 5 + 5 = 10$
With $df=1$ , a $\chi^2$ of 10 gives $p\text{-value} \approx 0.0015$ , indicating a likely mismatch.

Real Example

Result

CausalData(df=(10000, 6), treatment='d', outcome='conversion', confounders=['platform_ios', 'country_usa', 'source_paid'], user_id='user_id')

Result

	treatment	count	mean	std	min	p10	p25	median	p75	p90	max
0	0.0	4955	0.198991	0.399281	0.0	0.0	0.0	0.0	0.0	1.0	1.0
1	1.0	5045	0.232904	0.422723	0.0	0.0	0.0	0.0	0.0	1.0	1.0

Result

SRMResult(status=no SRM, p_value=0.36812, chi2=0.8100)

Using without CausalData

Result

SRMResult(status=no SRM, p_value=0.47778, chi2=0.5039)

Sample Ratio Mismatch

Sample Ratio Mismatch

Math of Sample Ratio Mismatch (SRM)

1. Setup

2. Expected Counts (EiE_iEi​)

3. Chi-square Statistic (χ2\chi^2χ2)

4. Hypothesis Testing

Example (50/50 split)

Real Example

Using without CausalData

2. Expected Counts ( $E_i$ )

3. Chi-square Statistic ( $\chi^2$ )