Senior Data Scientist
Statistical modeling, A/B testing, causal inference, and advanced analytics.
Real Conversations
See how the Senior Data Scientist skill responds to actual requests. These are representative examples of what your AI agent will produce.
We want to A/B test removing the coupon code field from checkout. Baseline conversion is 3.2%. We need 95% confidence. How do I size the experiment?
Sample Size Calculation:
from scipy.stats import norm
import math

def sample_size_per_variant(p_baseline, mde, alpha=0.05, power=0.80):
    """Calculate required n per variant for a two-proportion z-test."""
    p_treatment = p_baseline * (1 + mde)  # mde is a relative lift
    z_alpha = norm.ppf(1 - alpha / 2)  # two-tailed
    z_beta = norm.ppf(power)
    variance_sum = p_baseline * (1 - p_baseline) + p_treatment * (1 - p_treatment)
    n = (z_alpha + z_beta) ** 2 * variance_sum / (p_treatment - p_baseline) ** 2
    return math.ceil(n)
# Your numbers:
n = sample_size_per_variant(p_baseline=0.032, mde=0.10)  # detect 10% relative lift
# → ~49,800 users per variant ≈ 99,500 total
At what traffic volume does this complete?
- 1,000 checkouts/day → about 100 days
- 5,000 checkouts/day → about 20 days
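The duration estimate is total required sample divided by daily traffic. A minimal sketch of that arithmetic follows; the function name `days_to_complete`, the `round_to_weeks` option, and the 50,000-per-variant input are illustrative assumptions, not part of the skill's output. Rounding up to whole weeks is a common practice to cover weekday/weekend cycles, though whether it applies depends on your traffic pattern.

```python
import math

def days_to_complete(n_per_variant, daily_traffic, n_variants=2, round_to_weeks=False):
    """Estimate how many days of traffic are needed to fill every variant."""
    total_needed = n_per_variant * n_variants
    days = math.ceil(total_needed / daily_traffic)
    if round_to_weeks:
        # Optional: extend to full weeks so each weekday is equally represented.
        days = math.ceil(days / 7) * 7
    return days

# Hypothetical round figure of 50,000 users per variant:
print(days_to_complete(50_000, daily_traffic=1_000))
print(days_to_complete(50_000, daily_traffic=5_000, round_to_weeks=True))
```

This assumes every checkout visitor enters the experiment and traffic is split evenly; if only a fraction of traffic is eligible, scale `daily_traffic` down accordingly.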
Key decisions before you launch:
- MDE choice matters more than alpha: A 10% relative lift on 3.2% means detecting 3.52% vs. 3.2%. Is that a meaningful business outcome? If you need to detect smaller effects, set MDE to 5%, which roughly quadruples the sample size because n scales with 1/MDE².
- Randomization unit: Randomize by user_id, not session. A user seeing both variants inflates false positives.
- Guardrail metrics: Track add_to_cart rate and revenue_per_visitor alongside conversion. Removing the coupon field could hurt AOV even if it lifts conversion.
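One common way to randomize by user rather than by session is to hash the user ID together with an experiment name, so the same user always lands in the same variant. The sketch below is one possible approach, not the skill's prescribed method; the function name `assign_variant` and the experiment label `coupon_field_removal` are hypothetical.

```python
import hashlib

def assign_variant(user_id, experiment="coupon_field_removal",
                   variants=("control", "treatment")):
    """Deterministically map a user to a variant via a salted hash."""
    # Salting with the experiment name keeps assignments independent
    # across different experiments that share the same user base.
    key = f"{experiment}:{user_id}".encode("utf-8")
    digest = hashlib.sha256(key).hexdigest()
    bucket = int(digest, 16) % len(variants)
    return variants[bucket]

# The same user_id yields the same variant on every call and every session.
print(assign_variant("user_12345"))
```

Because the assignment depends only on the ID and the experiment name, no lookup table is needed, and the split can be recomputed at analysis time to verify exposure logs.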
Analysis (post-experiment):
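from scipy.stats import chi2_contingency
# Rows are variants; columns are observed counts of converted vs. not-converted users
contingency = [[control_converted, control_not], [treat_converted, treat_not]]
# Note: for 2x2 tables SciPy applies Yates' continuity correction by default (correction=True)
chi2, p_value, _, _ = chi2_contingency(contingency)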
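A p-value alone does not say how large the effect is. As a complement to the chi-square test, the sketch below reports the absolute difference in conversion rates with a normal-approximation confidence interval, using only the standard library. The function name and the example counts are made up for illustration; the pooled z-test it runs should agree with an uncorrected chi-square test on the same 2x2 table.

```python
import math
from statistics import NormalDist

def diff_in_proportions(control_converted, control_total,
                        treat_converted, treat_total, alpha=0.05):
    """Return (difference, confidence interval, two-sided p-value)."""
    p_c = control_converted / control_total
    p_t = treat_converted / treat_total
    diff = p_t - p_c

    # Pooled standard error for the hypothesis test (H0: equal rates).
    p_pool = (control_converted + treat_converted) / (control_total + treat_total)
    se_pool = math.sqrt(p_pool * (1 - p_pool) * (1 / control_total + 1 / treat_total))
    z = diff / se_pool
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))

    # Unpooled standard error for the confidence interval on the difference.
    se = math.sqrt(p_c * (1 - p_c) / control_total + p_t * (1 - p_t) / treat_total)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    return diff, (diff - z_crit * se, diff + z_crit * se), p_value

# Made-up counts, for illustration only:
diff, ci, p_value = diff_in_proportions(1600, 50_000, 1760, 50_000)
print(f"diff={diff:.4f}, 95% CI=({ci[0]:.4f}, {ci[1]:.4f}), p={p_value:.4f}")
```

If the interval excludes zero but its lower bound is below the lift you consider worthwhile, the result is statistically significant yet may not justify shipping.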
Get Started
How to install and use this skill in your preferred environment.
Skills are designed for AI coding agents (Claude Code, Cursor, Windsurf) and IDE-based workflows where the agent can read files, run scripts, and act on your codebase.
Models & Context
Which AI models and context windows work best with this skill.
Recommended Models
Larger models produce more detailed, production-ready outputs.
Context Window
This skill's SKILL.md is typically 3–10 KB — fits in any modern context window.
All current frontier models support 100K+ context.
Pro tips for best results
Be specific
Include numbers — users, budget, RPS — so the skill can size the architecture.
Share constraints
Compliance needs, team size, and existing stack all improve the output.
Iterate
Start with a high-level design, then ask follow-ups for IaC, cost analysis, or security review.
Combine skills
Pair with companion skills below for end-to-end coverage.
Ready to try Senior Data Scientist?
Install the skill and start getting expert-level guidance in your workflow — any agent, any IDE.