Daily - January 15, 2025

Daily Shaarli

All links of one day in a single page.

January 15, 2025

20 lines of code that will beat A/B testing every time

Treat an A/B test as an multi-armed bandit RL problem. First we need to record the performance for each option. Then every time we are faced with a decision, calculate the reward for each option, pick the best one. Leave a small chance (e.g. 10%) for exploration where a random option is selected.

statistics