Have a personal or library account? Click to login
Open Access
|Jul 2018

Abstract

A standard method to evaluate new features and changes to e.g. Web sites is A/B testing. A common pitfall in performing A/B testing is the habit of looking at a test while it’s running, then stopping early. Due to the implicit multiple testing, the p-value is no longer trustworthy and usually too small. We investigate the claim that Bayesian methods, unlike frequentist tests, are immune to this “peeking” problem. We demonstrate that two regularly used measures, namely posterior probability and value remaining are severely affected by repeated testing. We further show a strong dependence on the prior probability of the parameters of interest.

DOI: https://doi.org/10.2478/zireb-2018-0004 | Journal eISSN: 1849-1162 | Journal ISSN: 1331-5609
Language: English
Page range: 95 - 104
Published on: Jul 3, 2018
Published by: University of Zagreb, Faculty of Economics & Business
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2018 Markus Loecher, published by University of Zagreb, Faculty of Economics & Business
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.