Improving Data Analysis by Testing by Betting: Optional Continuation and Descriptive Statistics
Volume 2, Issue 2 (2024), pp. 215–228
Pub. online: 13 December 2023
Type: Statistical Methodology
Open Access
Accepted
2 December 2023
2 December 2023
Published
13 December 2023
13 December 2023
Abstract
When testing a statistical hypothesis, is it legitimate to deliberate on the basis of initial data about whether and how to collect further data? Game-theoretic probability’s fundamental principle for testing by betting says yes, provided that you are testing the hypothesis’s predictions by betting and do not risk more capital than initially committed. Standard statistical theory uses Cournot’s principle, which does not allow such optional continuation. Cournot’s principle can be extended to allow optional continuation when testing is carried out by multiplying likelihood ratios, but the extension lacks the simplicity and generality of testing by betting.
Testing by betting can also help us with descriptive data analysis. To obtain a purely and honestly descriptive analysis using competing probability distributions, we have them bet against each other using the principle. The place of confidence intervals is then taken by sets of distributions that do relatively well in the competition. In the simplest implementation, these sets coincide with R. A. Fisher’s likelihood ranges.
References
Augustin, T., Coolen, F. P. A., de Cooman, G. and Troffaes, M. C. M., eds. (2014) Introduction to Imprecise Probabilities. Wiley. https://doi.org/10.1002/9781118763117. MR3236913
Barnard, G. (1947). Review of Sequential Analysis by Abraham Wald. Journal of the American Statistical Association 42(240) 658–665. MR0020764
Barnard, G. A. (1949). Statistical inference. Journal of the Royal Statistical Society B 11(2) 115–149. MR0034983
Breiman, L. (1961). Optimal gambling systems for favorable games. In Fourth Berkeley Symposium on Probability and Mathematical Statistics (J. Neyman, ed.) 1 65–78. University of California Press. MR0135630
Bru, B., Bru, M.-F. and Bienaymé, O. (1997). La statistique critiquée par le calcul des probabilités: Deux manuscrits inédits d’Irenée Jules Bienaymé. Revue d’Histoire des Mathématiques 3 137–239. MR1620388
Darling, D. A. and Robbins, H. (1967). Confidence sequences for mean, variance, and median. Proceedings of the National Academy of Sciences of the United States of America 58(1) 66–68. https://doi.org/10.1073/pnas.58.1.66. MR0215406
Dawid, A. P. (1984). Present position and potential developments: Some personal views. Statistical theory, the prequential approach. Journal of the Royal Statistical Society: Series A 147(2) 278–290. https://doi.org/10.2307/2981683. MR0763811
Dawid, A. P. (1991). Fisherian inference in likelihood and prequential frames of reference. Journal of the Royal Statistical Society: Series B 53(1) 79–109. MR1094276
Diaconis, P. and Skyrms, B. (2018) Ten Great Ideas about Chance. Princeton. MR3702017
Donoho, D. (2017). 50 years of data science. Journal of Computational and Graphical Statistics 26(4) 745–766. https://doi.org/10.1080/10618600.2017.1384734. MR3765335
Doob, J. L. (1953) Stochastic Processes. Wiley, New York. MR0058896
Edwards, A. W. F. (1972) Likelihood: An Account of the Statistical Concept of Likelihood and its Application to Scientific Inference. Cambridge. MR0348869
Ethier, S. (2010) The Doctrine of Chances: Probabilistic Aspects of Gambling. Springer, Berlin. https://doi.org/10.1007/978-3-540-78783-9. MR2656351
Feller, W. (1950) An Introduction to Probability Theory and Its Applications, 1st ed. Wiley, New York. MR0038583
Feller, W. K. (1940). Statistical aspects of ESP. The Journal of Parapsychology 4(2) 271–298. MR0004461
Fisher, R. A. (1956) Statistical Methods and Scientific Inference. Hafner. Later editions in 1959 and 1973. MR0131909
Freedman, D. A. (2008). Randomization does not justify logistic regression. Statistical Science 23(2) 237–249. https://doi.org/10.1214/08-STS262. MR2516822
Freedman, D. A. (2009) Statistical Models: Theory and Practice, Revised Edition. Cambridge. https://doi.org/10.1017/CBO9780511815867. MR2489600
Hendriksen, A. A. (2017). Betting as an alternative to p-values. Master’s thesis, University of Leiden, under the direction of Peter Grünwald.
Kelly Jr., J. L. (1956). A new interpretation of information rate. Bell System Technical Journal 35(4) 917–926. https://doi.org/10.1002/j.1538-7305.1956.tb03809.x. MR0090494
Office of Diversity, U. o. M. Equity & Inclusion (2017). Results of the 2016 University of Michigan Faculty Campus Climate Survey on Diversity, Equity & Inclusion. https://diversity.umich.edu/wp-content/uploads/2017/11/DEI-FACULTY-REPORT-FINAL.pdf
Ramdas, A., Grünwald, P., Vovk, V. and Shafer, G. (2023). Game-theoretic statistics and safe anytime-valid inference. Statistical Science 38(4) 576–601. https://doi.org/10.1214/23-sts894. MR4665027
Rissanen, J. (1989) Stochastic Complexity in Statistical Inquiry. World Scientific. MR1082556
Royall, R. M. (1997) Statistical Evidence: A Likelihood Paradigm. Chapman & Hall. MR1629481
Shafer, G. (2019). On the nineteenth-century origins of significance testing and p-hacking. Working paper 55, Game-Theoretic Probability Project.
Shafer, G. (2020). Game-theoretic foundations for statistical testing and imprecise probabilities. SIPTA School 2020/2021, Slides and videos.
Shafer, G. (2021). Testing by betting: A strategy for statistical and scientific communication (with discussion). Journal of the Royal Statistical Society: Series A 184(2) 407–478. https://doi.org/10.1111/rssa.12647. MR4255905
Shafer, G. (2022). That’s what all the old guys said: The many faces of Cournot’s principle. Working Paper 60, Game-Theoretic Probability Project.
Shafer, G. and Vovk, V. (2001) Probability and Finance: It’s Only a Game. Wiley, New York. https://doi.org/10.1002/0471249696. MR1852450
Shafer, G. and Vovk, V. (2006). The sources of Kolmogorov’s Grundbegriffe. Statistical Science 21(1) 70–98. See also Working Paper 4, Game-Theoretic Probability Project.
Shafer, G. and Vovk, V. (2019) Game-Theoretic Foundations for Probability and Finance. Wiley, New York. https://doi.org/10.1002/0471249696. MR1852450
Shafer, G., Shen, A., Vereshchagin, N. and Vovk, V. (2011). Test martingales, Bayes factors and p-values. Statistical Science 26(1) 84–101. https://doi.org/10.1214/10-STS347. MR2849911
Troffaes, M. C. M. and de Cooman, G. (2014) Lower Previsions. Wiley. https://doi.org/10.1002/9781118762622. MR3222242
Vovk, V. (2023). The diachronic Baysian. Working paper 64, The Game-Theoretic Probability Project, http://probabilityandfinance.com.
Vovk, V. and Wang, R. (2021). E-values: Calibration, combination, and applications. Annals of Statistics 49(3) 1736–1754. https://doi.org/10.1214/20-aos2020. MR4298879
Wald, A. (1947) Sequential Analysis. Wiley, New York. MR0020764