The New England Journal of Statistics in Data Science logo


  • Help
Login Register

  1. Home
  2. Issues
  3. Volume 2, Issue 2 (2024)
  4. E-detectors: A Nonparametric Framework f ...

The New England Journal of Statistics in Data Science

Submit your article Information Become a Peer-reviewer
  • Article info
  • Full article
  • More
    Article info Full article

E-detectors: A Nonparametric Framework for Sequential Change Detection
Volume 2, Issue 2 (2024), pp. 229–260
Jaehyeok Shin   Aaditya Ramdas   Alessandro Rinaldo  

Authors

 
Placeholder
https://doi.org/10.51387/23-NEJSDS51
Pub. online: 1 December 2023      Type: Methodology Article      Open accessOpen Access
Area: Statistical Methodology

Accepted
12 October 2023
Published
1 December 2023

Abstract

Sequential change detection is a classical problem with a variety of applications. However, the majority of prior work has been parametric, for example, focusing on exponential families. We develop a fundamentally new and general framework for sequential change detection when the pre- and post-change distributions are nonparametrically specified (and thus composite). Our procedures come with clean, nonasymptotic bounds on the average run length (frequency of false alarms). In certain nonparametric cases (like sub-Gaussian or sub-exponential), we also provide near-optimal bounds on the detection delay following a changepoint. The primary technical tool that we introduce is called an e-detector, which is composed of sums of e-processes—a fundamental generalization of nonnegative supermartingales—that are started at consecutive times. We first introduce simple Shiryaev-Roberts and CUSUM-style e-detectors, and then show how to design their mixtures in order to achieve both statistical and computational efficiency. Our e-detector framework can be instantiated to recover classical likelihood-based procedures for parametric problems, as well as yielding the first change detection method for many nonparametric problems. As a running example, we tackle the problem of detecting changes in the mean of a bounded random variable without i.i.d. assumptions, with an application to tracking the performance of a basketball team over multiple seasons.

References

[1] 
Barnard, G. A. (1959). Control charts and stochastic processes. Journal of the Royal Statistical Society: Series B (Methodological) 21(2) 239–257.
[2] 
Chen, Y., Wang, T. and Samworth, R. J. (2022). High-dimensional, multiscale online changepoint detection. Journal of the Royal Statistical Society Series B: Statistical Methodology 84(1) 234–266. https://doi.org/10.1111/rssb.12447. MR4400396
[3] 
Desobry, F., Davy, M. and Doncarli, C. (2005). An online kernel change detection algorithm. IEEE Transactions on Signal Processing 53(8) 2961–2974. https://doi.org/10.1109/TSP.2005.851098. MR2169647
[4] 
Durrett, R. (2019) Probability: Theory and Examples 49. Cambridge University Press. https://doi.org/10.1017/9781108591034. MR3930614
[5] 
Fan, X., Grama, I. and Liu, Q. (2015). Exponential inequalities for martingales with applications. Electronic Journal of Probability 20, 1. https://doi.org/10.1214/EJP.v20-3496. MR3311214
[6] 
Gangrade, A., Rinaldo, A. and Ramdas, A. (2023). A Sequential Test for Log-Concavity. arXiv preprint arXiv:2301.03542.
[7] 
Grünwald, P., de Heide, R. and Koolen, W. (2023). Safe testing. Journal of the Royal Statistical Society, Series B. 195 47–63. https://doi.org/10.1016/j.jspi.2017.09.014. MR3760837
[8] 
Hadjiliadis, O., Zhang, H. and Poor, H. V. (2009). One shot schemes for decentralized quickest change detection. IEEE Transactions on Information Theory 55(7) 3346–3359. https://doi.org/10.1109/TIT.2009.2021311. MR2598025
[9] 
Honda, J. and Takemura, A. (2010). An Asymptotically Optimal Bandit Algorithm for Bounded Support Models. In Conference on Learning Theory 67–79. Citeseer.
[10] 
Honda, J. and Takemura, A. (2015). Non-asymptotic analysis of a new bandit algorithm for semi-bounded rewards. Journal of Machine Learning Research 16 3721–3756. MR3450551
[11] 
Howard, S. R., Ramdas, A., McAuliffe, J. and Sekhon, J. (2020). Time-uniform Chernoff bounds via nonnegative supermartingales. Probability Surveys 17 257–317. https://doi.org/10.1214/18-PS321. MR4100718
[12] 
Howard, S. R., Ramdas, A., McAuliffe, J. and Sekhon, J. (2021). Time-uniform, nonparametric, nonasymptotic confidence sequences. The Annals of Statistics 49(2) 1055–1080. https://doi.org/10.1214/20-aos1991. MR4255119
[13] 
Lai, T. L. (1995). Sequential changepoint detection in quality control and dynamical systems. Journal of the Royal Statistical Society: Series B (Methodological) 57(4) 613–644. MR1354072
[14] 
Lai, T. L. (1998). Information bounds and quick detection of parameter changes in stochastic systems. IEEE Transactions on Information Theory 44(7) 2917–2929. https://doi.org/10.1109/18.737522. MR1672051
[15] 
Lai, T. L. (2001). Sequential analysis: some classical problems and new challenges. Statistica Sinica 11(2) 303–351. MR1844531
[16] 
Li, S., Xie, Y., Dai, H. and Song, L. (2015). M-Statistic for Kernel Change-Point Detection. In Advances in Neural Information Processing Systems 28.
[17] 
Lorden, G. (1970). On excess over the boundary. The Annals of Mathematical Statistics 41 520–527. https://doi.org/10.1214/aoms/1177697092. MR0254981
[18] 
Lorden, G. (1971). Procedures for reacting to a change in distribution. The Annals of Mathematical Statistics 42 1897–1908. https://doi.org/10.1214/aoms/1177693055. MR0309251
[19] 
Manole, T. and Ramdas, A. (2023). Martingale methods for sequential estimation of convex functionals and divergences. IEEE Transactions on Information Theory 69(7) 4641–4658. MR4613565
[20] 
Mei, Y. (2010). Efficient scalable schemes for monitoring a large number of data streams. Biometrika 97(2) 419–433. https://doi.org/10.1093/biomet/asq010. MR2650748
[21] 
Moustakides, G. V. (1986). Optimal stopping times for detecting changes in distributions. The Annals of Statistics 14(4) 1379–1387. https://doi.org/10.1214/aos/1176350164. MR0868306
[22] 
Moustakides, G. V. (2008). Sequential change detection revisited. The Annals of Statistics 36(2) 787–807. https://doi.org/10.1214/009053607000000938. MR2396815
[23] 
Page, E. S. (1954). Continuous inspection schemes. Biometrika 41(1/2) 100–115. https://doi.org/10.1093/biomet/41.1-2.100. MR0088850
[24] 
Podkopaev, A., Blöbaum, P., Kasiviswanathan, S. P. and Ramdas, A. (2023). Sequential Kernelized Independence Testing. In: International Conference on Machine Learning.
[25] 
Pollak, M. and Siegmund, D. (1975). Approximations to the expected sample size of certain sequential tests. The Annals of Statistics 3(6) 1267–1282. MR0403114
[26] 
Pollak, M. (1985). Optimal detection of a change in distribution. The Annals of Statistics 13(1) 206–227. https://doi.org/10.1214/aos/1176346587. MR0773162
[27] 
Polunchenko, A. S. and Tartakovsky, A. G. (2010). On optimality of the Shiryaev-Roberts procedure for detecting a change in distribution. The Annals of Statistics 38(6) 3445–3457. https://doi.org/10.1214/09-AOS775. MR2766858
[28] 
Ramdas, A., Ruf, J., Larsson, M. and Koolen, W. (2020). Admissible anytime-valid sequential inference must rely on nonnegative martingales. arXiv preprint arXiv:2009.03167.
[29] 
Ramdas, A., Ruf, J., Larsson, M. and Koolen, W. M. (2022). Testing exchangeability: Fork-convexity, supermartingales and e-processes. International Journal of Approximate Reasoning. 141 83–109. https://doi.org/10.1016/j.ijar.2021.06.017. MR4364897
[30] 
Ramdas, A., Grünwald, P., Vovk, V. and Shafer, G. (2023). Game-theoretic statistics and safe anytime-valid inference. Statistical Science 38(4) 576–601. https://doi.org/10.1214/23-sts894. MR4665027
[31] 
Ritov, Y. (1990). Decision theoretic optimality of the CUSUM procedure. The Annals of Statistics 18(3) 1464–1469. https://doi.org/10.1214/aos/1176347761. MR1062720
[32] 
Roberts, S. (1966). A comparison of some control chart procedures. Technometrics 8(3) 411–430. https://doi.org/10.2307/1266688. MR0196887
[33] 
Shafer, G. and Vovk, V. (2019) Game-Theoretic Foundations for Probability and Finance 455. John Wiley & Sons. https://doi.org/10.1002/0471249696. MR1852450
[34] 
Shekhar, S. and Ramdas, A. (2023). Nonparametric two sample testing by betting. IEEE Transactions on Information Theory. https://doi.org/10.1109/TIT.2023.3305867
[35] 
Shekhar, S. and Ramdas, A. (2023). Sequential change detection via backward confidence sequences. In International Conference on Machine Learning.
[36] 
Shin, J., Ramdas, A. and Rinaldo, A. (2021). Nonparametric iterated-logarithm extensions of the sequential generalized likelihood ratio test. IEEE Journal on Selected Areas in Information Theory.
[37] 
Shiryaev, A. N. (1963). On optimum methods in quickest detection problems. Theory of Probability & Its Applications 8(1) 22–46. MR0155708
[38] 
Siegmund, D. and Venkatraman, E. (1995). Using the generalized likelihood ratio statistic for sequential detection of a change-point. The Annals of Statistics 23 255–271. https://doi.org/10.1214/aos/1176324466. MR1331667
[39] 
Tartakovsky, A., Nikiforov, I. and Basseville, M. (2014) Sequential Analysis: Hypothesis Testing and Changepoint Detection. CRC Press. MR3241619
[40] 
Tartakovsky, A. G. (2014). Nearly optimal sequential tests of composite hypotheses revisited. Proceedings of the Steklov Institute of Mathematics 287(1) 268–288. https://doi.org/10.1134/S0081543814080161. MR3484334
[41] 
Tartakovsky, A. G., Pollak, M. and Polunchenko, A. S. (2012). Third-order Asymptotic Optimality of the Generalized Shiryaev–Roberts Changepoint Detection Procedures. Theory of Probability & Its Applications 56(3) 457–484. https://doi.org/10.4213/tvp4406. MR3136464
[42] 
Ville, J. (1939). Étude critique de la notion de collectif. Monographies des Probabilités 3 144. MR3533075
[43] 
Vovk, V. (2021). Testing randomness online. Statistical Science 36(4) 595–611. https://doi.org/10.1214/20-sts817. MR4323055
[44] 
Vovk, V., Gammerman, A. and Shafer, G. (2005) Algorithmic Learning in a Random World. Springer Science & Business Media. MR2161220
[45] 
Wang, H. and Ramdas, A. (2023). Catoni-style confidence sequences for heavy-tailed mean estimation. Stochastic Processes and Applications 163 168–202. https://doi.org/10.1016/j.spa.2023.05.007. MR4610125
[46] 
Wang, H. and Ramdas, A. (2023). Huber-robust confidence sequences. In International Conference on Artificial Intelligence and Statistics 9662–9679. PMLR.
[47] 
Wasserman, L., Ramdas, A. and Balakrishnan, S. (2020). Universal inference. Proceedings of the National Academy of Sciences 117(29) 16880–16890. https://doi.org/10.1073/pnas.1922664117. MR4242731
[48] 
Waudby-Smith, I. and Ramdas, A. (2023). Estimating means of bounded random variables by betting. Journal of the Royal Statistical Society Series B: Statistical Methodology.
[49] 
Willsky, A. and Jones, H. (1976). A generalized likelihood ratio approach to the detection and estimation of jumps in linear systems. IEEE Transactions on Automatic Control 21(1) 108–112. https://doi.org/10.1109/tac.1976.1101146. MR0396019
[50] 
Xie, Y. and Siegmund, D. (2013). Sequential multi-sensor change-point detection. In 2013 Information Theory and Applications Workshop (ITA) 1–20. IEEE. https://doi.org/10.1214/13-AOS1094. MR3099117
[51] 
Zou, S., Veeravalli, V. V., Li, J. and Towsley, D. (2019). Quickest detection of dynamic events in networks. IEEE Transactions on Information Theory 66(4) 2280–2295. https://doi.org/10.1109/TIT.2019.2948350. MR4087685

Full article PDF XML
Full article PDF XML

Copyright
© 2024 New England Statistical Society
by logo by logo
Open access article under the CC BY license.

Keywords
Sequential change detection Nonparametric Shiryaev-Roberts CUSUM

Funding
A. Ramdas was partially supported by NSF grants IIS-2229881 and DMS-2310718. J. Shin and A. Rinaldo were partially supported by NSF grant DMS-EPSRC 2015489.

Metrics
since December 2021
627

Article info
views

114

Full article
views

220

PDF
downloads

45

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

The New England Journal of Statistics in Data Science

  • ISSN: 2693-7166
  • Copyright © 2021 New England Statistical Society

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer
Powered by PubliMill  •  Privacy policy