The New England Journal of Statistics in Data Science logo


  • Help
Login Register

  1. Home
  2. To appear
  3. Inverse Probability Weighting: From Surv ...

The New England Journal of Statistics in Data Science

Submit your article Information Become a Peer-reviewer
  • Article info
  • Full article
  • More
    Article info Full article

Inverse Probability Weighting: From Survey Sampling to Evidence Estimation
Jyotishka Datta ORCID icon link to view author Jyotishka Datta details   Nicholas Polson  

Authors

 
Placeholder
https://doi.org/10.51387/26-NEJSDS100
Pub. online: 13 April 2026      Type: Methodology Article      Open accessOpen Access
Area: Statistical Methodology

Accepted
3 February 2026
Published
13 April 2026

Abstract

We consider the class of inverse probability weight (IPW) estimators, including the popular Horvitz–Thompson and Hájek estimators used routinely in survey sampling, causal inference and for Bayesian computation. We focus on the ‘weak paradoxes’ for these estimators due to two counterexamples by Basu (1988) and Wasserman (2004) and investigate the two natural Bayesian answers to this problem: one based on binning and smoothing: a ‘Bayesian sieve’ and the other based on a conjugate hierarchical model that allows borrowing information via exchangeability. We compare the mean squared errors for the two Bayesian estimators with the IPW estimators for Wasserman’s example via simulation studies on a broad range of parameter configurations. We also prove posterior consistency for the Bayes estimators under missing-completely-at-random assumption and show that it requires fewer assumptions on the inclusion probabilities. We also revisit the connection between the different problems where improved or adaptive IPW estimators will be useful, including survey sampling, evidence estimation strategies such as Conditional Monte Carlo, Riemannian sum, Trapezoidal rules and vertical likelihood, as well as average treatment effect estimation in causal inference.

References

 
Basu, D. Statistical information and likelihood: a collection of critical essays by Dr. D. Basu. (J. K. Ghosh, ed.) Lecture Notes in Statistics 45. Springer Science & Business Media (1988). MR0953081
 
Bhadra, A., Datta, J., Polson, N. G. and Willard, B. Lasso meets horseshoe. Statistical Science 34(3) 405–427 (2019). https://doi.org/10.1214/19-STS700. MR4017521
 
Chauvet, G. A note on the consistency of the narain-horvitz-thompson estimator (2014). arXiv preprint arXiv:1412.2887.
 
Chen, Q., Elliott, M. R., Haziza, D., Yang, Y., Ghosh, M., Little, R. J., Sedransk, J. and Thompson, M. Approaches to improving survey-weighted estimates. Statistical Science 32(2) 227–248 (2017). https://doi.org/10.1214/17-STS609. MR3648957
 
Chopin, N. and Robert, C. P. Properties of nested sampling. Biometrika 97(3) 741–755 (2010). https://doi.org/10.1093/biomet/asq021. MR2672495
 
Cunningham, S. Causal inference. In Causal Inference. Yale University Press (2021).
 
Datta, J. and Polson, N. G. Quantile importance sampling (2025). arXiv preprint arXiv:2305.03158. https://doi.org/10.1214/25-bjps638. MR5007225
 
Delevoye, A. and Sävje, F. Consistency of the horvitz–thompson estimator under general sampling and experimental designs. Journal of Statistical Planning and Inference 207. 190–197 (2020). https://doi.org/10.1016/j.jspi.2019.12.002. MR4066130
 
Diaconis, P. Bayesian numerical analysis. Statistical decision theory and related topics IV 1. 163–175 (1988). MR0927099
 
Efron, B. and Morris, C. Stein’s paradox in statistics. Scientific American 236(5) 119–127 (1977).
 
Firth, D. On improved estimation for importance sampling. Brazilian Journal of Probability and Statistics 25(3) 437–443 (2011). https://doi.org/10.1214/11-BJPS155. MR2832895
 
Ghosh, J. K. Weak paradoxes and paradigms. In Statistical Paradigms: Recent Advances and Reconciliations 3–12. World Scientific (2015). https://doi.org/10.1142/9789814343961_0001. MR3308074
 
Hahn, P. R., Carvalho, C. M., Puelz, D. and He, J. Regularization and confounding in linear regression for treatment effect estimation. Bayesian Analysis 13(1) 163–182 (2018). https://doi.org/10.1214/16-BA1044. MR3737947
 
Harmeling, S. and Touissant, M. Bayesian estimators for robins-ritov’s problem. Technical report, (2007).
 
Haziza, D. and Beaumont, J.-F. (2017). Construction of weights in surveys: A review. https://doi.org/10.1214/16-STS608. MR3648956
 
Hesterberg, T. Weighted average importance sampling and defensive mixture distributions. Technometrics 37(2) 185–194 (1995).
 
Hesterberg, T. C. Advances in importance sampling. PhD thesis, Stanford University (1988). MR2637036
 
Horvitz, D. G. and Thompson, D. J. A generalization of sampling without replacement from a finite universe. Journal of the American statistical Association 47(260) 663–685 (1952). MR0053460
 
Hájek, J. Comment on “an essay on the logical foundations of survey sampling” by basu. (V. P. Godambe and D. A. Sprott, eds.) In Foundations of Statistical Inference 236–242. Holt, Rinehart and Winston (1971). MR0423625
 
Imbens, G. W. Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and statistics 86(1) 4–29 (2004).
 
Isaki, C. T. and Fuller, W. A. Survey design under the regression superpopulation model. Journal of the American Statistical Association 77(377) 89–96 (1982). MR0648029
 
James, W. and Stein, C. Estimation with quadratic loss. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics 361–379. University of California Press (1961). MR0133191
 
Khan, S. and Ugander, J. Adaptive normalization for ipw estimation. Journal of Causal Inference 11(1) 20220019 (2023). https://doi.org/10.1515/jci-2022-0019. MR4545024
 
Kim, C., Tec, M. and Zigler, C. M. Bayesian nonparametric adjustment of confounding (2022). arXiv preprint arXiv:2203.11798. https://doi.org/10.1111/biom.13833. MR4680719
 
Kong, A., McCullagh, P., Meng, X.-L., Nicolae, D. and Tan, Z. A theory of statistical models for monte carlo integration. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 65(3) 585–604 (2003). https://doi.org/10.1111/1467-9868.00404. MR1998624
 
Li, L. Are bayesian inferences weak for wasserman’s example? Communications in Statistics—Simulation and Computationő 39(3) 655–667 (2010). https://doi.org/10.1080/03610910903576540. MR2784548
 
Linero, A. R. In nonparametric and high-dimensional models, bayesian ignorability is an informative prior. Journal of the American Statistical Association 119(548) 2785–2798 (2024). https://doi.org/10.1080/01621459.2023.2278202. MR4833915
 
Little, R. J. Weighting and prediction in sample surveys. Calcutta Statistical Association Bulletin 60(3–4) 147–167 (2008). https://doi.org/10.1177/0008068320080301. MR2553424
 
Madrid-Padilla, O.-H., Polson, N. G. and Scott, J. A deconvolution path for mixtures. Electronic Journal of Statistics 12(1) 1717–1751 (2018). https://doi.org/10.1214/18-ejs1430. MR3806437
 
Narain, R. On sampling without replacement with varying probabilities. Journal of the Indian Society of Agricultural Statistics 3(2) 169–175 (1951). MR0045354
 
Neyman, J. On the two different aspects of the representative method: The method of stratified sampling and the method of purposive selection. Journal of the Royal Statistical Society Series A: Statistics in Society 97(4) 558–606 (1934).
 
Owen, A. B. Comment: Unreasonable effectiveness of monte carlo. Statistical Science 34(1) 29–33 (2019). https://doi.org/10.1214/18-STS676. MR3938960
 
Philippe, A. Processing simulation output by riemann sums. Journal of Statistical Computation and Simulation 59(4) 295–314 (1997). https://doi.org/10.1080/00949659708811863. MR1623186
 
Philippe, A. and Robert, C. P. Riemann sums for mcmc estimation and convergence monitoring. Statistics and Computing 11(2) 103–115 (2001). https://doi.org/10.1023/A:1008926514119. MR1837130
 
Polson, N. G. and Scott, J. G. Shrink globally, act locally: Sparse Bayesian regularization and prediction. Bayesian Statistics 9. 501–538 (2010). https://doi.org/10.1093/acprof:oso/9780199694587.003.0017. MR3204017
 
Polson, N. G. and Scott, J. G. Local shrinkage rules, lévy processes and regularized regression. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 74(2) 287–311 (2012). https://doi.org/10.1111/j.1467-9868.2011.01015.x. MR2899864
 
Polson, N. G. and Scott, J. G. (2014). Vertical-likelihood monte carlo. arXiv preprint arXiv:1409.3601.
 
Raftery, A. E., Newton, M. A., Satagopan, J. M. and Krivitsky, P. N. Estimating the integrated likelihood via posterior simulation using the harmonic mean identity (2006). MR2433201
 
Ramakrishnan, M. An alternative proof of the admissibility of the horvitz-thompson estimator. The Annals of Statistics 1(3) 577–579 (1973). MR0408061
 
Rao, J. N., Chaudhuri, A., Eltinge, J., Fay, R. E., Ghosh, J., Ghosh, M., Lahiri, P. and Pfeffermann, D. Some current trends in sample survey theory and methods (with discussion). Sankhy: The Indian Journal of Statistics, Series B 1–57 (1999). MR1720726
 
Ritov, Y., Bickel, P. J., Gamst, A. C. and Kleijn, B. J. K. The bayesian analysis of complex, high-dimensional models: Can it be coda? Statistical Science 29(4) 619–639 (2014). https://doi.org/10.1214/14-STS483. MR3300362
 
Robins, J. M. and Ritov, Y. Toward a curse of dimensionality appropriate (coda) asymptotic theory for semi-parametric models. Statistics in medicine 16(3) 285–319 (1997).
 
Rubin, D. B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66(5) 688–701 (1974).
 
Särndal, C.-E., Swensson, B. and Wretman, J. Model assisted survey sampling. Springer Science & Business Media (2003). https://doi.org/10.1007/978-1-4612-4378-6. MR1140409
 
Shortreed, S. M. and Ertefaie, A. Outcome-adaptive lasso: variable selection for causal inference. Biometrics 73(4) 1111–1122 (2017). https://doi.org/10.1111/biom.12679. MR3744525
 
Sims, C. Understanding non-bayesians. Unpublished chapter, Department of Economics, Princeton University (2010).
 
Sims, C. A. Thinking about instrumental variables. manuscript, Department of Economics, Princeton University (2007) manuscript.
 
Sims, C. A. On an example of larry wasserman, round 2 (2012).
 
Skilling, J. Nested sampling for general bayesian computation. Bayesian analysis 1(4) 833–859 (2006). https://doi.org/10.1214/06-BA127. MR2282208
 
Stigler, S. M. The 1988 neyman memorial lecture: a galtonian perspective on shrinkage estimators. Statistical Science 147–155 (1990). MR1054859
 
Trotter, H. F. and Tukey, H. Conditional monte carlo for normal samples. In Proc. Symp. on Monte Carlo Methods 64–79. John Wiley and Sons (1956). MR0079825
 
Wang, C., Dominici, F., Parmigiani, G. and Zigler, C. M. Accounting for uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models. Biometrics 71(3) 654–665 (2015). https://doi.org/10.1111/biom.12315. MR3402601
 
Wang, C., Parmigiani, G. and Dominici, F. Bayesian effect estimation accounting for adjustment uncertainty. Biometrics 68(3) 661–671 (2012). https://doi.org/10.1111/j.1541-0420.2011.01731.x. MR3055168
 
Wasserman, L. Bayesian inference. In All of Statistics 175–192. Springer (2004). https://doi.org/10.1007/978-0-387-21736-9. MR2055670
 
Yakowitz, S., Krimmel, J. and Szidarovszky, F. Weighted monte carlo integration. SIAM Journal on Numerical Analysis 15(6) 1289–1300 (1978). https://doi.org/10.1137/0715088. MR0512700

Full article PDF XML
Full article PDF XML

Copyright
© 2026 New England Statistical Society
by logo by logo
Open access article under the CC BY license.

Keywords
Inverse probability weighting Horvitz–Thompson Hájek Importance sampling Stein phenomenon Bias-variance trade-off Evidence Estimation

Funding
Dr. Datta acknowledges support from the National Science Foundation (NSF DMS-2015460 and NSF CAREER 2443282).

Metrics
since December 2021
14

Article info
views

3

Full article
views

4

PDF
downloads

0

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

The New England Journal of Statistics in Data Science

  • ISSN: 2693-7166
  • Copyright © 2021 New England Statistical Society

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer
Powered by PubliMill  •  Privacy policy