Help

Home
Search

The New England Journal of Statistics in Data Science

Submit your article Information Become a Peer-reviewer

Journal home
To appear
Current issue
All issues
More
Journal home To appear Current issue All issues

Detailed search

Title

Author

Types

Areas

Abstract

Keywords

Published

Pages

Volumes

Issues

DOI

Affiliation

Search results 109

Order by:

Select: All None Download:

Unsupervised Cell Segmentation by Fast Gaussian Processes

Laura Baracaldo Blythe King Haoran Yan All authors (6)

https://doi.org/10.51387/26-NEJSDS97

Pub. online: 28 Jan 2026 Type: Methodology Article

Open Access

Area: Engineering Science

Journal: The New England Journal of Statistics in Data Science

Abstract

Cell boundary information is crucial for analyzing cell behaviors from time-lapse microscopy videos. Existing supervised cell segmentation tools, such as ImageJ, require tuning various parameters and rely on restrictive assumptions about the shape of the objects. While recent supervised segmentation tools based on convolutional neural networks enhance accuracy, they depend on high-quality labeled images, making them unsuitable for segmenting new types of objects not in the database. We developed a novel unsupervised cell segmentation algorithm based on fast Gaussian processes for noisy microscopy images without the need for parameter tuning or restrictive assumptions about the shape of the object. We derived robust thresholding criteria adaptive for heterogeneous images containing distinct brightness at different parts to separate objects from the background, and employed watershed segmentation to distinguish touching cell objects. Both simulated studies and real-data analysis of large microscopy images demonstrate the scalability and accuracy of our approach compared with the alternatives.

Dose Finding and Related Topics in Drug Development

Tianjian Zhou Yiming Zhang Qian H. Li

https://doi.org/10.51387/25-NEJSDS33EDI

Pub. online: 23 Jan 2026 Type: Editorial

Open Access

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 209–211

Lessons Learned from the Bayesian Design and Analysis for the BNT162b2 COVID-19 Vaccine Phase 3 Trial

Yuan Ji Shijie Yuan

https://doi.org/10.51387/26-NEJSDS93

Pub. online: 22 Jan 2026 Type: Commentary And/or Historical Perspective

Open Access

Area: Cancer Research

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 2 (2025), pp. 159–163

Abstract

The phase III BNT162b2 mRNA COVID-19 vaccine trial is based on a Bayesian design and analysis, and the main evidence of vaccine efficacy is presented in Bayesian statistics. Confusion and mistakes arise in the presentation of the Bayesian results. Some key statistics, such as Bayesian credible intervals, are mislabeled and stated as confidence intervals. Posterior probabilities of the vaccine efficacy are not reported as the main results. We illustrate the main differences in the reporting of Bayesian analysis results for a clinical trial and provide four recommendations. We argue that statistical evidence from a Bayesian trial, when presented properly, is easier to interpret and directly addresses the main clinical questions, thereby better supporting regulatory decision making. We also recommend using the abbreviation “BI” to represent Bayesian credible interval as a differentiation to “CI” which stands for confidence interval.

Risks in Finding Doses for a New Drug

Naitee Ting

https://doi.org/10.51387/25-NEJSDS081

Pub. online: 6 Jan 2026 Type: Research Communication

Open Access

Area: Biomedical Research

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 212–221

Abstract

The biggest risk in new drug development is either unaware of, or under-estimate the potential risks in designing clinical trials. Among all challenges in drug development, the most critical one is about finding the appropriate dose(s) for the study drug in treating patients. Designing dose finding clinical trials involves in many potential risks. In practice, most of the expensive failures in drug development originated not from “We did not know”, rather, the mistake is “We thought we knew”. In other words, greatest risks came from lack of awareness of underlying assumptions. This manuscript attempts to discuss some of these risks and make recommendations to reduce risks in the design of dose finding clinical trials. This is not a complete list of risks, but it is only starting the discussion.

Bayesian Information Sharing for Equivalence Testing with an Application to Dose Proportionality Studies

Wenru Zhou Samantha MaWhinney Peter Anderson All authors (4)

https://doi.org/10.51387/25-NEJSDS95

Pub. online: 5 Jan 2026 Type: Methodology Article

Open Access

Area: Biomedical Research

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 242–252

Abstract

Dose proportionality is an essential aspect of pharmacokinetics (PK). We aim to enhance the efficiency of PK studies by incorporating interim analyses and utilizing data from past trials to increase precision and enable early termination of studies if applicable. In this paper, we extend the multisource exchangeability model (MEM) to the setting with correlated data with interim analyses. Simulation results indicate that the MEM estimators are efficient even with smaller sample sizes, although smaller sample sizes may have higher mean square error (MSE) and bias due to early stopping and more liberal data borrowing from non-exchangeable supplementary sources. Our recommendation is to use a constrained MEM approach when considering small sample sizes, with additional caution needed around the equivalence boundary to better control the inflated type I error rate, bias, and MSE. This research extends the application of MEMs from linear regression models to settings with correlated data using linear mixed effects regression models. It also innovatively applies MEMs to equivalence testing in the context of dose proportionality studies, thereby enhancing their efficiency.

LiMAP-Curvature: A Simple Model-Free Approach for Analysing Dose-Finding Studies

Linxi Han Qiqi Deng

Zhangyi He All authors (4)

https://doi.org/10.51387/25-NEJSDS90

Pub. online: 10 Sep 2025 Type: Methodology Article

Open Access

Area: Statistical Methodology

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 265–271

Abstract

MCP-Mod (Multiple Comparison Procedure-Modelling) is an efficient statistical method for the analysis of Phase II dose-finding trials, although it requires specialised expertise to pre-specify plausible candidate models along with model parameters. This can be problematic given limited knowledge of the agent/compound being studied, and misspecification of candidate models and model parameters can severely degrade its performance. To circumvent this challenge, in the work, we introduce LiMAP-curvature, a Bayesian model-free approach for the detection of the dose-response signal in Phase II dose-finding trials. LiMAP-curvature is built upon a Bayesian hierarchical framework incorporating information about the total curvature of the dose-response curve. Through extensive simulations, we show that LiMAP-curvature has comparable performance to MCP-Mod if the true underlying dose-response model is included in the candidate model set of MCP-Mod. Otherwise, LiMAP-curvature can achieve performance superior to that of MCP-Mod, especially when the true dose-response model drastically deviates from candidate models in MCP-Mod.

Locally Adaptive Modeling of Unconditional Heteroskedasticity

Matthias R. Fengler Bruno Jäger Ostap Okhrin

https://doi.org/10.51387/25-NEJSDS91

Pub. online: 10 Sep 2025 Type: Methodology Article

Open Access

Area: Statistical Methodology

Journal: The New England Journal of Statistics in Data Science

Abstract

We study local change point detection in variance using generalized likelihood ratio tests. Building on [24], we utilize the multiplier bootstrap to approximate the unknown, non-asymptotic distribution of the test statistic and introduce a multiplicative bias correction that improves upon the existing additive version. This proposed correction offers a clearer interpretation of the bootstrap estimators while significantly reducing computational costs. Simulation results demonstrate that our method performs comparably to the original approach. We apply it to the growth rates of U.S. inflation, industrial production, and Bitcoin returns.

Shift Models for Dose-Finding in Ordered Groups with Late-Onset Toxicity

Rami Hawila Nolan A. Wages

https://doi.org/10.51387/25-NEJSDS88

Pub. online: 9 Jul 2025 Type: Methodology Article

Open Access

Area: Cancer Research

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 234–241

Abstract

Historically, the primary objective of Phase I clinical trials has been to pick an optimal dose in terms of patient safety, referred to as the maximum tolerated dose (MTD). Most of these trials recommend a “one-size-fits-all” dose for the patient population being studied, while also solely focusing on short-term adverse events. Often patient heterogeneity exists so that group-specific dose selection is of interest. To address the issue of patient heterogeneity, several dose-finding methods have been proposed, including the shift model framework based on the Continual Reassessment Method (CRM). Additionally, for many cancer therapies, relevant toxicities may be defined by participants experiencing adverse events at any point over a long evaluation window, resulting in pending outcomes when new participants need to be assigned a dose. By leveraging the CRM, the time-to-event continual reassessment method (TITE-CRM) provides a feasible approach for addressing this issue. Motivated by a Phase I trial involving radiotherapy that included two patient groups conducted at the University of Virginia, we have developed a hybrid design that combines elements from the TITE-CRM and the shift model framework. This approach helps address patient heterogeneity and late-onset toxicity simultaneously. We illustrate how to perform a dose-finding trial using the proposed design, and compare its operating characteristics to other suggested methods in the field by conducting a simulation study. The shift model TITE-CRM is shown to be a practical design with good operating characteristics in regard to selecting the correct MTD in each group. An R package is also being developed, allowing investigators to provide group-specific MTD recommendations by applying the proposed design, in addition to providing operating characteristics for custom simulation settings.

Additional Considerations for Single-Arm Trials to Support Accelerated Approval of Oncology Drugs

Feinan Lu Tao Wang Ying Lu All authors (4)

https://doi.org/10.51387/25-NEJSDS89

Pub. online: 1 Jul 2025 Type: Research Communication

Open Access

Area: Cancer Research

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 2 (2025), pp. 164–166

Three-Outcome Dual-Criterion Randomized Phase II Clinical Trial Design

Yujia Wang Xiaohan Chi Ruitao Lin

https://doi.org/10.51387/25-NEJSDS83

Pub. online: 7 May 2025 Type: Methodology Article

Open Access

Area: Statistical Methodology

Journal: The New England Journal of Statistics in Data Science Volume 3, Issue 3 (2025), pp. 272–281

Abstract

The high cost of drug development and the relatively low success rates of phase III clinical trials highlight the need for improved and reasonably sized phase II trial designs, especially when responses observed in treatment and control could not lead to a clear-cut decision warranting further studies. To this end, we propose a three-outcome dual-criterion randomized (TDR) trial design, which implements inconclusive region sculpting using boundaries defined by both statistically significant differences between treatment and control as well as the clinical relevance of treatment responses. We provide statistical justifications for the TDR design in both one-stage and two-stage trial settings. Additionally, we evaluate its operating characteristics through a comparison with existing designs. The proposed design is shown able to achieve sample size saving and type II error reduction while controlling the type I error at a marginal cost of power reduction. Lastly, robustness under various deviations from the assumed control response rate is also demonstrated.

1 2 3 4

Items per page

Export citation

Copy and paste formatted citation

Formatted citation

Placeholder

Citation style

Download citation in file

Export format

Authors

Placeholder

RSS

The New England Journal of Statistics in Data Science

ISSN: 2693-7166

About

About journal

For contributors

Submit
OA Policy
Become a Peer-reviewer