<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">NEJSDS</journal-id>
<journal-title-group><journal-title>The New England Journal of Statistics in Data Science</journal-title></journal-title-group>
<issn pub-type="ppub">2693-7166</issn><issn-l>2693-7166</issn-l>
<publisher>
<publisher-name>New England Statistical Society</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">NEJSDS74</article-id>
<article-id pub-id-type="doi">10.51387/24-NEJSDS74</article-id>
<article-categories><subj-group subj-group-type="area">
<subject>Statistical Methodology</subject></subj-group><subj-group subj-group-type="heading">
<subject>Methodology Article</subject></subj-group></article-categories>
<title-group>
<article-title>Up-and-Down: The Most Popular, Most Reliable, and Most Overlooked Dose-Finding Design</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Oron</surname><given-names>Assaf P.</given-names></name><email xlink:href="mailto:assaf@uw.edu">assaf@uw.edu</email><xref ref-type="aff" rid="j_nejsds74_aff_001"/><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Flournoy</surname><given-names>Nancy</given-names></name><email xlink:href="mailto:flournoyn@umsystem.edu">flournoyn@umsystem.edu</email><xref ref-type="aff" rid="j_nejsds74_aff_002"/>
</contrib>
<aff id="j_nejsds74_aff_001">Institute for Health Metrics and Evaluation, <institution>University of Washington</institution>, Seattle, WA, <country>USA</country>. E-mail address: <email xlink:href="mailto:assaf@uw.edu">assaf@uw.edu</email></aff>
<aff id="j_nejsds74_aff_002"><institution>University of Missouri System</institution>, Columbia, MO, <country>USA</country>. E-mail address: <email xlink:href="mailto:flournoyn@umsystem.edu">flournoyn@umsystem.edu</email></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2025</year></pub-date><pub-date pub-type="epub"><day>17</day><month>12</month><year>2024</year></pub-date><volume>3</volume><issue>3</issue><fpage>222</fpage><lpage>233</lpage><history><date date-type="accepted"><day>5</day><month>11</month><year>2024</year></date></history>
<permissions><copyright-statement>© 2025 New England Statistical Society</copyright-statement><copyright-year>2025</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Up-and-Down designs (UDDs) are ubiquitous for dose-finding in a wide variety of scientific, engineering, and clinical fields. They are defined by a few simple rules that generate a random walk around the target percentile. UDDs’ combination of robust, tractable behavior, straightforward usage, and good dose-finding performance has won the trust of practitioners and their consulting analysts across fields and continents. In contrast, in recent decades the statistical dose-finding design field has turned a cold shoulder towards UDDs, and it is quite possible that many younger dose-finding methods researchers are not even aware of this design approach.</p>
<p>We present a concise overview of UDDs and their current state-of-the-art methodology, with references for further inquiry. We also revisit the performance comparison between UDDs and novel, more complicated design approaches such as the Continual Reassessment Method and the Bayesian Optimal Interval design, which we group under the term “Aim-for-Target” designs. UDDs fare very well in the comparison, particularly in terms of robustness to sources of variability.</p>
</abstract>
<kwd-group>
<label>Keywords and phrases</label>
<kwd>Adaptive designs</kwd>
<kwd>Dose-finding</kwd>
<kwd>Up-and-Down</kwd>
<kwd>Staircase method</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="j_nejsds74_s_001">
<label>1</label>
<title>Introduction</title>
<p>Up-and-Down designs (UDDs) were developed in the 1940s, independently on two continents and in two different fields: sensory studies [<xref ref-type="bibr" rid="j_nejsds74_ref_065">65</xref>] and explosive testing [<xref ref-type="bibr" rid="j_nejsds74_ref_010">10</xref>]. They remain the dose-finding method of choice in both fields [<xref ref-type="bibr" rid="j_nejsds74_ref_059">59</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_044">44</xref>], and are very popular in many other fields including anesthesiology [<xref ref-type="bibr" rid="j_nejsds74_ref_039">39</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_052">52</xref>], dentistry [<xref ref-type="bibr" rid="j_nejsds74_ref_061">61</xref>], toxicology [<xref ref-type="bibr" rid="j_nejsds74_ref_054">54</xref>], materials science and engineering [<xref ref-type="bibr" rid="j_nejsds74_ref_025">25</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_057">57</xref>], electrical engineering [<xref ref-type="bibr" rid="j_nejsds74_ref_071">71</xref>], and more. UDDs are considered a standard or recommended design in these fields by many national [<xref ref-type="bibr" rid="j_nejsds74_ref_002">2</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_011">11</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_034">34</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_043">43</xref>] and international [<xref ref-type="bibr" rid="j_nejsds74_ref_029">29</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_030">30</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_042">42</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_045">45</xref>] organizations.</p>
<p>One surprising domain where UDDs have become rather <italic>unpopular</italic> and mostly neglected in recent decades is the statistical field of dose-finding methodology. Amid a veritable explosion of articles presenting, modifying, and discussing novel dose-finding designs, UDD methodology articles have dwindled to less than a trickle. This relative silence in the statistical community is surprising in several ways:</p>
<list>
<list-item id="j_nejsds74_li_001">
<label>•</label>
<p>Statisticians spearheaded UDD methodological development during the design’s early decades;</p>
</list-item>
<list-item id="j_nejsds74_li_002">
<label>•</label>
<p>Those decades, followed by rather abrupt neglect, left behind many key unresolved challenges, both theoretical and practical;</p>
</list-item>
<list-item id="j_nejsds74_li_003">
<label>•</label>
<p>Judging by the sheer number of UDD experiments taking place across such a wide array of fields, one would expect that statistical consulting needs alone would have spurred many statisticians to continue investigating and improving UDD methodology. To wit, Oron’s long affair with UDDs began with a 2003 graduate-student consulting project.</p>
</list-item>
<list-item id="j_nejsds74_li_004">
<label>•</label>
<p>Last, but not least: when one compares UDDs’ dose-finding performance with newer, far more complicated designs, and does so on a level playing field – UDDs tend to hold their own [<xref ref-type="bibr" rid="j_nejsds74_ref_013">13</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_019">19</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_048">48</xref>]. When robustness is examined, UDDs are generally far more robust than these newer designs.</p>
</list-item>
</list>
<p>The last point alone, a variation on Occam’s Razor, should convince statisticians to take UDDs seriously again. Why invest so much in design overhead when a simpler, more straightforward method does the job at least as well? One plausible explanation for the collective overlooking of UDDs is that after several decades outside the statistical limelight, they have simply receded beyond the horizon of methods that most active and incoming statisticians are familiar with. Our aim here is to pique the reader’s interest regarding UDDs, and to provide concrete information for getting started, both methodologically and in a consulting capacity. Following a brief overview of UDDs and recent methodological developments, we will present fresh simulation data comparing UDD performance with leading newer designs. The latter will be described and discussed only to the extent required for such a comparison, as we maintain the article’s main focus upon UDDs. We end with a general discussion.</p>
</sec>
<sec id="j_nejsds74_s_002">
<label>2</label>
<title>Up-and-Down Overview</title>
<sec id="j_nejsds74_s_003">
<label>2.1</label>
<title>Basics</title>
<p>Due in part to the scarcity of authoritative material, there is no single definition that distinguishes UDDs from other dose-finding designs, some of which are closely related. We prefer to define UDDs as sharing the following elements [<xref ref-type="bibr" rid="j_nejsds74_ref_052">52</xref>]:</p>
<list>
<list-item id="j_nejsds74_li_005">
<label>1.</label>
<p>The responses, <inline-formula id="j_nejsds74_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="bold">Y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{Y}=\{{Y_{i}},\hspace{2.5pt}i=1,\hspace{2.5pt}\dots ,\hspace{2.5pt}n\}$]]></tex-math></alternatives></inline-formula>, are binary or dichotomized.<xref ref-type="fn" rid="j_nejsds74_fn_001">1</xref><fn id="j_nejsds74_fn_001"><label><sup>1</sup></label>
<p>Some ordinal forms of <bold>Y</bold> may also be possible; see Discussion.</p></fn> We will refer to the two options verbally as “positive” and “negative”, even though they are coded numerically as 1 and 0.</p>
</list-item>
<list-item id="j_nejsds74_li_006">
<label>2.</label>
<p>The treatments <inline-formula id="j_nejsds74_ineq_002"><alternatives><mml:math>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{X}=\{{X_{i}},\hspace{2.5pt}i=1,\hspace{2.5pt}\dots ,\hspace{2.5pt}n\}$]]></tex-math></alternatives></inline-formula> (often generically known as <italic>“doses”</italic>) are selected from a discrete set of increasing values <inline-formula id="j_nejsds74_ineq_003"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mo stretchy="false">⋯</mml:mo>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">M</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$\mathcal{X}={d_{1}}\lt {d_{2}}\lt \cdots \lt {d_{M}}$]]></tex-math></alternatives></inline-formula>, which we will call <bold>dose levels</bold>. We assume here that <inline-formula id="j_nejsds74_ineq_004"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula> is finite, without loss of generality.<xref ref-type="fn" rid="j_nejsds74_fn_002">2</xref><fn id="j_nejsds74_fn_002"><label><sup>2</sup></label>
<p>Preferably, dose levels are uniformly spaced in an arithmetic or geometric sequence, but this is not required.</p></fn></p>
</list-item>
<list-item id="j_nejsds74_li_007">
<label>3.</label>
<p>The probability of positive response is monotone over <inline-formula id="j_nejsds74_ineq_005"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>; without loss of generality we assume monotone increasing. The probability is usually denoted via the dose-response function <inline-formula id="j_nejsds74_ineq_006"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula>, where <italic>x</italic> is the continuous treatment-magnitude variable. The dose levels <inline-formula id="j_nejsds74_ineq_007"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula> are simply specific discrete values of <italic>x</italic>. It is common and often useful to think of <inline-formula id="j_nejsds74_ineq_008"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> as a cumulative distribution function (CDF) of response thresholds, but it is not required.</p>
</list-item>
<list-item id="j_nejsds74_li_008">
<label>4.</label>
<p>Treatments are allocated sequentially, and for each new subject or cohort the only allowed transitions are increasing by one dose level, decreasing by one dose level, or repeating the current level. Hence the design’s name, “up-and-down,” or (in sensory studies and materials testing) the <italic>“Staircase Method”</italic> [<xref ref-type="bibr" rid="j_nejsds74_ref_063">63</xref>].</p>
</list-item>
<list-item id="j_nejsds74_li_009">
<label>5.</label>
<p><bold>Dose-transition rules</bold> are based on the treatments and responses of the most recent observations – up to <italic>k</italic> of them (with <inline-formula id="j_nejsds74_ineq_009"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[$k\ge 1$]]></tex-math></alternatives></inline-formula> constant), and possibly also on a few additional fixed design parameters. The rules involve no estimation.</p>
</list-item>
<list-item id="j_nejsds74_li_010">
<label>6.</label>
<p>UDDs have no intrinsic stopping rules, although such rules can optionally be added.</p>
</list-item>
</list>
<p>Using this terminology, a dose-finding experiment’s typical goal is estimating the <bold>target percentile</bold> (also known as the “target dose” or simply “the target”) <inline-formula id="j_nejsds74_ineq_010"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma ),\Gamma \in (0,1)$]]></tex-math></alternatives></inline-formula>.</p>
<p>Elements 1–3 in the list above are common to dose-finding designs in many fields, and define the dose-finding task’s characteristic constraints. Element 4 has become a widely (though not universally) accepted guideline across most dose-finding designs. The remaining two elements turn a grid-based dose-finding design into a UDD. With UDDs, <bold>X</bold> is a random walk over <inline-formula id="j_nejsds74_ineq_011"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>. It is also a regular random walk, meaning that the distribution of <bold>X</bold> over <inline-formula id="j_nejsds74_ineq_012"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula> converges to a stationary distribution <inline-formula id="j_nejsds74_ineq_013"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula>.</p>
<p>UDD dose-transition probabilities depend only upon <inline-formula id="j_nejsds74_ineq_014"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> and the design’s specific rules. If the ‘up’ transition probability decreases with increasing <inline-formula id="j_nejsds74_ineq_015"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> and vice versa for the ‘down’ probability, then the UDD generates a random walk with a central tendency [<xref ref-type="bibr" rid="j_nejsds74_ref_014">14</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_027">27</xref>], and <inline-formula id="j_nejsds74_ineq_016"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula> is sharply peaked around <inline-formula id="j_nejsds74_ineq_017"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula> – or more precisely, around the <bold>UDD balance point</bold> <inline-formula id="j_nejsds74_ineq_018"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≡</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${x^{\ast }}\equiv {F^{-1}}({p^{\ast }})$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds74_ref_050">50</xref>]. The balance point can be determined from the specific UDD chosen by solving the equation 
<disp-formula id="j_nejsds74_eq_001">
<label>(2.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mo movablelimits="false">Pr</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo stretchy="false">∣</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">Pr</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mi mathvariant="italic">o</mml:mi>
<mml:mi mathvariant="italic">w</mml:mi>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo stretchy="false">∣</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \Pr \left(up\mid F(x)={p^{\ast }}\right)=\Pr \left(down\mid F(x)={p^{\ast }}\right).\]]]></tex-math></alternatives>
</disp-formula>
</p>
<p>Specifically, the dual monotonicity conditions on the dose-transition probabilities guarantee that <inline-formula id="j_nejsds74_ineq_019"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula>’s mode is at one of the two dose levels straddling <inline-formula id="j_nejsds74_ineq_020"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${x^{\ast }}$]]></tex-math></alternatives></inline-formula>. The conditions are known as <italic>the Durham-Flournoy conditions</italic> after the researchers who first spelled them out [<xref ref-type="bibr" rid="j_nejsds74_ref_014">14</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_012">12</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_050">50</xref>]. Without meeting these conditions, a design might still be considered a UDD – that is perhaps a matter of semantics – but it is unlikely to work well as a <italic>dose-finding</italic> UDD.</p>
<p>The balance-point equation (<xref rid="j_nejsds74_eq_001">2.1</xref>) holds for all UDD variants described in Section <xref rid="j_nejsds74_s_004">2.2</xref>; one only needs to plug the correct transition probabilities into the formula. In general, design parameters should be chosen so that <inline-formula id="j_nejsds74_ineq_021"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi></mml:math><tex-math><![CDATA[${p^{\ast }}\approx \Gamma $]]></tex-math></alternatives></inline-formula>.</p>
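<p>As a hedged illustration of plugging transition probabilities into (<xref rid="j_nejsds74_eq_001">2.1</xref>), the following Python sketch solves the balance-point equation numerically by bisection. The function names <monospace>balance_point</monospace> and <monospace>biased_coin_balance</monospace> are hypothetical and are not part of the <monospace>upndown</monospace> R package. The sketch assumes that the ‘up’ probability decreases and the ‘down’ probability increases with p = F(x), and it uses the original rule and the biased-coin rule of Section <xref rid="j_nejsds74_s_004">2.2</xref> as examples.</p>
<preformat><![CDATA[
```python
from typing import Callable


def balance_point(up: Callable[[float], float],
                  down: Callable[[float], float],
                  tol: float = 1e-10) -> float:
    """Solve up(p) = down(p) for p in (0, 1) by bisection.

    Assumption: up() is decreasing and down() is increasing in p = F(x),
    so the difference up(p) - down(p) changes sign exactly once.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if up(mid) > down(mid):
            lo = mid  # 'up' still dominates, so the balance point is higher
        else:
            hi = mid
    return (lo + hi) / 2.0


def biased_coin_balance(gamma: float) -> float:
    """Balance point of the biased-coin rule for a target gamma below 0.5.

    After a negative response the dose moves up with coin probability
    b = gamma / (1 - gamma); after a positive response it moves down.
    """
    b = gamma / (1.0 - gamma)
    return balance_point(lambda p: b * (1.0 - p), lambda p: p)


# Original rule: up after a negative response, down after a positive one.
p_star_original = balance_point(lambda p: 1.0 - p, lambda p: p)

# Biased-coin rule aimed at the 30th percentile.
p_star_coin = biased_coin_balance(0.3)

print(round(p_star_original, 6), round(p_star_coin, 6))
```
]]></preformat>
<p>Any other rule set can be examined the same way by passing its own ‘up’ and ‘down’ probability functions, provided the two monotonicity assumptions hold.</p>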
<p><bold>X</bold> converges towards stationary behavior at a geometric rate, meaning that within a few dozen observations, and usually sooner, a contiguous “slice” of <bold>X</bold> will be essentially equivalent to a sample from <inline-formula id="j_nejsds74_ineq_022"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds74_ref_009">9</xref>].</p>
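<p>The stationary behavior described above can be sketched numerically. The Python example below is a minimal illustration under stated assumptions: the six response probabilities in <monospace>F</monospace> are hypothetical, the rule is the simplest one (up after a negative response, down after a positive one, with moves off the grid replaced by repeating the boundary dose), and all function names are ours. It computes the stationary distribution from detailed balance, then compares the average allocation distribution over an early and a later slice of 20 observations with that stationary distribution.</p>
<preformat><![CDATA[
```python
def transition_matrix(F):
    """Transition matrix of the simplest up-and-down rule.

    From level j: up with probability 1 - F[j], down with probability F[j].
    A move that would leave the grid repeats the boundary level instead.
    """
    M = len(F)
    P = [[0.0] * M for _ in range(M)]
    for j, f in enumerate(F):
        P[j][min(j + 1, M - 1)] += 1.0 - f
        P[j][max(j - 1, 0)] += f
    return P


def stationary_distribution(F):
    """Stationary distribution via detailed balance between adjacent levels:
    pi[j] * (1 - F[j]) = pi[j + 1] * F[j + 1]."""
    weights = [1.0]
    for j in range(len(F) - 1):
        weights.append(weights[-1] * (1.0 - F[j]) / F[j + 1])
    total = sum(weights)
    return [w / total for w in weights]


def slice_average(F, start_level, first, last):
    """Average allocation distribution over observations first .. last - 1."""
    P = transition_matrix(F)
    M = len(F)
    dist = [0.0] * M
    dist[start_level] = 1.0
    acc = [0.0] * M
    for n in range(last):
        if n >= first:
            acc = [a + d for a, d in zip(acc, dist)]
        # Advance the allocation distribution by one observation.
        dist = [sum(dist[i] * P[i][j] for i in range(M)) for j in range(M)]
    return [a / (last - first) for a in acc]


def total_variation(a, b):
    """Total variation distance between two discrete distributions."""
    return 0.5 * sum(abs(x - y) for x, y in zip(a, b))


# Hypothetical response probabilities; F crosses 0.5 between levels 2 and 3
# (zero-based), so the balance point lies between those two levels.
F = [0.05, 0.15, 0.35, 0.60, 0.80, 0.95]
pi = stationary_distribution(F)
tv_early = total_variation(slice_average(F, 0, 0, 20), pi)
tv_late = total_variation(slice_average(F, 0, 20, 40), pi)

print([round(p, 3) for p in pi])
print(round(tv_early, 4), round(tv_late, 4))
```
]]></preformat>
<p>The comparison uses slice averages rather than single observations because, under this particular rule, consecutive allocations alternate between neighboring levels except at the grid boundaries.</p>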
<p>Before we continue, a few words about robustness, a term mentioned frequently in this article. Dose-finding is a small-to-moderate sample affair; in most fields <italic>n</italic> is rarely over 50, and in many fields it is usually <inline-formula id="j_nejsds74_ineq_023"><alternatives><mml:math>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mn>25</mml:mn></mml:math><tex-math><![CDATA[$\le 25$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds74_ref_028">28</xref>]. Each of these observations is binary, so the experiment provides a few dozen bits of information at best. Thus, the overall signal-to-noise ratio cannot be very high, particularly when observations are obtained from live subjects rather than, say, industrially-produced units. Even under idealized simulated conditions in which all response thresholds are drawn from a single well-defined <inline-formula id="j_nejsds74_ineq_024"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> and there are no experimental mishaps, challenging situations are common. For example, the target <inline-formula id="j_nejsds74_ineq_025"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula> might be situated relatively far from the starting dose <inline-formula id="j_nejsds74_ineq_026"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${x_{1}}$]]></tex-math></alternatives></inline-formula>, or, very commonly, different parts of the experiment might encounter “streaks” of relatively high or low response thresholds compared with the population average, so that experimental behavior seems erratic and the target percentile might not be clearly discernible from the data.</p>
<p>In this context, a design or estimator being robust means that its dose-allocation behavior and dose-finding performance show little degradation in such challenging situations. Conversely, some design approaches are intrinsically oriented towards capitalizing upon well-behaved conditions, but falter under moderate deviations from such conditions. In the terminology we use here, this indicates a lack of robustness.</p>
</sec>
<sec id="j_nejsds74_s_004">
<label>2.2</label>
<title>Popular Types of UDDs</title>
<p>The original UDD has the simplest of rules: escalate when <inline-formula id="j_nejsds74_ineq_027"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${Y_{i}}=0$]]></tex-math></alternatives></inline-formula> and de-escalate otherwise. Therefore, <inline-formula id="j_nejsds74_ineq_028"><alternatives><mml:math>
<mml:mo movablelimits="false">Pr</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\Pr \left(up\right)=1-F(x)$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds74_ineq_029"><alternatives><mml:math>
<mml:mo movablelimits="false">Pr</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
<mml:mi mathvariant="italic">o</mml:mi>
<mml:mi mathvariant="italic">w</mml:mi>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\Pr \left(down\right)=F(x)$]]></tex-math></alternatives></inline-formula>. Whether by plugging this into (<xref rid="j_nejsds74_eq_001">2.1</xref>) or simply by symmetry, evidently <inline-formula id="j_nejsds74_ineq_030"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}=0.5$]]></tex-math></alternatives></inline-formula>. To date this is the most commonly and widely used UDD. Below we list three popular straightforward extensions that enable targeting other percentiles, while remaining only once removed from the original UDD and meeting all six criteria listed in the UDD definition above, as well as the Durham-Flournoy conditions.</p>
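<p>The original rule is simple enough to simulate in a few lines. The Python sketch below is illustrative only: the response probabilities in <monospace>F</monospace> are hypothetical, the function name <monospace>simulate_original_udd</monospace> is ours, and we assume that a move beyond the lowest or highest dose level is replaced by repeating that level.</p>
<preformat><![CDATA[
```python
import random


def simulate_original_udd(F, n, start=0, seed=None):
    """Simulate n observations under the original up-and-down rule.

    F[j] is the probability of a positive response at dose level j.
    Returns the visited levels (zero-based) and the binary responses.
    """
    rng = random.Random(seed)
    level = start
    doses, responses = [], []
    for _ in range(n):
        y = 1 if rng.random() < F[level] else 0
        doses.append(level)
        responses.append(y)
        if y == 0:
            # Negative response: escalate, unless already at the top level.
            level = min(level + 1, len(F) - 1)
        else:
            # Positive response: de-escalate, unless already at the bottom.
            level = max(level - 1, 0)
    return doses, responses


# Hypothetical dose-response values; F crosses 0.5 between levels 2 and 3.
F = [0.05, 0.15, 0.35, 0.60, 0.80, 0.95]
doses, responses = simulate_original_udd(F, n=30, start=0, seed=1)
print(doses)
print(responses)
```
]]></preformat>
<p>Starting from the lowest level, the printed dose sequence shows the walk climbing towards the levels where <monospace>F</monospace> is near 0.5 and then moving up and down around them; a different seed gives a different path.</p>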
<p>A simple extension that can target any percentile is known as Biased-Coin UDD [<xref ref-type="bibr" rid="j_nejsds74_ref_012">12</xref>]. For <inline-formula id="j_nejsds74_ineq_031"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \lt 0.5$]]></tex-math></alternatives></inline-formula>, following <inline-formula id="j_nejsds74_ineq_032"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${Y_{i}}=0$]]></tex-math></alternatives></inline-formula> one draws a random number to determine whether to escalate or repeat the same dose. In contrast, <inline-formula id="j_nejsds74_ineq_033"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[${Y_{i}}=1$]]></tex-math></alternatives></inline-formula> mandates a de-escalation. Setting the random (<italic>“biased coin”</italic>) escalation probability to <inline-formula id="j_nejsds74_ineq_034"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\Gamma /(1-\Gamma )$]]></tex-math></alternatives></inline-formula> ensures that <inline-formula id="j_nejsds74_ineq_035"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi></mml:math><tex-math><![CDATA[${p^{\ast }}=\Gamma $]]></tex-math></alternatives></inline-formula> exactly. For <inline-formula id="j_nejsds74_ineq_036"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \gt 0.5$]]></tex-math></alternatives></inline-formula> the roles of <inline-formula id="j_nejsds74_ineq_037"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${Y_{i}}$]]></tex-math></alternatives></inline-formula> are reversed, and the biased-coin probability is inverted. The <monospace>bcoin</monospace> utility function in the R package <monospace>upndown</monospace> provides the required coin probability to achieve a given Γ. The utility also returns a verbal description of transition rules, to clarify how the result is to be used:</p><preformat><monospace>&gt; bcoin(0.3)</monospace>
<monospace>After positive response, move DOWN.</monospace>
<monospace>After negative response, ‘toss a COIN’:</monospace>
<monospace>   - with probability of approximately 0.43 move UP</monospace>
<monospace>   - Otherwise REPEAT same dose.</monospace>
</preformat>
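<p>The coin probability follows directly from the rule stated above. A minimal Python sketch of the calculation is given below. It is independent of the <monospace>upndown</monospace> package, and the function name <monospace>biased_coin</monospace> and its return format are hypothetical, chosen only for illustration.</p>
<preformat>
```python
def biased_coin(target):
    """Return (coin probability, response type that triggers the toss).

    Hypothetical helper, not the upndown implementation.
    target is the desired balance point Gamma, strictly between 0 and 1.
    """
    if target >= 1.0 or 0.0 >= target:
        raise ValueError("target must lie strictly between 0 and 1")
    if 0.5 >= target:
        # Toss after a negative response; a positive response always moves down.
        return target / (1.0 - target), "negative"
    # Roles reversed: toss after a positive response; a negative one always moves up.
    return (1.0 - target) / target, "positive"
```
</preformat>
<p>For Γ = 0.3 the sketch returns a probability of 3/7, or approximately 0.43, in agreement with the <monospace>bcoin</monospace> output shown above.</p>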
<p>Another simple UDD extension replaces the random draw with a requirement for a run of <italic>k</italic> contiguous negative (positive) responses at the same dose level before escalation (de-escalation), to target <inline-formula id="j_nejsds74_ineq_038"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \le 0.5$]]></tex-math></alternatives></inline-formula> (<inline-formula id="j_nejsds74_ineq_039"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \ge 0.5$]]></tex-math></alternatives></inline-formula>). This UDD is extremely popular in sensory studies, to which it was introduced in the 1960s by its developer G.B. Wetherill [<xref ref-type="bibr" rid="j_nejsds74_ref_067">67</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_068">68</xref>]. It has been known by various names; we prefer the rather straightforward name “<italic>k</italic>-in-a-row UDD” [<xref ref-type="bibr" rid="j_nejsds74_ref_032">32</xref>]. Dose allocation behavior can be described either as a <italic>k</italic>-th order random walk, or as a random walk with internal states [<xref ref-type="bibr" rid="j_nejsds74_ref_022">22</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_050">50</xref>]. For <inline-formula id="j_nejsds74_ineq_040"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \gt 0.5$]]></tex-math></alternatives></inline-formula> (typical of sensory studies), the balance point is <inline-formula id="j_nejsds74_ineq_041"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>.</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>5</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${p^{\ast }}=0.{5^{1/k}}$]]></tex-math></alternatives></inline-formula>, with mirror-image balance points for <inline-formula id="j_nejsds74_ineq_042"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \lt 0.5$]]></tex-math></alternatives></inline-formula> (adequate for toxicity studies). Thus, for toxicity studies the <inline-formula id="j_nejsds74_ineq_043"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>4</mml:mn></mml:math><tex-math><![CDATA[$k=2,3,4$]]></tex-math></alternatives></inline-formula> balance points are very close to the 30th, 20th and 15th percentiles, respectively. The <monospace>k2targ</monospace> utility function in <monospace>upndown</monospace> provides the balance point for given <italic>k</italic>. The reverse utility <monospace>ktargOptions</monospace> provides plausible values of <italic>k</italic> given Γ, together with a verbal description of the rules analogous to the <monospace>bcoin</monospace> output shown above.</p>
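<p>The balance-point formula is simple enough to evaluate by hand, but a short Python sketch may help in checking the percentiles quoted above. The function name <monospace>k_balance_point</monospace> and its <monospace>low_target</monospace> argument are hypothetical and are not part of <monospace>upndown</monospace>.</p>
<preformat>
```python
def k_balance_point(k, low_target=True):
    """Balance point of a k-in-a-row design (hypothetical helper).

    low_target=True gives the mirror-image point 1 - 0.5**(1/k), as used
    in toxicity studies; low_target=False gives 0.5**(1/k).
    """
    if 1 > k:
        raise ValueError("k must be a positive integer")
    upper = 0.5 ** (1.0 / k)
    return 1.0 - upper if low_target else upper


for k in (2, 3, 4):
    print(k, round(k_balance_point(k), 4))
```
</preformat>
<p>The loop prints approximately 0.2929, 0.2063 and 0.1591 for <italic>k</italic> = 2, 3, 4, which are the values described above as very close to the 30th, 20th and 15th percentiles.</p>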
<p>For both <italic>k</italic>-in-a-row and Biased-Coin UDDs, the non-median balance point is achieved by rendering one transition direction “slow”, while the opposite direction retains the original UDD’s “fast” transitions. Beginning an experiment from the “slow” end (e.g., starting from the lowest dose in toxicity studies) might incur a substantial delay and reduced performance if the true target is not close. A common modification, introduced as early as the 1960s [<xref ref-type="bibr" rid="j_nejsds74_ref_067">67</xref>], is to start the experiment with original-UDD rules, until at least one observation of each type is encountered. In toxicity studies, this would mean escalating after every observation until the first toxicity, then switching to the intended <italic>k</italic>-in-a-row or Biased-Coin rules. Barring extreme exceptions, this modification is highly recommended.</p>
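<p>To make the start-up modification concrete, the following Python sketch traces the dose levels assigned by a low-target <italic>k</italic>-in-a-row design for a given sequence of binary responses, with or without the initial original-UDD phase. It is an illustration under our reading of the rules above, not code from <monospace>upndown</monospace>; the function <monospace>kinarow_path</monospace> and its arguments are hypothetical, and staying put at the lowest and highest levels is an assumed boundary convention.</p>
<preformat>
```python
def kinarow_path(responses, k, n_levels, start=1, fast_start=True):
    """Dose levels (1-based) visited under low-target k-in-a-row rules.

    responses: sequence of 0 (negative) / 1 (positive) outcomes.
    Returns the level given to each subject, followed by the level
    that the next subject would receive.
    """
    level = start
    run = 0                # consecutive negatives at the current level
    seen_positive = False  # the start-up phase ends at the first positive
    path = []
    for y in responses:
        path.append(level)
        if y == 1:
            seen_positive = True
            run = 0
            level = max(level - 1, 1)          # de-escalate, or stay at the bottom
        else:
            run += 1
            needed = 1 if (fast_start and not seen_positive) else k
            if run >= needed:
                run = 0
                level = min(level + 1, n_levels)  # escalate, or stay at the top
    path.append(level)
    return path
```
</preformat>
<p>With responses <monospace>[0, 0, 0, 1, 0, 0, 1, 0]</monospace>, <italic>k</italic> = 2 and six levels, the start-up phase reaches level 4 by the fourth subject, whereas without it the walk needs two negative responses for every upward step and has climbed no higher than level 2 by the end of the same sequence.</p>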
<p>Lastly, the Group UDD (GUD) evaluates cohorts of fixed size <inline-formula id="j_nejsds74_ineq_044"><alternatives><mml:math>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[$s\gt 1$]]></tex-math></alternatives></inline-formula> simultaneously, escalating with <italic>l</italic> or fewer positive responses and de-escalating with <italic>u</italic> or more [<xref ref-type="bibr" rid="j_nejsds74_ref_064">64</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_023">23</xref>]. All members of the same cohort receive the same dose. Somewhat similarly to <italic>k</italic>-in-a-row, GUDs can be described either as an <italic>s</italic>-th order random walk, or as first-order with a twist; in this case, moving from binary <italic>Y</italic> to size-<italic>s</italic> Binomial. Balance points can be determined from symmetry when <inline-formula id="j_nejsds74_ineq_045"><alternatives><mml:math>
<mml:mi mathvariant="italic">l</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">u</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi></mml:math><tex-math><![CDATA[$l+u=s$]]></tex-math></alternatives></inline-formula> (in which case <inline-formula id="j_nejsds74_ineq_046"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}=0.5$]]></tex-math></alternatives></inline-formula>), by solving (<xref rid="j_nejsds74_eq_001">2.1</xref>) analytically for some other specific GUD sub-families, and otherwise by solving (<xref rid="j_nejsds74_eq_001">2.1</xref>) numerically from Binomial distribution probabilities. Similarly to the <italic>k</italic>-in-a-row utilities, the <monospace>g2targ</monospace> utility function in <monospace>upndown</monospace> provides the balance point for given <inline-formula id="j_nejsds74_ineq_047"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">l</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">u</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(s,l,u)$]]></tex-math></alternatives></inline-formula>. The reverse utility <monospace>gtargOptions</monospace> provides plausible <inline-formula id="j_nejsds74_ineq_048"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">s</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">l</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">u</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(s,l,u)$]]></tex-math></alternatives></inline-formula> trios for a given Γ. See the following example:</p><preformat><monospace>&gt; gtargOptions(0.3, maxsize = 5, tolerance = 0.05)</monospace>
<monospace>For each design, if positive responses &lt;= Lower, move up</monospace>
<monospace>                 if positive responses &gt;= Upper, move down</monospace>
<monospace>otherwise repeat same dose</monospace>
<monospace>    (relevant only when Upper - Lower &gt; 1).</monospace>

<monospace>  Cohort Lower Upper BalancePoint</monospace>
<monospace>1      2     0     1    0.2928932</monospace>
<monospace>2      3     0     2    0.3472963</monospace>
<monospace>3      4     0     2    0.2663668</monospace>
<monospace>4      5     0     3    0.3019788</monospace>
<monospace>5      5     1     2    0.3138095</monospace>
</preformat>
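<p>The numerical route can be sketched in a few lines of Python. The sketch assumes that the balance point is the response probability at which the ‘up’ and ‘down’ probabilities of a cohort are equal, and finds it by bisection on Binomial probabilities. The function names are hypothetical, and this is not the algorithm used inside <monospace>upndown</monospace>, whose numerical solver and tolerance may differ.</p>
<preformat>
```python
from math import comb


def binom_cdf(s, p, upto):
    """P(X at most upto) for X ~ Binomial(s, p)."""
    return sum(comb(s, j) * p**j * (1.0 - p) ** (s - j) for j in range(upto + 1))


def gud_balance_point(s, l, u, tol=1e-10):
    """Balance point of a group UDD with cohort size s, escalation after
    l or fewer positives and de-escalation after u or more (hypothetical helper)."""
    if not (u > l >= 0 and s >= u):
        raise ValueError("need 0 at most l, l below u, and u at most s")

    def up_minus_down(p):
        pr_up = binom_cdf(s, p, l)
        pr_down = 1.0 - binom_cdf(s, p, u - 1)
        return pr_up - pr_down

    # up_minus_down decreases from +1 at p = 0 to -1 at p = 1, so bisect.
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if up_minus_down(mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```
</preformat>
<p>For (2, 0, 1) the sketch returns approximately 0.2929, and for (3, 0, 2) approximately 0.3473, consistent with the first two rows of the <monospace>gtargOptions</monospace> output above. A symmetric design such as (2, 0, 2) returns 0.5.</p>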
<p>GUDs may have inspired the ‘3+3’ escalation design [<xref ref-type="bibr" rid="j_nejsds74_ref_006">6</xref>], which is notorious in the phase I cancer trial design literature for its enduring popularity despite volumes of evidence for its poor dose-finding performance. The transition rules after ‘3+3’s first visit to a new dose-level resemble a GUD<inline-formula id="j_nejsds74_ineq_049"><alternatives><mml:math>
<mml:msub>
<mml:mrow/>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${_{(3,0,2)}}$]]></tex-math></alternatives></inline-formula>, listed in the second row of the <monospace>gtargOptions</monospace> output above. However, ‘3+3’ stops the experiment before any dose level sees more than 6 observations, and completely disallows re-escalation to a previously visited dose. This rules out a target-centered random walk, and therefore denies ‘3+3’ the performance benefits that such a walk confers on UDDs. To emphasize: despite occasional mis-identification in the literature, ‘3+3’ is not a UDD.</p>
<p>When these UDD variants are compared for estimation of the same target percentile, <italic>k</italic>-in-a-row converges somewhat faster to its stationary behavior [<xref ref-type="bibr" rid="j_nejsds74_ref_050">50</xref>]. This translates into an estimation-efficiency advantage, which has however become more nuanced with improvements to UDD estimation methods that have enhanced all variants’ performance [<xref ref-type="bibr" rid="j_nejsds74_ref_049">49</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_017">17</xref>]. <italic>k</italic>-in-a-row’s advantage depends on it having a balance point close enough to the experiment’s designated target (say, within <inline-formula id="j_nejsds74_ineq_050"><alternatives><mml:math>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mn>5</mml:mn></mml:math><tex-math><![CDATA[$\sim 5$]]></tex-math></alternatives></inline-formula> percentage points).</p>
<p>Figure <xref rid="j_nejsds74_fig_001">1</xref> illustrates a UDD allocation distribution and provides insight into the somewhat elusive topic of UDD allocation convergence. The vertical bars show the expected accumulated dose-allocation distribution after <inline-formula id="j_nejsds74_ineq_051"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>30</mml:mn></mml:math><tex-math><![CDATA[$n=30$]]></tex-math></alternatives></inline-formula>, under <italic>k</italic>-in-a-row with <inline-formula id="j_nejsds74_ineq_052"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$k=2$]]></tex-math></alternatives></inline-formula> and the first <inline-formula id="j_nejsds74_ineq_053"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> curve in the 500-curve ensemble used in the next section’s simulations. Given knowledge of <inline-formula id="j_nejsds74_ineq_054"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> (shown in the background as a faint green band) and the starting point (here assumed to be <inline-formula id="j_nejsds74_ineq_055"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{1}}$]]></tex-math></alternatives></inline-formula>), the distribution can be calculated analytically and was derived via the <monospace>upndown</monospace> utility <monospace>cumulvec</monospace>. The stationary or asymptotic random-walk distribution <inline-formula id="j_nejsds74_ineq_056"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula> (connected black dots; calculated via <monospace>pivec</monospace>) is independent of the starting point. The bars’ heights are not too far removed from it, but one can see the low starting point’s effect. The expected marginal distribution of additional doses halfway through the experiment at <inline-formula id="j_nejsds74_ineq_057"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>15</mml:mn></mml:math><tex-math><![CDATA[$n=15$]]></tex-math></alternatives></inline-formula> (connected red dots; calculated via <monospace>currentvec</monospace>) is hardly distinguishable from <inline-formula id="j_nejsds74_ineq_058"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula>. Around <inline-formula id="j_nejsds74_ineq_059"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>25</mml:mn></mml:math><tex-math><![CDATA[$n=25$]]></tex-math></alternatives></inline-formula>, the distribution of additional doses becomes visually indistinguishable from <inline-formula id="j_nejsds74_ineq_060"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula> at this scale. This demonstrates the meaning of dose-allocation convergence. Note that <inline-formula id="j_nejsds74_ineq_061"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> is on the same scale as the allocation probabilities: it crosses <inline-formula id="j_nejsds74_ineq_062"><alternatives><mml:math>
<mml:mn>29.3</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$29.3\% $]]></tex-math></alternatives></inline-formula> (the balance point) and <inline-formula id="j_nejsds74_ineq_063"><alternatives><mml:math>
<mml:mn>30</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$30\% $]]></tex-math></alternatives></inline-formula> (the experiment’s official target rate) shortly after <inline-formula id="j_nejsds74_ineq_064"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{4}}$]]></tex-math></alternatives></inline-formula>, where indeed the peaks of all depicted distributions are located.</p>
<fig id="j_nejsds74_fig_001">
<label>Figure 1</label>
<caption>
<p>Illustration of UDD dose-allocation distribution and allocation convergence. Details are described in the text.</p>
</caption>
<graphic xlink:href="nejsds74_g001.jpg"/>
</fig>
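<p>For readers who wish to experiment with stationary distributions outside R, the following Python sketch computes the stationary dose-allocation distribution of a low-target <italic>k</italic>-in-a-row walk by treating it as a first-order Markov chain with internal states and iterating the transition rule until convergence. It is an illustrative stand-in, not the method used by <monospace>pivec</monospace>; the function name, the fixed iteration count and the convention of staying put at the boundary levels are assumptions.</p>
<preformat>
```python
def kinarow_stationary(F, k, n_iter=5000):
    """Stationary allocation distribution of a low-target k-in-a-row walk.

    F: response probabilities at the dose levels, each strictly between 0 and 1.
    The chain state is (level m, run r), where r counts consecutive
    negatives observed so far at level m (r = 0, ..., k - 1).
    """
    M = len(F)

    def idx(m, r):
        return m * k + r

    n_states = M * k
    prob = [1.0 / n_states] * n_states
    for _ in range(n_iter):
        new = [0.0] * n_states
        for m in range(M):
            for r in range(k):
                mass = prob[idx(m, r)]
                # Positive response: step down (or stay at the lowest level), reset run.
                new[idx(max(m - 1, 0), 0)] += mass * F[m]
                # Negative response: step up once the run reaches k, else extend the run.
                if r + 1 == k:
                    new[idx(min(m + 1, M - 1), 0)] += mass * (1.0 - F[m])
                else:
                    new[idx(m, r + 1)] += mass * (1.0 - F[m])
        prob = new
    # Marginalize over the internal run counter to get per-level probabilities.
    return [sum(prob[idx(m, r)] for r in range(k)) for m in range(M)]
```
</preformat>
<p>With the made-up curve <monospace>F = [0.05, 0.15, 0.3, 0.5, 0.7]</monospace> and <italic>k</italic> = 2, the resulting distribution peaks at the third level, where <italic>F</italic> is closest to the balance point of about 0.293. Setting <italic>k</italic> = 1 recovers the original median-targeting UDD.</p>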
<p>Other UDD variants beyond the four described here have been published, some of them extending the possibilities via additional biased coins [e.g., <xref ref-type="bibr" rid="j_nejsds74_ref_008">8</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_012">12</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_018">18</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_024">24</xref>]. Generally speaking, designs with biased coins applied to both the ‘up’ and ‘down’ transitions do not provide enough additional practical benefit to justify the added complication, and have rarely if ever been implemented in practice. One UDD variant that does enjoy popularity in sensory studies uses different step sizes for the up and down transitions [<xref ref-type="bibr" rid="j_nejsds74_ref_020">20</xref>]. This innovation “violates” either criterion 4 (if the ratio between step sizes is rational) or criterion 2 (otherwise), and hence narrowly speaking might not be considered a UDD as it does not generate a random walk on <inline-formula id="j_nejsds74_ineq_065"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>. However, it does generate a target-centered Markov chain (either discrete or continuous-state) that shares many properties with “proper” UDDs.</p>
<p>The R package <monospace>upndown</monospace> has additional utilities, such as estimation functions and even a fast-running ensemble simulation framework. We recommend using the package’s development version, available via GitHub at <monospace>"assaforon/upndown"</monospace>.</p>
</sec>
<sec id="j_nejsds74_s_005">
<label>2.3</label>
<title>Estimation</title>
<sec id="j_nejsds74_s_006">
<label>2.3.1</label>
<title>Regression Estimators</title>
<p>The estimator we recommend for UDD is Centered Isotonic Regression (CIR) [<xref ref-type="bibr" rid="j_nejsds74_ref_049">49</xref>]. Using regression for dose-finding begins by calculating the observed dose-specific response rates, <inline-formula id="j_nejsds74_ineq_066"><alternatives><mml:math>
<mml:mi mathvariant="bold">R</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">M</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{R}=({R_{1}},\dots ,{R_{M}})$]]></tex-math></alternatives></inline-formula>: 
<disp-formula id="j_nejsds74_eq_002">
<label>(2.2)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">≡</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mspace width="2.5pt"/>
<mml:mi mathvariant="italic">m</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mo>…</mml:mo>
<mml:mi mathvariant="italic">M</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {R_{m}}\equiv \frac{{T_{m}}}{{N_{m}}},\hspace{2.5pt}\hspace{2.5pt}m=1,\hspace{2.5pt}\dots M,\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds74_ineq_067"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${N_{m}}$]]></tex-math></alternatives></inline-formula> is the sample size at <inline-formula id="j_nejsds74_ineq_068"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{m}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds74_ineq_069"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${T_{m}}$]]></tex-math></alternatives></inline-formula> is the number of responses (e.g., toxicities) observed among them. The rates <bold>R</bold> (shown as ‘x’ marks in Figure <xref rid="j_nejsds74_fig_002">2</xref>) are used to estimate the dose-response curve <inline-formula id="j_nejsds74_ineq_070"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula>, ultimately “reading” <inline-formula id="j_nejsds74_ineq_071"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula> off the regression curve. In Figure <xref rid="j_nejsds74_fig_002">2</xref>, for example, CIR’s 90th percentile estimate (purple dot) is the value of <italic>x</italic> where the CIR curve (blue) crosses <inline-formula id="j_nejsds74_ineq_072"><alternatives><mml:math>
<mml:mi mathvariant="italic">y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>90</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$y=90\% $]]></tex-math></alternatives></inline-formula>.</p>
<p>Isotonic regression methods are a good match for UDDs, as both are non-parametric and both tend to demonstrate robustness to experimental mishaps and to variations in the dose-response relationship. We prefer CIR specifically, because it offers a considerable performance improvement over straightforward interpolation of ordinary isotonic regression, an estimator introduced to UDD by Stylianou and Flournoy [<xref ref-type="bibr" rid="j_nejsds74_ref_060">60</xref>]. CIR produces more realistic dose-response curves by avoiding the characteristic flat stretches produced by ordinary isotonic regression. Figure <xref rid="j_nejsds74_fig_002">2</xref> illustrates CIR and isotonic regression, with data from an anesthesiology experiment that targeted the 90th percentile using biased-coin UDD [<xref ref-type="bibr" rid="j_nejsds74_ref_021">21</xref>].</p>
<fig id="j_nejsds74_fig_002">
<label>Figure 2</label>
<caption>
<p>Example of isotonic regression and CIR using data from an anesthesiology UDD experiment with <inline-formula id="j_nejsds74_ineq_073"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>45</mml:mn></mml:math><tex-math><![CDATA[$n=45$]]></tex-math></alternatives></inline-formula> that targeted the 90th percentile [<xref ref-type="bibr" rid="j_nejsds74_ref_021">21</xref>]. Isotonic regression as adapted by Stylianou and Flournoy (black dashes) interpolates between the observed response rates <bold>R</bold> (‘x’ marks; size proportional to <inline-formula id="j_nejsds74_ineq_074"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${n_{m}}$]]></tex-math></alternatives></inline-formula>), replacing regions of decrease with flat stretches. CIR (solid blue) collapses the flat stretches to single points, ensuring strict monotonicity. CIR also incorporates the bias-mitigation formula (<xref rid="j_nejsds74_eq_003">2.3</xref>). The CIR target estimate and 90% confidence interval are shown in purple. The figure was generated via the <monospace>upndown</monospace> package utility <monospace>drplot</monospace>.</p>
</caption>
<graphic xlink:href="nejsds74_g002.jpg"/>
</fig>
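<p>The pooling idea behind CIR can be conveyed with a short Python sketch. It is a deliberately simplified illustration: adjacent dose levels whose observed rates fail to increase strictly are pooled, and each pooled block is placed at the sample-size-weighted mean dose rather than drawn as a flat stretch. The target dose is then read off by linear interpolation. The published CIR algorithm and the <monospace>cir</monospace> package include refinements (such as the bias mitigation and interval estimation discussed in this section) that are omitted here, and both function names are hypothetical.</p>
<preformat>
```python
def centered_pava(x, t, n):
    """Pool adjacent violators, centering each pooled block.

    x: doses in increasing order; t: response counts; n: sample sizes.
    Returns a list of (dose, rate, size) points with strictly increasing rates.
    """
    blocks = []
    for xi, ti, ni in zip(x, t, n):
        if ni == 0:
            continue  # no observations at this dose
        blocks.append([float(xi), ti / ni, float(ni)])
        # Merge while the last two blocks are not strictly increasing in rate.
        while len(blocks) > 1 and blocks[-2][1] >= blocks[-1][1]:
            x2, y2, w2 = blocks.pop()
            x1, y1, w1 = blocks.pop()
            w = w1 + w2
            blocks.append([(w1 * x1 + w2 * x2) / w, (w1 * y1 + w2 * y2) / w, w])
    return [tuple(b) for b in blocks]


def inverse_estimate(points, target):
    """Dose at which the piecewise-linear curve through points crosses target.

    Returns None when target lies outside the range of fitted rates.
    """
    for (x1, y1, _), (x2, y2, _) in zip(points, points[1:]):
        if target >= y1 and y2 >= target:
            return x1 + (target - y1) * (x2 - x1) / (y2 - y1)
    return None
```
</preformat>
<p>As a toy example with made-up data, doses 1 to 4 with 0, 2, 1 and 4 responses out of 4 subjects each have observed rates 0, 0.5, 0.25 and 1. The violation between doses 2 and 3 is pooled into a single point at dose 2.5 with rate 0.375, and the interpolated estimate for a target rate of 0.3 is 2.2.</p>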
<p>CIR also includes an accompanying confidence interval with adequate coverage, beginning with an interval for <inline-formula id="j_nejsds74_ineq_075"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> based on an analytical formula for ordered binary data by Morris [<xref ref-type="bibr" rid="j_nejsds74_ref_040">40</xref>], then using a localized delta-method-like inversion to obtain an interval for <inline-formula id="j_nejsds74_ineq_076"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds74_ref_049">49</xref>]. Note that isotonic regression has historically lacked an adequate small-sample interval. Therefore CIR, available via the R package <monospace>cir</monospace>, offers solutions relevant for dose-response and dose-finding applications far beyond UDD alone. Our confidence-interval method is compatible with both CIR and ordinary isotonic regression. The convenience function <monospace>udest</monospace> in <monospace>upndown</monospace> offers a CIR target estimate pre-configured for UDD datasets.</p>
<p>One might wonder why we do not recommend parametric regression, e.g., Logistic or Probit. Such methods can be found in some UDD experimental reports, but we generally advise against them because of poorly-characterized performance under model mis-specification, and the considerable chance for non-existent estimates [<xref ref-type="bibr" rid="j_nejsds74_ref_058">58</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_019">19</xref>]. If the latter problem is circumvented via use of Bayesian priors, performance might depend too strongly upon them.</p>
<p>A note of caution regarding regression estimators: in dose-finding it is customary to assume that the observed rates <bold>R</bold> are equivalent to Binomial random variables, and therefore constitute unbiased estimates of <italic>F</italic> on <inline-formula id="j_nejsds74_ineq_077"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>. The assumption is wrong, not only for UDDs but for all adaptive dose-finding designs, because of the dependence between numerator and denominator in (<xref rid="j_nejsds74_eq_002">2.2</xref>) [<xref ref-type="bibr" rid="j_nejsds74_ref_026">26</xref>]. We recently described the typical form this bias takes in dose-finding. In the target’s vicinity the bias is nearly zero, and therefore it has little effect upon designs’ dose-finding performance. Away from target, the bias “flares out” in both directions, making observed rates seem more extreme than the underlying values of <italic>F</italic> and therefore producing exaggerated slopes [<xref ref-type="bibr" rid="j_nejsds74_ref_017">17</xref>]. The bias tends to be stronger for non-UDD designs such as the Continual Reassessment Method (CRM) [<xref ref-type="bibr" rid="j_nejsds74_ref_046">46</xref>], because the dependence there is stronger.</p>
<p>Inspired by Firth [<xref ref-type="bibr" rid="j_nejsds74_ref_015">15</xref>] and informed by the shape of the bias, we developed a simple ad-hoc bias mitigation formula that shrinks <bold>R</bold> towards Γ: 
<disp-formula id="j_nejsds74_eq_003">
<label>(2.3)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">˜</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">m</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {\widetilde{R}_{m}}=\frac{{T_{m}}+\Gamma }{{N_{m}}+1}=\frac{{N_{m}}{R_{m}}+\Gamma }{{N_{m}}+1}.\]]]></tex-math></alternatives>
</disp-formula>
</p>
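<p>As an illustration only, formula (<xref rid="j_nejsds74_eq_003">2.3</xref>) can be sketched in a few lines of Python. This is not the code used in <monospace>cir</monospace> or <monospace>upndown</monospace>; the function name <monospace>shrink_toward_target</monospace> and the counts below are hypothetical and chosen only to show the direction of the adjustment.</p>
<preformat><![CDATA[
```python
def shrink_toward_target(events, n, target):
    """Bias-mitigated rate (T_m + Gamma) / (N_m + 1) for one dose level."""
    if n < 0 or not 0 <= events <= n:
        raise ValueError("need 0 <= events <= n")
    if not 0.0 < target < 1.0:
        raise ValueError("target must lie strictly between 0 and 1")
    return (events + target) / (n + 1)


# Hypothetical counts at four dose levels (not data from the article).
target = 0.3
events = [0, 1, 3, 4]   # T_m: number of Y = 1 outcomes at each dose
n_obs = [3, 8, 9, 5]    # N_m: number of observations at each dose

raw = [t / n for t, n in zip(events, n_obs)]
adjusted = [shrink_toward_target(t, n, target) for t, n in zip(events, n_obs)]

for r, a in zip(raw, adjusted):
    print(f"raw {r:.3f} -> adjusted {a:.3f}")
```
]]></preformat>
<p>Each adjusted rate lies between the raw rate and Γ, and the adjustment is largest at dose levels with few observations, which in practice tend to be the levels far from target where the bias is most pronounced.</p>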
<p>When <inline-formula id="j_nejsds74_ineq_078"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\Gamma =0.5$]]></tex-math></alternatives></inline-formula>, this formula is identical to the commonly used correction for calculating the empirical logit in the presence of zero cell counts [<xref ref-type="bibr" rid="j_nejsds74_ref_069">69</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_001">1</xref>]. The bias mitigation is an option in <monospace>cir</monospace>, and is implemented as the default in <monospace>upndown::udest</monospace>. It tends to improve CIR interval coverage for the target-dose estimate by making the estimated slope of <italic>F</italic> less exaggerated. The CIR curve in Figure <xref rid="j_nejsds74_fig_002">2</xref> incorporates the bias-mitigation formula.</p>
<p>Due to the bias, we generally advise against off-target estimates with adaptive dose-finding designs, e.g., estimating the 95th percentile using data from a median-targeting UDD. Note that many safety dose-exclusion rules implemented in other dose-finding designs rely upon such off-target estimates.</p>
</sec>
<sec id="j_nejsds74_s_007">
<label>2.3.2</label>
<title>Dose-Averaging Estimators</title>
<p>Historically, dose-averaging estimators appeared before regression estimators, and are still very popular, particularly in non-medical fields. These are averages of a subset of the sequence of allocated doses, <bold>X</bold>. The rapid convergence of <bold>X</bold> to stationary behavior and <inline-formula id="j_nejsds74_ineq_079"><alternatives><mml:math>
<mml:mi mathvariant="bold-italic">π</mml:mi></mml:math><tex-math><![CDATA[$\boldsymbol{\pi }$]]></tex-math></alternatives></inline-formula>’s relative symmetry provide the basic rationale for dose-averaging estimators. A deeper justification is that nearly all the experiment’s information is encoded in <bold>X</bold> via the dose-transition rules. One can even add a “phantom” <inline-formula id="j_nejsds74_ineq_080"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${X_{n+1}}$]]></tex-math></alternatives></inline-formula> to the average, because when the experiment ends the next treatment allocation can be pre-determined without need to observe <inline-formula id="j_nejsds74_ineq_081"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">Y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${Y_{n+1}}$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds74_ref_005">5</xref>]. Both the original Dixon-Mood UDD estimator, and the estimator developed by Wetherill and Leavitt upon UDDs’ introduction to sensory studies, are dose-averaging estimators [<xref ref-type="bibr" rid="j_nejsds74_ref_010">10</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_068">68</xref>]. The latter is likely still the single most popular UDD estimation approach when all fields are considered. It averages only the subset of doses at points where <bold>X</bold>’s trajectory changes direction from ‘up’ to ‘down’ or vice versa.</p>
<p>The simplicity of averaging and the relatively low variance of using an average for estimation in general are appealing, but we have found that dose-averaging approaches tend to lack robustness. A plethora of biases counter-balances the low-variance advantage; some are very difficult or impossible to mitigate [<xref ref-type="bibr" rid="j_nejsds74_ref_052">52</xref>]. For example, a starting-point bias may be observed if the target is far from the starting dose, and a boundary bias takes place when the target is near the edge of <inline-formula id="j_nejsds74_ineq_082"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>.</p>
<p>In addition, none of the dose-averaging confidence intervals in current use offers sufficient and robust coverage, mostly because all require a standard-error estimate, and those are hard to obtain reliably when the data are so discrete. Our <monospace>upndown</monospace> package offers a bootstrap-based interval that comes close to passable coverage for some dose-averaging estimators, but generally still falls short by at least <inline-formula id="j_nejsds74_ineq_083"><alternatives><mml:math>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mn>5</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$\sim 5\% $]]></tex-math></alternatives></inline-formula>.</p>
</sec>
</sec>
</sec>
<sec id="j_nejsds74_s_008">
<label>3</label>
<title>Up-and-Down – and Other Approaches</title>
<sec id="j_nejsds74_s_009">
<label>3.1</label>
<title>Background</title>
<p>Because dose-finding is a generic challenge that resurfaces in many contexts, a variety of approaches have been developed to address it. For comparison with UDD, we focus on the most prominent family of approaches in recent literature, one that utilizes repeated estimation. The use of estimation to guide the next treatment’s placement can be traced back at least to the 1950s, nearly as old as UDDs [<xref ref-type="bibr" rid="j_nejsds74_ref_056">56</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_041">41</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_035">35</xref>]. More recently, in the context of dose-finding on a discrete grid <inline-formula id="j_nejsds74_ineq_084"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>, most estimation-based approaches have coalesced around the following outline:</p>
<list>
<list-item id="j_nejsds74_li_011">
<label>•</label>
<p>After each observation, estimate <inline-formula id="j_nejsds74_ineq_085"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> – either the entire curve or the value at the current dose-level;</p>
</list-item>
<list-item id="j_nejsds74_li_012">
<label>•</label>
<p>Place the next treatment at the dose-level deemed “closest to target”, according to these estimates and the design’s specific optimization criterion.</p>
</list-item>
</list>
<p>The optimization criterion could be, e.g., the dose-level with the smallest <inline-formula id="j_nejsds74_ineq_086"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|\hat{F}-\Gamma |$]]></tex-math></alternatives></inline-formula>, or – in the case of so-called “interval designs”, simply that the current dose-level’s estimate of <italic>F</italic> is within some tolerance interval around Γ. When the former criterion is used, the target estimation method at the experiment’s end is usually identical to the dose-allocation method during the experiment.</p>
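<p>The two criteria just described can be sketched as follows. This is a generic illustration under simplifying assumptions, not the allocation code of any specific published design: the function names, the example estimates, and the convention of moving one level up or down when the estimate falls outside the tolerance interval are ours.</p>
<preformat><![CDATA[
```python
def closest_to_target(f_hat, target):
    """Index of the dose level whose estimate minimizes |F_hat - Gamma|."""
    return min(range(len(f_hat)), key=lambda m: abs(f_hat[m] - target))


def interval_rule(f_hat_current, target, half_width):
    """Dose step (-1, 0 or +1) under a simple tolerance-interval criterion."""
    if f_hat_current < target - half_width:
        return 1    # estimate below the interval: escalate
    if f_hat_current > target + half_width:
        return -1   # estimate above the interval: de-escalate
    return 0        # estimate inside the interval: repeat the dose


# Hypothetical estimates of F on a five-level grid, target Gamma = 0.3.
f_hat = [0.05, 0.12, 0.26, 0.45, 0.70]
next_level = closest_to_target(f_hat, 0.3)
step_from_second_level = interval_rule(f_hat[1], 0.3, 0.1)
print(next_level, step_from_second_level)
```
]]></preformat>
<p>In both cases the allocation depends entirely on the current estimates, so any error in those estimates feeds directly into the next dose assignment.</p>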
<p>The first dose-finding design we are aware of that follows this outline explicitly was a parametric Bayesian design for sensory studies, published in 1983 under the acronym QUEST [<xref ref-type="bibr" rid="j_nejsds74_ref_066">66</xref>]. While QUEST has gained considerable traction in its own field, it was the 1990 publication of another parametric Bayesian design that caught mainstream statistics’ attention: the aforementioned CRM [<xref ref-type="bibr" rid="j_nejsds74_ref_046">46</xref>].<xref ref-type="fn" rid="j_nejsds74_fn_003">3</xref><fn id="j_nejsds74_fn_003"><label><sup>3</sup></label>
<p>Wu also presented such an approach independently in 1985, but it seems that the reach of his work has remained confined mostly to methodological discussions of stochastic approximation and related designs [<xref ref-type="bibr" rid="j_nejsds74_ref_070">70</xref>].</p></fn> Catering to phase I cancer trials, which is the dose-finding application receiving the most method-development resources nowadays, CRM was soon followed in that field by methods bearing acronyms such as EWOC [<xref ref-type="bibr" rid="j_nejsds74_ref_003">3</xref>], or more recently interval designs such as CCD [<xref ref-type="bibr" rid="j_nejsds74_ref_031">31</xref>], mTPI [<xref ref-type="bibr" rid="j_nejsds74_ref_033">33</xref>], and BOIN [<xref ref-type="bibr" rid="j_nejsds74_ref_038">38</xref>]. This is a very partial list.</p>
<p>Such designs have often been named “long-memory” because they incorporate information going back to <inline-formula id="j_nejsds74_ineq_087"><alternatives><mml:math>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">y</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[$\left({x_{1}},{y_{1}}\right)$]]></tex-math></alternatives></inline-formula>, but there are other approaches with long memory that do not follow the outline as described above. Here we suggest the provisional name <bold>Aim-for-Target</bold> designs, which seems more specific and descriptive. Plausibly, one can also describe them as greedy algorithms [<xref ref-type="bibr" rid="j_nejsds74_ref_007">7</xref>, Ch. 15]. Our impression is that Aim-for-Target designs have taken up nearly all the oxygen in the statistical dose-finding-literature room, with attempts to dethrone or modify ‘3+3’ in the phase I realm accounting for most of the balance. The practical needs of other fields that use dose-finding have been largely ignored in this recent body of methodological literature. UDDs are mentioned in passing, if at all, and often in a misguided manner.</p>
<p>Oron and Hoff demonstrated a decade ago that Aim-for-Target designs suffer from a disturbing, structural lack of robustness which, even more disturbingly, has gone almost completely under the radar of all this novel methodological activity [<xref ref-type="bibr" rid="j_nejsds74_ref_048">48</xref>]. In a nutshell, Aim-for-Target designs tend to lock onto a perceived optimum early in the trial. In case this “early bet” misses the true optimum, these designs take very long to self-correct, because their self-correction mechanism operates at a root-<italic>n</italic> rate, with new information accumulating rather slowly since it consists of dependent binary data.</p>
<p>In Oron and Hoff’s work, UDDs were shown to attain similar dose-finding performance overall, and to have far better robustness, than Aim-for-Target. Since the evidence presented there has not become common knowledge, and since some time has passed with new designs and new developments, we have thought it appropriate to revisit the comparison with new, broader simulations.</p>
</sec>
<sec id="j_nejsds74_s_010">
<label>3.2</label>
<title>Comparative Performance Simulations</title>
<sec id="j_nejsds74_s_011" sec-type="methods">
<label>3.2.1</label>
<title>Methods</title>
<p>We present here results for designs targeting the 30th percentile, a common phase I cancer target, and the 90th percentile, popular in anesthesiology. We refer to the <inline-formula id="j_nejsds74_ineq_088"><alternatives><mml:math>
<mml:mi mathvariant="italic">Y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[$Y=1$]]></tex-math></alternatives></inline-formula> outcome in the former case as dose-limiting toxicity (DLT), and in the latter as efficacy, even though both are simulated via very similar computer code. More simulation details are provided below. <disp-quote>
<p>We generated parametric random <italic>F</italic> curves using a 3-parameter Weibull (shape, scale, lateral shift). In order to enable separate looks into the effect of curve properties (slope, shape, etc.) and of the relationship between starting dose and target location, for each target a single ensemble of <inline-formula id="j_nejsds74_ineq_089"><alternatives><mml:math>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>500</mml:mn></mml:math><tex-math><![CDATA[$B=500$]]></tex-math></alternatives></inline-formula> curves was generated, with each curve having different Weibull parameter values but with all curves crossing target near the middle of <inline-formula id="j_nejsds74_ineq_090"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>. Extremely steep or extremely shallow curves were excluded as being less “interesting” for the dose-finding task.</p>
<p>Then, the simulation setting was varied by shifting the entire ensemble right or left, or by changing the starting dose. This approach follows in the footsteps of earlier randomized-<italic>F</italic> simulations [<xref ref-type="bibr" rid="j_nejsds74_ref_051">51</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_048">48</xref>]. In its specific details it is quite similar to, but somewhat more sophisticated than, the curve ensembles shown in the supplement of reference [<xref ref-type="bibr" rid="j_nejsds74_ref_052">52</xref>].</p>
<p>For the 30th percentile we used <inline-formula id="j_nejsds74_ineq_091"><alternatives><mml:math>
<mml:mi mathvariant="italic">M</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>8</mml:mn></mml:math><tex-math><![CDATA[$M=8$]]></tex-math></alternatives></inline-formula> dose levels and <inline-formula id="j_nejsds74_ineq_092"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>30</mml:mn></mml:math><tex-math><![CDATA[$n=30$]]></tex-math></alternatives></inline-formula> observations, and 4 different settings using the same 500-curve ensemble. In 3 settings the starting dose was <inline-formula id="j_nejsds74_ineq_093"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{1}}$]]></tex-math></alternatives></inline-formula> as is common in toxicity studies, and the target location was in the middle [<inline-formula id="j_nejsds74_ineq_094"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[${x^{\ast }}\in \left({d_{4}},{d_{5}}\right)$]]></tex-math></alternatives></inline-formula>], low [<inline-formula id="j_nejsds74_ineq_095"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>3</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[${x^{\ast }}\in \left({d_{2}},{d_{3}}\right)$]]></tex-math></alternatives></inline-formula>] or high [<inline-formula id="j_nejsds74_ineq_096"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>7</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[${x^{\ast }}\in \left({d_{6}},{d_{7}}\right)$]]></tex-math></alternatives></inline-formula>]. The fourth setting had the target in <inline-formula id="j_nejsds74_ineq_097"><alternatives><mml:math>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>5</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[$\left({d_{4}},{d_{5}}\right)$]]></tex-math></alternatives></inline-formula> and started at <inline-formula id="j_nejsds74_ineq_098"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{4}}$]]></tex-math></alternatives></inline-formula>. The 90th percentile simulations were fairly similar, except for using <inline-formula id="j_nejsds74_ineq_099"><alternatives><mml:math>
<mml:mi mathvariant="italic">M</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>12</mml:mn></mml:math><tex-math><![CDATA[$M=12$]]></tex-math></alternatives></inline-formula> dose levels and <inline-formula id="j_nejsds74_ineq_100"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>50</mml:mn></mml:math><tex-math><![CDATA[$n=50$]]></tex-math></alternatives></inline-formula>, and with 3 of the 4 settings varying the starting point (high, middle, low) rather than the target location. We kept one particularly “hard” setting, the one starting at <inline-formula id="j_nejsds74_ineq_101"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{1}}$]]></tex-math></alternatives></inline-formula> and having a high target. The 30th percentile simulation had one set of comparisons with single-patient dose allocation decisions, and one set with cohorts of 3, a cohort size used very commonly in phase I cancer trials. The 90th percentile simulation only had a single-patient set.</p>
<p>For the 30th percentile cohort-allocation simulations, we used GUD<inline-formula id="j_nejsds74_ineq_102"><alternatives><mml:math>
<mml:msub>
<mml:mrow/>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${_{(3,0,2)}}$]]></tex-math></alternatives></inline-formula> (<inline-formula id="j_nejsds74_ineq_103"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>0.35</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}\approx 0.35$]]></tex-math></alternatives></inline-formula>): escalate after zero-toxicity cohorts, repeat the same dose with one toxicity, and de-escalate otherwise. For the single-patient allocation simulations, we used the <italic>k</italic>-in-a-row UDD with <inline-formula id="j_nejsds74_ineq_104"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$k=2$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds74_ineq_105"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>6</mml:mn></mml:math><tex-math><![CDATA[$k=6$]]></tex-math></alternatives></inline-formula> for the 30th (below-median rules, <inline-formula id="j_nejsds74_ineq_106"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>0.29</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}\approx 0.29$]]></tex-math></alternatives></inline-formula>) and 90th (above-median rules, <inline-formula id="j_nejsds74_ineq_107"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>0.89</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}\approx 0.89$]]></tex-math></alternatives></inline-formula>) percentiles, respectively. For <italic>k</italic>-in-a-row we used the quick start-up modification described in Section <xref rid="j_nejsds74_s_004">2.2</xref>. For UDD estimation, CIR was used including the bias-mitigation formula (<xref rid="j_nejsds74_eq_003">2.3</xref>).</p>
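<p>The approximate balance points quoted above can be checked numerically. The sketch below is not part of the simulation code; it assumes the standard characterizations of the balance point, namely that a <italic>k</italic>-in-a-row rule balances where the probability of <italic>k</italic> consecutive identical outcomes equals one half, and that a group rule balances where the probabilities of escalation and de-escalation are equal. The function names are hypothetical.</p>
<preformat><![CDATA[
```python
from math import comb


def k_in_a_row_balance(k, below_median=True):
    """Balance point of the k-in-a-row rule: 1 - 0.5**(1/k) or 0.5**(1/k)."""
    root = 0.5 ** (1.0 / k)
    return 1.0 - root if below_median else root


def gud_balance(cohort, up_max, down_min, tol=1e-10):
    """Balance point of a group up-and-down rule, found by bisection.

    Escalate when at most `up_max` of `cohort` subjects have Y = 1,
    de-escalate when at least `down_min` do, repeat the dose otherwise.
    """
    def binom_pmf(j, p):
        return comb(cohort, j) * p**j * (1.0 - p) ** (cohort - j)

    def up_minus_down(p):
        p_up = sum(binom_pmf(j, p) for j in range(0, up_max + 1))
        p_down = sum(binom_pmf(j, p) for j in range(down_min, cohort + 1))
        return p_up - p_down

    lo, hi = 0.0, 1.0  # the difference decreases from +1 at p=0 to -1 at p=1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if up_minus_down(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


print(f"k=2, below median: {k_in_a_row_balance(2):.3f}")
print(f"k=6, above median: {k_in_a_row_balance(6, below_median=False):.3f}")
print(f"GUD(3,0,2):        {gud_balance(3, 0, 2):.3f}")
```
]]></preformat>
<p>Under these assumptions the three values come out close to 0.29, 0.89 and 0.35, in agreement with the approximations given in the text.</p>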
<p>As to Aim-for-Target designs, we used three CRM variants for each target, all generated via the <monospace>getprior</monospace> function in the <monospace>dfcrm</monospace> package. This function provides a “skeleton” of <italic>F</italic> values on <inline-formula id="j_nejsds74_ineq_108"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>, determined by the user’s choice of an “indifference interval” around target, and by the prior-predictive mode location of the target dose [<xref ref-type="bibr" rid="j_nejsds74_ref_036">36</xref>, <xref ref-type="bibr" rid="j_nejsds74_ref_037">37</xref>]. For the 30th percentile we used an “indifference interval” half-width of 0.05, except for one variant with 0.1 half-width. For the 90th percentile, we found by trial-and-error that these intervals needed to be half as wide. Prior distributions of the estimated parameter were kept at defaults. The narrower-interval variants varied by prior-predictive mode location (high vs. low), while the wider-interval variant had its prior mode near the middle of <inline-formula id="j_nejsds74_ineq_109"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula>. CRM dose transitions were limited to a single dose-level upwards or downwards.</p>
<p>We also used two interval designs: the Cumulative Cohort design (CCD) [<xref ref-type="bibr" rid="j_nejsds74_ref_031">31</xref>] and the Bayesian Optimal Interval design (BOIN) [<xref ref-type="bibr" rid="j_nejsds74_ref_038">38</xref>]. For the 30th percentile, CCD was used with an interval width of <inline-formula id="j_nejsds74_ineq_110"><alternatives><mml:math>
<mml:mo>±</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\pm 0.1$]]></tex-math></alternatives></inline-formula> as recommended by the authors, and BOIN used the transition and dose-exclusion look-up table generated via the <monospace>get.boundary</monospace> function in the <monospace>BOIN</monospace> package, using the function’s defaults. For the 90th percentile, CCD’s interval width was halved, and BOIN software did not permit calculation of the design rules. For estimation with both interval designs, CIR was used including the bias-mitigation formula.</p>
<p>All simulated experiments were run using the <monospace>dfsim</monospace> simulation utility in <monospace>upndown</monospace>, currently (fall 2024) available only in the package’s development version, but eventually to become available in the CRAN version as well. Post-processing and visualization were done using auxiliary code in R version 4.3.3. The full set of simulation scripts can be found on GitHub under the <monospace>assaforon/UpndownBook</monospace> repository, in the folders <monospace>P2_Estimation</monospace> and <monospace>P3_Practical</monospace>.</p></disp-quote></p>
</sec>
<sec id="j_nejsds74_s_012">
<label>3.2.2</label>
<title>Results: Main Metrics</title>
<p>We follow the phase I field’s conventions, and rather than evaluate point estimates of <inline-formula id="j_nejsds74_ineq_111"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula> using continuous metrics, we identify the dose-level in <inline-formula id="j_nejsds74_ineq_112"><alternatives><mml:math>
<mml:mi mathvariant="script">X</mml:mi></mml:math><tex-math><![CDATA[$\mathcal{X}$]]></tex-math></alternatives></inline-formula> with the smallest <inline-formula id="j_nejsds74_ineq_113"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|\hat{F}-\Gamma |$]]></tex-math></alternatives></inline-formula>, often known as the Maximum Tolerated Dose (MTD) estimate. Phase I simulation studies usually examine what proportion of the ensemble’s runs had the correct MTD estimate (i.e., the MTD estimate was indeed the dose-level with smallest <inline-formula id="j_nejsds74_ineq_114"><alternatives><mml:math>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo stretchy="false">|</mml:mo></mml:math><tex-math><![CDATA[$|F-\Gamma |$]]></tex-math></alternatives></inline-formula>), or whether this estimate falls on a dose whose <italic>F</italic> value is within an “acceptable window” around Γ. Here we adopted the latter criterion; for the 30th percentile we used an “acceptable window” of <inline-formula id="j_nejsds74_ineq_115"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mfenced separators="" open="[" close="]">
<mml:mrow>
<mml:mn>0.2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.4</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[$F\in \left[0.2,0.4\right]$]]></tex-math></alternatives></inline-formula>. We made sure that every <inline-formula id="j_nejsds74_ineq_116"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> curve in the ensemble has at least one dose-level within the “acceptable window”, but no more than three. The Supplementary Material includes analogous summary plots (Figures S1, S2), with the narrower criterion of correct-MTD identification for the 30th percentile simulations.</p>
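<p>For concreteness, the “acceptable window” criterion can be written as a short Python sketch. It is an illustration of the metric as described above, not the post-processing code used for the figures; the function names and the three example runs are hypothetical.</p>
<preformat><![CDATA[
```python
def in_acceptable_window(true_f, selected, window=(0.2, 0.4)):
    """True if the true F at the selected dose level lies inside the window."""
    lo, hi = window
    return lo <= true_f[selected] <= hi


def proportion_acceptable(runs, window=(0.2, 0.4)):
    """Share of runs whose selected dose has a true F value inside the window.

    `runs` is a list of (true_f, selected) pairs: the true F values on the
    dose grid for that run, and the index of the dose chosen as MTD estimate.
    """
    hits = [in_acceptable_window(f, s, window) for f, s in runs]
    return sum(hits) / len(hits)


# Three hypothetical runs on a four-level grid (not simulation output).
example_runs = [
    ([0.05, 0.15, 0.28, 0.50], 2),  # F = 0.28 at the chosen dose: inside
    ([0.10, 0.22, 0.41, 0.65], 2),  # F = 0.41 at the chosen dose: outside
    ([0.02, 0.08, 0.19, 0.33], 3),  # F = 0.33 at the chosen dose: inside
]
print(proportion_acceptable(example_runs))
```
]]></preformat>
<p>Note that the criterion is evaluated on the true <italic>F</italic> of each simulated curve, which is known in a simulation but never in a real experiment.</p>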
<p>The proportion of single-patient 30th percentile simulation runs whose MTD estimate fell within the window is plotted on the <italic>y</italic>-axis of Figure <xref rid="j_nejsds74_fig_003">3</xref>. The <italic>x</italic>-axis shows the ensemble-average DLT rate during the experiment. Since <inline-formula id="j_nejsds74_ineq_117"><alternatives><mml:math>
<mml:mn>30</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$30\% $]]></tex-math></alternatives></inline-formula> is the target rate (marked as a dashed vertical line), rates below, or not far above <inline-formula id="j_nejsds74_ineq_118"><alternatives><mml:math>
<mml:mn>30</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$30\% $]]></tex-math></alternatives></inline-formula>, should be acceptable. Thus, the desirable region of the plot is high and to the left, or at least not too far to the right. For each design we show the mean (dot) and range (lines extending from it) across the 4 starting-point and target-location settings described in Section <xref rid="j_nejsds74_s_011">3.2.1</xref>. Designs more robust to changes in settings will have shorter lines extending from the mean.</p>
<p>On the combination of Figure <xref rid="j_nejsds74_fig_003">3</xref>’s three metrics – dose-finding performance, toxicity and robustness – the <italic>k</italic>-in-a-row UDD (dark red) is among the best, and arguably even the single best overall. The CRM variant with wider “indifference interval” (steel blue) does well on performance. However, its considerable spread suggests less robustness, and there is more to this story, as we shall see soon. Aim-for-Target designs that show similar robustness to <italic>k</italic>-in-a-row are generally lower on performance. CRM with a high prior-predictive MTD has the highest overall DLT rate, as expected. The newest design, BOIN (orange), had disappointingly low performance, and yet did not achieve a lower overall DLT rate than UDD.</p>
<fig id="j_nejsds74_fig_003">
<label>Figure 3</label>
<caption>
<p>Main performance plot from the 30th percentile target simulations with single-patient dose allocations. Additional details are in the text.</p>
</caption>
<graphic xlink:href="nejsds74_g003.jpg"/>
</fig>
<fig id="j_nejsds74_fig_004">
<label>Figure 4</label>
<caption>
<p>Main performance plot from the 30th percentile target simulations with cohorts of size 3. Additional details are in the text.</p>
</caption>
<graphic xlink:href="nejsds74_g004.jpg"/>
</fig>
<p>Figure <xref rid="j_nejsds74_fig_004">4</xref> shows summaries under identical settings except for the use of 3-patient cohorts, a common practice in phase I trials. We have retained the same plot boundaries as Figure <xref rid="j_nejsds74_fig_003">3</xref>, so the first thing to note is a substantial loss of dose-finding performance compared with the single-patient simulations. Much of the loss is due to the most challenging setting, under which experiments start at <inline-formula id="j_nejsds74_ineq_119"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{1}}$]]></tex-math></alternatives></inline-formula> and the target is in <inline-formula id="j_nejsds74_ineq_120"><alternatives><mml:math>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>6</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>7</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:math><tex-math><![CDATA[$\left({d_{6}},{d_{7}}\right)$]]></tex-math></alternatives></inline-formula>; for 4 of the 6 designs, the performance under this setting now falls below the plot’s lower limit of <inline-formula id="j_nejsds74_ineq_121"><alternatives><mml:math>
<mml:mn>60</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$60\% $]]></tex-math></alternatives></inline-formula>. The Supplementary Material includes a version of Figure <xref rid="j_nejsds74_fig_004">4</xref> where the full performance range is visible. Some designs lose altitude across the board: “CRM Wide” in particular loses <inline-formula id="j_nejsds74_ineq_122"><alternatives><mml:math>
<mml:mn>3</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$3\% $]]></tex-math></alternatives></inline-formula> in average performance even when the most challenging setting is excluded. More can be said about the loss of performance when a cohort structure is imposed, and whether the actual benefits of cohorts justify it – but perhaps this is a topic for another article.</p>
<p>Turning to our main business of UDD vs. Aim-for-Target comparison: the UDD variant is Figure <xref rid="j_nejsds74_fig_004">4</xref>’s clear number 1 in dose-finding performance. Conversely, it is also responsible for the single highest-toxicity ensemble – <inline-formula id="j_nejsds74_ineq_123"><alternatives><mml:math>
<mml:mn>32.7</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$32.7\% $]]></tex-math></alternatives></inline-formula> under the setting that starts at <inline-formula id="j_nejsds74_ineq_124"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">d</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${d_{4}}$]]></tex-math></alternatives></inline-formula> – but this is the only setting in which it exceeds <inline-formula id="j_nejsds74_ineq_125"><alternatives><mml:math>
<mml:mn>30</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$30\% $]]></tex-math></alternatives></inline-formula> despite having a balance point of <inline-formula id="j_nejsds74_ineq_126"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>0.347</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}\approx 0.347$]]></tex-math></alternatives></inline-formula>, and on average its toxicity rate is <inline-formula id="j_nejsds74_ineq_127"><alternatives><mml:math>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>25</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$\lt 25\% $]]></tex-math></alternatives></inline-formula>, substantially lower than “CRM High” and similar to “CRM Low”.</p>
<p>For simplicity, we retained the same metrics for the 90th percentile simulations – i.e., establishing a “Best Dose” <italic>(note it is not a “maximum tolerated dose” in this context)</italic> and defining a “desirable window” between the 82.5 and 97.5 percentiles. The rationale for this window is that failure rates nearing <inline-formula id="j_nejsds74_ineq_128"><alternatives><mml:math>
<mml:mn>20</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$20\% $]]></tex-math></alternatives></inline-formula> (i.e., double the target rate) would be deemed too high, and conversely, near-perfect efficacy rates might suggest that we are choosing doses that are excessive for the vast majority of patients. Thus, the set-up for Figure <xref rid="j_nejsds74_fig_005">5</xref> is very similar to that of Figure <xref rid="j_nejsds74_fig_003">3</xref>, but now the best region in the plot is high and to the right, although still perhaps not too far to the right.<xref ref-type="fn" rid="j_nejsds74_fn_004">4</xref><fn id="j_nejsds74_fn_004"><label><sup>4</sup></label>
<p>We reiterate that the anesthesiology field usually prefers continuous target-dose estimates rather than this discrete “Best Dose” approach, although the latter is encountered occasionally. However, even if we presented continuous estimates and metrics, the results would be similar overall.</p></fn></p>
<p>Once again, UDD does very well, and once again, the wide-interval CRM (which here means a half-width of 0.05 vs. 0.025 for the other two variants) shows the worst robustness to changes in setting. BOIN is not shown because the design’s official package refuses to calculate design rules for <inline-formula id="j_nejsds74_ineq_129"><alternatives><mml:math>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0.6</mml:mn></mml:math><tex-math><![CDATA[$\Gamma \gt 0.6$]]></tex-math></alternatives></inline-formula>.</p>
<fig id="j_nejsds74_fig_005">
<label>Figure 5</label>
<caption>
<p>Main performance plot from the 90th percentile target simulations. Additional details are in the text.</p>
</caption>
<graphic xlink:href="nejsds74_g005.jpg"/>
</fig>
</sec>
<sec id="j_nejsds74_s_013">
<label>3.2.3</label>
<title>Results: Number-Treated-in-Window Metric</title>
<p>In Aim-for-Target editorials and simulation articles, it is commonplace to discuss and examine the metric of how many patients during the experiment were treated at the true MTD, or within the “acceptable window” around it. We call this metric <inline-formula id="j_nejsds74_ineq_130"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> for brevity. It does not strike us as originating from practitioners; there is a good chance that had practitioners come up with such a metric, statisticians would have told them it is no less than circular reasoning. If the true MTD is known, then the experiment is not needed; and since it is not known, the expectation that most patients be treated at it during a small-sample, binary-<italic>Y</italic> experiment is unrealistic.</p>
<p>On a possibly related topic, a hallmark of Aim-for-Target design behavior is the tendency to “settle in” relatively quickly on the same dose-level for long stretches. This behavior is very widely misinterpreted as “convergence”, even leading to widespread adoption of early-stopping rules designed around it. Oron and Hoff [<xref ref-type="bibr" rid="j_nejsds74_ref_048">48</xref>] argued that since Aim-for-Target convergence is tied to the convergence of <italic>F</italic> estimates, and the latter proceeds at a root-<italic>n</italic> rate, <italic>“late-stage convergence”</italic> (loosely speaking, when behavior does not change anymore because the estimates have in fact gotten very close to their asymptotic value) is not observable at the rather small phase I sample sizes, barring the occasional lucky individual sample. Instead, the settling behavior is a side-effect of design rules, because the same model is refitted at each step with nearly the same data. Oron and Hoff also provided simulation evidence that aggressive early settling-in is unrelated, or even inversely related, to estimation performance, and therefore stopping rules based on this behavior might be detrimental. Regardless, for our narrow purposes here, we note that this settling-in behavior tends to drive <inline-formula id="j_nejsds74_ineq_131"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> up for Aim-for-Target designs under favorable conditions, surely compared to UDDs and their random walk.</p>
<fig id="j_nejsds74_fig_006">
<label>Figure 6</label>
<caption>
<p>Distributions of the run-specific number of patients treated at acceptable dose-levels, from the 30th percentile single-patient simulations.</p>
</caption>
<graphic xlink:href="nejsds74_g006.jpg"/>
</fig>
<p>We examine <inline-formula id="j_nejsds74_ineq_132"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula>, but instead of ensemble averages we look at the entire ensemble distribution, as Oron and Hoff did in 2013. Figure <xref rid="j_nejsds74_fig_006">6</xref> shows the ensemble distributions of <inline-formula id="j_nejsds74_ineq_133"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> for the single-patient 30th percentile simulations, across 3 settings and 5 designs. Each pane represents a 500-run ensemble, with individual runs (single virtual “experiments”) differing both by their <inline-formula id="j_nejsds74_ineq_134"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x)$]]></tex-math></alternatives></inline-formula> curve and by their set of random response thresholds (“patients”). In contrast, the columns differ only by the location of <inline-formula id="j_nejsds74_ineq_135"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="normal">Γ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F^{-1}}(\Gamma )$]]></tex-math></alternatives></inline-formula>.</p>
<p>While the UDD histograms (bottom row) show a clear peak with tails, most Aim-for-Target histograms are not too far removed from a uniform distribution. This means that it is anyone’s guess whether the trial will be spent almost entirely within the acceptable window, almost entirely outside of it, or somewhere between those extremes. The CRM with a wider “indifference interval” (second row from bottom) is particularly volatile – and in the least favorable setting (rightmost column), the most common single value of <inline-formula id="j_nejsds74_ineq_136"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> for this design is zero. Overall, Aim-for-Target ensemble-average <inline-formula id="j_nejsds74_ineq_137"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> values are <inline-formula id="j_nejsds74_ineq_138"><alternatives><mml:math>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mn>10</mml:mn>
<mml:mo>−</mml:mo>
<mml:mn>25</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$\sim 10-25\% $]]></tex-math></alternatives></inline-formula> higher than <italic>k</italic>-in-a-row, but their ensemble standard deviations of <inline-formula id="j_nejsds74_ineq_139"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> are <inline-formula id="j_nejsds74_ineq_140"><alternatives><mml:math>
<mml:mo stretchy="false">∼</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">x</mml:mi></mml:math><tex-math><![CDATA[$\sim 2x$]]></tex-math></alternatives></inline-formula> higher.</p>
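<p>To make the metric concrete, the following is a minimal sketch of how an ensemble distribution of the number treated in the window could be generated for a <italic>k</italic>-in-a-row UDD. It is not the simulation code behind Figure <xref rid="j_nejsds74_fig_006">6</xref>. The toxicity curve <monospace>TOX_PROBS</monospace>, the window limits, the sample size of 30, the starting dose and all function names are hypothetical choices made only for illustration; in particular, the sketch keeps one fixed curve across runs, whereas the ensembles described above vary the curve from run to run. The rule coded here (de-escalate after any toxicity, escalate after <italic>k</italic> consecutive non-toxicities at the current dose) is the usual description of <italic>k</italic>-in-a-row, with <italic>k</italic> = 2 balancing near the 29th percentile.</p>
<preformat preformat-type="code"><![CDATA[
```python
import random
import statistics

# Hypothetical toxicity curve over 7 dose levels (an assumption, not from the article).
TOX_PROBS = [0.05, 0.10, 0.18, 0.25, 0.35, 0.50, 0.65]
# Assumed "acceptable window" of true toxicity rates around a 0.30 target.
WINDOW = (0.20, 0.40)


def k_in_a_row_path(tox_probs, k=2, n=30, start=0, rng=random):
    """Return the dose indices visited by one k-in-a-row run of n patients."""
    level, streak, path = start, 0, []
    top = len(tox_probs) - 1
    for _ in range(n):
        path.append(level)
        if rng.random() < tox_probs[level]:
            # Any toxicity: de-escalate (if possible) and reset the streak.
            level, streak = max(level - 1, 0), 0
        else:
            streak += 1
            if streak == k:
                # k consecutive non-toxicities at this dose: escalate.
                level, streak = min(level + 1, top), 0
    return path


def n_star(path, tox_probs, window=WINDOW):
    """Count patients treated at doses whose true toxicity rate lies in the window."""
    lo, hi = window
    return sum(1 for d in path if lo <= tox_probs[d] <= hi)


def ensemble_n_star(runs=500, seed=1, **kwargs):
    """Simulate an ensemble of runs and return the per-run n* values."""
    rng = random.Random(seed)
    return [n_star(k_in_a_row_path(TOX_PROBS, rng=rng, **kwargs), TOX_PROBS)
            for _ in range(runs)]


if __name__ == "__main__":
    values = ensemble_n_star()
    print("ensemble mean of n*:", statistics.mean(values))
    print("ensemble SD of n*:", round(statistics.stdev(values), 2))
```
]]></preformat>
<p>Plotting a histogram of the returned values, rather than reporting only their mean, corresponds to the ensemble-distribution view taken in this section.</p>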
<fig id="j_nejsds74_fig_007">
<label>Figure 7</label>
<caption>
<p>Distributions of the run-specific number of cohorts treated at acceptable dose-levels, from the 30th percentile cohort simulations.</p>
</caption>
<graphic xlink:href="nejsds74_g007.jpg"/>
</fig>
<p>Figure <xref rid="j_nejsds74_fig_007">7</xref> shows analogous distributions from the cohort simulations, counting allocated cohorts of size 3 instead of single patients since all patients in each cohort receive the same dose. Some differences between UDD and Aim-for-Target appear less dramatic here, both because of the strong constraint imposed by the use of cohorts, and because GUD<inline-formula id="j_nejsds74_ineq_141"><alternatives><mml:math>
<mml:msub>
<mml:mrow/>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${_{(3,0,2)}}$]]></tex-math></alternatives></inline-formula> allows for the same dose to be repeated over more consecutive observations than the single-patient UDD. Indeed, average <inline-formula id="j_nejsds74_ineq_142"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> values for GUD<inline-formula id="j_nejsds74_ineq_143"><alternatives><mml:math>
<mml:msub>
<mml:mrow/>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${_{(3,0,2)}}$]]></tex-math></alternatives></inline-formula> are similar to those of the other designs. It does still have the lowest standard deviation of <inline-formula id="j_nejsds74_ineq_144"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${n^{\ast }}$]]></tex-math></alternatives></inline-formula> in each setting – although not by a factor of 2 as in the single-patient simulations. As seen in other simulation results, “CRM Wide” once again is the least robust, this time showing marked sensitivity both between settings and between individual runs. Under the least favorable setting, <inline-formula id="j_nejsds74_ineq_145"><alternatives><mml:math>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>25</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$\gt 25\% $]]></tex-math></alternatives></inline-formula> of this design’s runs never made it into the “acceptable window” during the experiment.</p>
</sec>
</sec>
</sec>
<sec id="j_nejsds74_s_014">
<label>4</label>
<title>Discussion</title>
<p>UDDs are widely used, with a long track record of reliability. They are high-performing, flexible and modifiable. We strongly recommend that pertinent application fields in which UDDs are currently not part of the mainstream discussion consider them again. For phase I and similar clinical toxicity trials in particular, UDDs have the dual advantage of being simpler and more tractable than ‘3+3’, which might appeal to practitioners and regulators, yet performing at least as well as the best novel “Aim-for-Target” designs, which should appeal to everyone.</p>
<p>In that context, UDD’s random walk is often faulted because it allows a dose experiencing multiple prior toxicities to be visited again, more readily than most estimation-based designs. This valid concern is not limited to UDDs, and a simple, generic solution is known and is applicable to UDDs as well: incorporate a dose-exclusion rule based on current information. Such rules have been proposed for UDDs at least once [<xref ref-type="bibr" rid="j_nejsds74_ref_047">47</xref>]. We note that regardless of design, many of these rules ignore the bias in <bold>R</bold> and hence tend to be too aggressive. Also regardless of design, there is some loss of performance in exchange for reducing the risk of visiting high doses. We plan to examine new ideas for UDD-specific dose-exclusion rules, which in contrast to the above will be cognizant of the bias, and might end up improving the design’s dose-finding performance.</p>
<p>There are many opportunities for further extensions and improvements to UDDs. For example, in anesthesiology a key adverse-response endpoint is change in blood pressure. Given the volatility of blood pressure readings, it seems more sensible to discretize this continuous measure as ordered-ternary <italic>Y</italic> (decrease, inconsequential change, increase) rather than to dichotomize it. Fortunately, a UDD extension to accommodate ordered-ternary <italic>Y</italic> will probably be simple and straightforward, like the UDD extensions mentioned in Section <xref rid="j_nejsds74_s_004">2.2</xref>. A more sophisticated potential extension using the full range of ordinal toxicity-grade data was explored briefly 20 years ago in the context of phase I designs, and can also be followed up on [<xref ref-type="bibr" rid="j_nejsds74_ref_055">55</xref>]. Another potential extension is related to GUD<inline-formula id="j_nejsds74_ineq_146"><alternatives><mml:math>
<mml:msub>
<mml:mrow/>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${_{(3,0,2)}}$]]></tex-math></alternatives></inline-formula> which fared well in Section <xref rid="j_nejsds74_s_010">3.2</xref>’s cohort-based comparative simulation. As mentioned earlier its balance point is <inline-formula id="j_nejsds74_ineq_147"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>0.347</mml:mn></mml:math><tex-math><![CDATA[${p^{\ast }}\approx 0.347$]]></tex-math></alternatives></inline-formula>, a tad high if the target is <inline-formula id="j_nejsds74_ineq_148"><alternatives><mml:math>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\le 0.3$]]></tex-math></alternatives></inline-formula>. One could propose a modification whereby in case 1 toxicity out of 3 is observed, a biased coin is tossed to determine whether to repeat the dose or de-escalate. Such a variant could target, e.g., the 30th or the 25th percentile, depending upon coin probability. Baldi Antognini <italic>et al.</italic> examined GUD with biased coins some time ago, but their exploration was generic rather than focusing on concrete experimental applications and their specific properties [<xref ref-type="bibr" rid="j_nejsds74_ref_004">4</xref>]. Last but not least, in sensory studies it is common to run a UDD experiment on a single participant, who repeatedly reports whether they notice a stimulus as its intensity varies up and down. This introduces additional dependence to the observations, as well as “drifts” in response due to fatigue, etc. While the sensory-studies field has been cognizant of these issues, we feel that their impact upon UDD properties and the potential implications for design and estimation have not been studied thoroughly.</p>
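<p>The arithmetic behind such a biased-coin variant can be sketched as follows. The sketch assumes that GUD<sub>(3,0,2)</sub> escalates after 0 toxicities in a cohort of 3 and de-escalates after 2 or more, and it takes the balance point to be the toxicity rate at which the escalation and de-escalation probabilities are equal. The function names, and the parameterization of the coin as the probability of de-escalating after exactly 1 toxicity out of 3, are ours and serve only as an illustration, not as a worked-out design.</p>
<preformat preformat-type="code"><![CDATA[
```python
from math import comb


def binom_pmf(x, m, p):
    """Probability of exactly x toxicities among m patients at toxicity rate p."""
    return comb(m, x) * p**x * (1 - p)**(m - x)


def drift(p, coin=0.0):
    """P(escalate) minus P(de-escalate) for a cohort of 3 at toxicity rate p.

    coin is the assumed probability of de-escalating (rather than repeating
    the dose) when exactly 1 toxicity out of 3 is observed; coin=0 gives the
    plain rule with no coin toss.
    """
    up = binom_pmf(0, 3, p)
    down = binom_pmf(2, 3, p) + binom_pmf(3, 3, p) + coin * binom_pmf(1, 3, p)
    return up - down


def balance_point(coin=0.0, tol=1e-10):
    """Find the rate where drift is zero by bisection (drift decreases in p)."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if drift(mid, coin) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def coin_for_target(target):
    """Coin probability that moves the balance point to the given target rate."""
    surplus = binom_pmf(0, 3, target) - binom_pmf(2, 3, target) - binom_pmf(3, 3, target)
    return surplus / binom_pmf(1, 3, target)


if __name__ == "__main__":
    print(f"balance point without coin: {balance_point():.3f}")   # 0.347
    for target in (0.30, 0.25):
        print(f"target {target:.2f}: coin = {coin_for_target(target):.3f}")
```
]]></preformat>
<p>Under these assumptions the sketch reproduces the balance point of about 0.347 without a coin, and suggests de-escalation probabilities of roughly 0.29 and 0.63 after 1 toxicity out of 3 for targets of 0.30 and 0.25, respectively.</p>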
<p>Given the paucity of person-hours devoted to UDD methodology in recent decades, even better opportunities surely await the intrepid researcher. A sense of how the methodology has progressed due to the efforts of the few can be attained by comparing the 2007 Anesthesiology UDD tutorial by Pace and Stylianou [<xref ref-type="bibr" rid="j_nejsds74_ref_053">53</xref>], the chapter written by us for a 2015 experimental-design handbook [<xref ref-type="bibr" rid="j_nejsds74_ref_016">16</xref>], and the 2022 Anesthesiology tutorial written by us in collaboration with a senior anesthesiologist [<xref ref-type="bibr" rid="j_nejsds74_ref_052">52</xref>]. We are thrilled to be in the final stages of completing the first-ever book solely dedicated to UDDs, which contains further developments. We would love to see younger researchers taking up the challenges presented there.</p>
<p>We end on a philosophical note. While it is likely that UDDs sprang out of common sense and intuition rather than deep theoretical introspection, they seem to have hit a sweet spot with respect to the handling of uncertainty in a highly constrained, low-information problem. UDDs do not attempt to control the dose-allocation process too tightly; instead, their rules leverage uncertainty to generate random walks with reasonable behavior and good data-collection properties. By contrast, ‘3+3’ and similar escalation designs place very tight constraints on the number of DLTs in the trial. They generally succeed in stopping experiments quickly with few DLTs, but the price is very poor estimation performance, defeating the phase I trial’s entire purpose. At the opposite end, Aim-for-Target designs introduce considerable complexity in the attempt to tame uncertainty via repeated estimation, which in practice plays out as declaring a “best dose” early on based on minimal evidence, and sticking with it until proven otherwise. If this early bet is wrong, correcting it might require longer than the entire experiment’s duration. Therefore, as some have suggested in a more general context, it may be that letting go just a little bit, rather than trying to control randomness forcibly, is the winning approach all things considered [<xref ref-type="bibr" rid="j_nejsds74_ref_062">62</xref>].</p>
</sec>
</body>
<back>
<ack id="j_nejsds74_ack_001">
<title>Acknowledgements</title>
<p>We thank the anonymous reviewers, whose insightful and inquisitive comments have helped to improve the manuscript substantially.</p></ack>
<ref-list id="j_nejsds74_reflist_001">
<title>References</title>
<ref id="j_nejsds74_ref_001">
<label>[1]</label><mixed-citation publication-type="journal"><string-name><surname>Anscombe</surname>, <given-names>F. J.</given-names></string-name> (<year>1956</year>). <article-title>On Estimating Binomial Response Relations</article-title>. <source>Biometrika</source> <volume>43</volume> <fpage>461</fpage>–<lpage>464</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/43.3-4.461" xlink:type="simple">https://doi.org/10.1093/biomet/43.3-4.461</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0081598">MR0081598</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_002">
<label>[2]</label><mixed-citation publication-type="other"><string-name><surname>ASTM</surname></string-name> (<year>1991</year>). Standard test method for estimating acute oral toxicity in rats. <italic>American Society for Testing and Materials</italic>. 1163–90.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_003">
<label>[3]</label><mixed-citation publication-type="journal"><string-name><surname>Babb</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Rogatko</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Zacks</surname>, <given-names>S.</given-names></string-name> (<year>1998</year>). <article-title>Cancer Phase I Clinical Trials: Efficient Dose Escalation with Overdose Control</article-title>. <source>Stat. Med.</source> <volume>17</volume> <fpage>1103</fpage>–<lpage>1120</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_004">
<label>[4]</label><mixed-citation publication-type="journal"><string-name><surname>Baldi Antognini</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Bortot</surname>, <given-names>P.</given-names></string-name> and <string-name><surname>Giovagnoli</surname>, <given-names>A.</given-names></string-name> (<year>2008</year>). <article-title>Randomized group up and down experiments</article-title>. <source>Annals of the Institute of Statistical Mathematics</source> <volume>60</volume> <fpage>45</fpage>–<lpage>59</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s10463-006-0081-5" xlink:type="simple">https://doi.org/10.1007/s10463-006-0081-5</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2400060">MR2400060</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_005">
<label>[5]</label><mixed-citation publication-type="journal"><string-name><surname>Brownlee</surname>, <given-names>K. A.</given-names></string-name>, <string-name><surname>Hodges Jr.</surname>, <given-names>J. L.</given-names></string-name> and <string-name><surname>Rosenblatt</surname>, <given-names>M.</given-names></string-name> (<year>1953</year>). <article-title>The up-and-down method with small samples</article-title>. <source>JASA</source> <volume>48</volume> <fpage>262</fpage>–<lpage>277</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0055644">MR0055644</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_006">
<label>[6]</label><mixed-citation publication-type="chapter"><string-name><surname>Carter</surname>, <given-names>S. K.</given-names></string-name> (<year>1973</year>). <chapter-title>Study design principles in the clinical evaluation of new drugs as developed by the chemotherapy programme of the National Cancer Institute</chapter-title>. In <source>The Design of Clinical Trials in Cancer Therapy</source> (<string-name><given-names>M. J.</given-names> <surname>Staquet</surname></string-name>, ed.) <fpage>242</fpage>–<lpage>289</lpage>. <publisher-name>Editions Scientific Europe</publisher-name>, <publisher-loc>Brussels</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_007">
<label>[7]</label><mixed-citation publication-type="book"><string-name><surname>Cormen</surname>, <given-names>T. H.</given-names></string-name>, <string-name><surname>Leiserson</surname>, <given-names>C. E.</given-names></string-name>, <string-name><surname>Rivest</surname>, <given-names>R. L.</given-names></string-name> and <string-name><surname>Stein</surname>, <given-names>C.</given-names></string-name> (<year>2022</year>) <source>Introduction to algorithms</source>, <edition>4</edition>th Edition. <publisher-name>MIT press</publisher-name>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2572804">MR2572804</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_008">
<label>[8]</label><mixed-citation publication-type="journal"><string-name><surname>Derman</surname>, <given-names>C.</given-names></string-name> (<year>1957</year>). <article-title>Non-parametric up-and-down experimentation</article-title>. <source>Ann. Math. Stat.</source> <volume>28</volume> <fpage>795</fpage>–<lpage>798</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aoms/1177706895" xlink:type="simple">https://doi.org/10.1214/aoms/1177706895</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0090956">MR0090956</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_009">
<label>[9]</label><mixed-citation publication-type="journal"><string-name><surname>Diaconis</surname>, <given-names>P.</given-names></string-name> and <string-name><surname>Stroock</surname>, <given-names>D.</given-names></string-name> (<year>1991</year>). <article-title>Geometric Bounds for Eigenvalues of Markov Chains</article-title>. <source>Ann. App. Prob.</source> <volume>1</volume> <fpage>36</fpage>–<lpage>61</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1097463">MR1097463</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_010">
<label>[10]</label><mixed-citation publication-type="journal"><string-name><surname>Dixon</surname>, <given-names>W. J.</given-names></string-name> and <string-name><surname>Mood</surname>, <given-names>A.</given-names></string-name> (<year>1948</year>). <article-title>A method for obtaining and analyzing sensitivity data</article-title>. <source>JASA</source> <volume>43</volume> <fpage>109</fpage>–<lpage>126</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_011">
<label>[11]</label><mixed-citation publication-type="book"><string-name><surname>DOD</surname></string-name> (<year>2001</year>). <source>MIL-STD-1751A – Safety and Performance Tests for the Qualification of Explosives (high explosives, propellants, and pyrotechnics)</source>. <publisher-name>United States Department of Defense</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_012">
<label>[12]</label><mixed-citation publication-type="chapter"><string-name><surname>Durham</surname>, <given-names>S. D.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>1995</year>). <chapter-title>Up-and-down Designs I: Stationary Treatment Distributions</chapter-title>. In <source>Adaptive Designs</source> (<string-name><given-names>N.</given-names> <surname>Flournoy</surname></string-name> and <string-name><given-names>W. F.</given-names> <surname>Rosenberger</surname></string-name>, eds.) <fpage>139</fpage>–<lpage>157</lpage>. <publisher-name>Institute of Mathematical Statistics</publisher-name>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/lnms/1215451483" xlink:type="simple">https://doi.org/10.1214/lnms/1215451483</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1477678">MR1477678</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_013">
<label>[13]</label><mixed-citation publication-type="journal"><string-name><surname>Durham</surname>, <given-names>S. D.</given-names></string-name>, <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> and <string-name><surname>Rosenberger</surname>, <given-names>W. F.</given-names></string-name> (<year>1997</year>). <article-title>A Random Walk Rule for Phase I Clinical Trials</article-title>. <source>Biometrics</source> <volume>53</volume> <fpage>745</fpage>–<lpage>760</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_014">
<label>[14]</label><mixed-citation publication-type="chapter"><string-name><surname>Durham</surname>, <given-names>S. D.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>1994</year>). <chapter-title>Random Walks for Quantile Estimation</chapter-title>. In <source>Statistical Decision Theory and Related Topics, V</source> (<string-name><given-names>S. S.</given-names> <surname>Gupta</surname></string-name> and <string-name><given-names>J. O.</given-names> <surname>Berger</surname></string-name>, eds.) <fpage>467</fpage>–<lpage>476</lpage>. <publisher-name>Springer-Verlag Inc.</publisher-name> <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1286322">MR1286322</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_015">
<label>[15]</label><mixed-citation publication-type="journal"><string-name><surname>Firth</surname>, <given-names>D.</given-names></string-name> (<year>1993</year>). <article-title>Bias reduction of maximum likelihood estimates</article-title>. <source>Biometrika</source> <volume>80</volume>(<issue>1</issue>) <fpage>27</fpage>–<lpage>38</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/80.1.27" xlink:type="simple">https://doi.org/10.1093/biomet/80.1.27</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1225212">MR1225212</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_016">
<label>[16]</label><mixed-citation publication-type="chapter"><string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> and <string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> (<year>2015</year>). <chapter-title>Up-and-down designs for dose-finding</chapter-title>. In <source>Handbook of Design and Analysis of Experiments</source> (<string-name><given-names>D.</given-names> <surname>Bingham</surname></string-name>, <string-name><given-names>A. M.</given-names> <surname>Dean</surname></string-name>, <string-name><given-names>M.</given-names> <surname>Morris</surname></string-name> and <string-name><given-names>J.</given-names> <surname>Stufken</surname></string-name>, eds.) <comment>24</comment>, <fpage>862</fpage>–<lpage>898</lpage>. <publisher-name>CRC Press, Chapman Hall</publisher-name>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3699370">MR3699370</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_017">
<label>[17]</label><mixed-citation publication-type="journal"><string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> and <string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> (<year>2020</year>). <article-title>Bias Induced by Adaptive dose-finding designs</article-title>. <source>Journal of Applied Statistics</source> <volume>47</volume>(<issue>13-15</issue>) <fpage>2431</fpage>–<lpage>2442</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/02664763.2019.1649375" xlink:type="simple">https://doi.org/10.1080/02664763.2019.1649375</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4149564">MR4149564</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_018">
<label>[18]</label><mixed-citation publication-type="journal"><string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Durham</surname>, <given-names>S. D.</given-names></string-name> and <string-name><surname>Rosenberger</surname>, <given-names>W. F.</given-names></string-name> (<year>1995</year>). <article-title>Toxicity in Sequential Dose-response Experiments</article-title>. <source>Sequential Analysis</source> <volume>14</volume> <fpage>217</fpage>–<lpage>227</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/07474949508836333" xlink:type="simple">https://doi.org/10.1080/07474949508836333</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1365660">MR1365660</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_019">
<label>[19]</label><mixed-citation publication-type="journal"><string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Moler</surname>, <given-names>J.</given-names></string-name> and <string-name><surname>Plo</surname>, <given-names>F.</given-names></string-name> (<year>2020</year>). <article-title>Performance measures in dose-finding experiments</article-title>. <source>International Statistical Review</source> <volume>88</volume>(<issue>3</issue>) <fpage>728</fpage>–<lpage>751</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/insr.12363" xlink:type="simple">https://doi.org/10.1111/insr.12363</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4180676">MR4180676</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_020">
<label>[20]</label><mixed-citation publication-type="journal"><string-name><surname>García-Pérez</surname>, <given-names>M. A.</given-names></string-name> (<year>1998</year>). <article-title>Forced-Choice staircases with fixed step sizes: asymptotic and small-sample properties</article-title>. <source>Vision Res.</source> <volume>38</volume> <fpage>1861</fpage>–<lpage>1881</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_021">
<label>[21]</label><mixed-citation publication-type="journal"><string-name><surname>George</surname>, <given-names>R. B.</given-names></string-name>, <string-name><surname>McKeen</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Columb</surname>, <given-names>M. O.</given-names></string-name> and <string-name><surname>Habib</surname>, <given-names>A. S.</given-names></string-name> (<year>2010</year>). <article-title>Up-down determination of the 90% effective dose of phenylephrine for the treatment of spinal anesthesia-induced hypotension in parturients undergoing cesarean delivery</article-title>. <source>Anesthesia &amp; Analgesia</source> <volume>110</volume>(<issue>1</issue>) <fpage>154</fpage>–<lpage>158</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_022">
<label>[22]</label><mixed-citation publication-type="other"><string-name><surname>Gezmu</surname>, <given-names>M.</given-names></string-name> The geometric up-and-down design for allocating dosage levels (1996). PhD thesis, American University, Washington, DC. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2695534">MR2695534</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_023">
<label>[23]</label><mixed-citation publication-type="journal"><string-name><surname>Gezmu</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>2006</year>). <article-title>Group up-and-down designs for dose-finding</article-title>. <source>J Stat. Plan. Inf.</source> <volume>136</volume>(<issue>6</issue>) <fpage>1749</fpage>–<lpage>1764</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jspi.2005.08.002" xlink:type="simple">https://doi.org/10.1016/j.jspi.2005.08.002</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2255594">MR2255594</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_024">
<label>[24]</label><mixed-citation publication-type="journal"><string-name><surname>Giovagnoli</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Pintacuda</surname>, <given-names>N.</given-names></string-name> (<year>1998</year>). <article-title>Properties of Frequency Distributions Induced by General ‘up-and-down’ Methods for Estimating Quantiles</article-title>. <source>J Stat. Plan. Inf.</source> <volume>74</volume> <fpage>51</fpage>–<lpage>63</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/S0378-3758(98)00076-7" xlink:type="simple">https://doi.org/10.1016/S0378-3758(98)00076-7</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1665120">MR1665120</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_025">
<label>[25]</label><mixed-citation publication-type="journal"><string-name><surname>Gorla</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Rosa</surname>, <given-names>F.</given-names></string-name>, <string-name><surname>Conrado</surname>, <given-names>E.</given-names></string-name> and <string-name><surname>Concli</surname>, <given-names>F.</given-names></string-name> (<year>2017</year>). <article-title>Bending Fatigue Strength of Case Carburized and Nitrided Gear Steels for Aeronautical Applications</article-title>. <source>International Journal of Applied Engineering Research</source> <volume>12</volume>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_026">
<label>[26]</label><mixed-citation publication-type="journal"><string-name><surname>Heijmans</surname>, <given-names>R.</given-names></string-name> (<year>1999</year>). <article-title>When does the expectation of a ratio equal the ratio of expectations?</article-title> <source>Statistical Papers</source> <volume>40</volume> <fpage>107</fpage>–<lpage>115</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/BF02927114" xlink:type="simple">https://doi.org/10.1007/BF02927114</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1668879">MR1668879</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_027">
<label>[27]</label><mixed-citation publication-type="book"><string-name><surname>Hughes</surname>, <given-names>B. D.</given-names></string-name> (<year>1995</year>) <source>Random Walks and Random Environments. Vol. 1</source>. <series>Oxford Science Publications</series>. <publisher-name>The Clarendon Press Oxford University Press</publisher-name>, <publisher-loc>New York</publisher-loc>. <comment>Random walks</comment>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1420619">MR1420619</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_028">
<label>[28]</label><mixed-citation publication-type="journal"><string-name><surname>Iasonos</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>O’Quigley</surname>, <given-names>J.</given-names></string-name> (<year>2014</year>). <article-title>Adaptive dose-finding studies: a review of model-guided phase I clinical trials</article-title>. <source>Journal of Clinical Oncology</source> <volume>32</volume>(<issue>23</issue>) <fpage>2505</fpage>–<lpage>2511</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_029">
<label>[29]</label><mixed-citation publication-type="book"><collab>ISO</collab> (<year>2012</year>). <source>International Organization for Standardization</source>. <comment>12107 Metallic materials–Fatigue testing–Statistical planning and analysis of data</comment>. <publisher-name>Geneva</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_030">
<label>[30]</label><mixed-citation publication-type="book"><collab>ISO</collab> (<year>2016</year>). <source>International Organization for Standardization</source>. <comment>14801 Dentistry–Implants–Dynamic loading test for endosseous dental implants</comment>. <publisher-name>Geneva</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_031">
<label>[31]</label><mixed-citation publication-type="journal"><string-name><surname>Ivanova</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> and <string-name><surname>Chung</surname>, <given-names>Y.</given-names></string-name> (<year>2007</year>). <article-title>Cumulative cohort design for dose-finding</article-title>. <source>J Stat. Plan. Inf.</source> <volume>137</volume> <fpage>2316</fpage>–<lpage>2317</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jspi.2006.07.009" xlink:type="simple">https://doi.org/10.1016/j.jspi.2006.07.009</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2325437">MR2325437</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_032">
<label>[32]</label><mixed-citation publication-type="journal"><string-name><surname>Ivanova</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Montazer-Haghighi</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Mohanty</surname>, <given-names>S. G.</given-names></string-name> and <string-name><surname>Durham</surname>, <given-names>S. D.</given-names></string-name> (<year>2003</year>). <article-title>Improved Up-and-down Designs for Phase I Trials</article-title>. <source>Stat. Med.</source> <volume>22</volume>(<issue>1</issue>) <fpage>69</fpage>–<lpage>82</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_033">
<label>[33]</label><mixed-citation publication-type="journal"><string-name><surname>Ji</surname>, <given-names>Y.</given-names></string-name>, <string-name><surname>Liu</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>Y.</given-names></string-name> and <string-name><surname>Nebiyou Bekele</surname>, <given-names>B.</given-names></string-name> (<year>2010</year>). <article-title>A modified toxicity probability interval method for dose-finding trials</article-title>. <source>Clinical Trials</source> <volume>7</volume>(<issue>6</issue>) <fpage>653</fpage>–<lpage>663</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_034">
<label>[34]</label><mixed-citation publication-type="book"><collab>JSME</collab> (<year>1981</year>). <source>Standard method of statistical fatigue testing</source>. <publisher-name>Japan Society of Mechanical Engineers</publisher-name>, <publisher-loc>Japan</publisher-loc>. <comment>JSME S 002</comment>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_035">
<label>[35]</label><mixed-citation publication-type="other"><string-name><surname>Langlie</surname>, <given-names>H. J.</given-names></string-name> (1962). A Reliability Test Method for “One-Shot” Items. Technical Report No. <elocation-id>U-1792</elocation-id>, Ford Motor Company, Ford Motor Company Aeronautics Division. <uri>https://apps.dtic.mil/sti/citations/tr/ADP014612</uri>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_036">
<label>[36]</label><mixed-citation publication-type="journal"><string-name><surname>Lee</surname>, <given-names>S. M.</given-names></string-name> and <string-name><surname>Cheung</surname>, <given-names>Y. K.</given-names></string-name> (<year>2009</year>). <article-title>Model Calibration in the continual reassessment method</article-title>. <source>Clinical Trials</source> <volume>6</volume> <fpage>227</fpage>–<lpage>238</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_037">
<label>[37]</label><mixed-citation publication-type="journal"><string-name><surname>Lee</surname>, <given-names>S. M.</given-names></string-name> and <string-name><surname>Cheung</surname>, <given-names>Y. K.</given-names></string-name> (<year>2011</year>). <article-title>Calibration of prior variance in the Bayesian continual reassessment method</article-title>. <source>Stat. Med.</source> <volume>30</volume> <fpage>2081</fpage>–<lpage>2089</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/sim.4139" xlink:type="simple">https://doi.org/10.1002/sim.4139</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2829158">MR2829158</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_038">
<label>[38]</label><mixed-citation publication-type="journal"><string-name><surname>Liu</surname>, <given-names>S.</given-names></string-name> and <string-name><surname>Yuan</surname>, <given-names>Y.</given-names></string-name> (<year>2015</year>). <article-title>Bayesian optimal interval designs for phase I clinical trials</article-title>. <source>Journal of the Royal Statistical Society: Series C: Applied Statistics</source> <fpage>507</fpage>–<lpage>523</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssc.12089" xlink:type="simple">https://doi.org/10.1111/rssc.12089</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3325461">MR3325461</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_039">
<label>[39]</label><mixed-citation publication-type="journal"><string-name><surname>Maeda</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Villela-Franyutti</surname>, <given-names>D.</given-names></string-name>, <string-name><surname>Lumbreras-Marquez</surname>, <given-names>M. I.</given-names></string-name>, <string-name><surname>Murthy</surname>, <given-names>A.</given-names></string-name>, <string-name><surname>Fields</surname>, <given-names>K. G.</given-names></string-name>, <string-name><surname>Justice</surname>, <given-names>S.</given-names></string-name> and <string-name><surname>Tsen</surname>, <given-names>L. C.</given-names></string-name> (<year>2023</year>). <article-title>Labor analgesia initiation with Dural puncture Epidural Versus Conventional Epidural techniques: a Randomized biased-Coin Sequential Allocation Trial to determine the effective dose for 90% of patients of Bupivacaine</article-title>. <source>Anesthesia &amp; Analgesia</source> <comment>10–1213</comment>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_040">
<label>[40]</label><mixed-citation publication-type="journal"><string-name><surname>Morris</surname>, <given-names>M. D.</given-names></string-name> (<year>1988</year>). <article-title>Small-Sample Confidence Limits for Parameters under Inequality Constraints with Application to Quantal Bioassay</article-title>. <source>Biometrics</source> <volume>44</volume> <fpage>1083</fpage>–<lpage>1092</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2531737" xlink:type="simple">https://doi.org/10.2307/2531737</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0981001">MR0981001</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_041">
<label>[41]</label><mixed-citation publication-type="other"><string-name><surname>Narayana</surname>, <given-names>T. V.</given-names></string-name> Sequential procedures in probit analysis (1953). PhD thesis, University of North Carolina. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2938682">MR2938682</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_042">
<label>[42]</label><mixed-citation publication-type="book"><collab>NATO</collab> (<year>1999</year>) <source>STANAG 4489 – Explosives, impact sensitivity test</source>. <publisher-name>North Atlantic Treaty Organization</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_043">
<label>[43]</label><mixed-citation publication-type="other"><string-name><surname>NIEHS</surname></string-name> (2001). The revised up-and-down procedure: A Test method for Determining the Acute Oral Toxicity of Chemicals. Technical Report No. <elocation-id>2-4501</elocation-id>, Washington D.C.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_044">
<label>[44]</label><mixed-citation publication-type="journal"><string-name><surname>Novik</surname>, <given-names>G. P.</given-names></string-name> and <string-name><surname>Christensen</surname>, <given-names>D.</given-names></string-name> (<year>2024</year>). <article-title>Increased impact sensitivity in ageing high explosives; analysis of Amatol extracted from explosive remnants of war</article-title>. <source>Royal Society Open Science</source> <volume>11</volume>(<issue>3</issue>) <fpage>231344</fpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_045">
<label>[45]</label><mixed-citation publication-type="book"><collab>OECD</collab> (<year>2022</year>) <source>Test No. 425: Acute Oral Toxicity: Up-and-Down Procedure</source>. <uri>https://www.oecd-ilibrary.org/content/publication/9789264071049-en</uri>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_046">
<label>[46]</label><mixed-citation publication-type="journal"><string-name><surname>O’Quigley</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Pepe</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Fisher</surname>, <given-names>L.</given-names></string-name> (<year>1990</year>). <article-title>Continual reassessment method: a practical design for phase 1 clinical trials in cancer</article-title>. <source>Biometrics</source> <fpage>33</fpage>–<lpage>48</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/2531628" xlink:type="simple">https://doi.org/10.2307/2531628</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1059105">MR1059105</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_047">
<label>[47]</label><mixed-citation publication-type="chapter"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> (<year>2017</year>). <chapter-title>Up-and-Down Designs Enhanced with SPRT Rules for Phase I Cancer Trials</chapter-title>. In <source>Society for Clinical Trials Annual Meeting, Liverpool</source>. <publisher-name>SCT</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_048">
<label>[48]</label><mixed-citation publication-type="journal"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> and <string-name><surname>Hoff</surname>, <given-names>P. D.</given-names></string-name> (<year>2013</year>). <article-title>Small–sample behavior of novel Phase I cancer trial designs</article-title>. <source>Clinical Trials</source> <volume>10</volume>(<issue>1</issue>) <fpage>63</fpage>–<lpage>80</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_049">
<label>[49]</label><mixed-citation publication-type="journal"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>2017</year>). <article-title>Centered isotonic regression: point and interval estimation for dose-response studies</article-title>. <source>Journal of Biopharmaceutical Statistics</source>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_050">
<label>[50]</label><mixed-citation publication-type="journal"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name> and <string-name><surname>Hoff</surname>, <given-names>P. D.</given-names></string-name> (<year>2009</year>). <article-title>The <italic>k</italic>-in-a-row up-and-down design, revisited</article-title>. <source>Stat. Med.</source> <volume>28</volume>(<issue>13</issue>) <fpage>1805</fpage>–<lpage>1820</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1002/sim.3590" xlink:type="simple">https://doi.org/10.1002/sim.3590</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2751599">MR2751599</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_051">
<label>[51]</label><mixed-citation publication-type="journal"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name>, <string-name><surname>Azriel</surname>, <given-names>D.</given-names></string-name> and <string-name><surname>Hoff</surname>, <given-names>P. D.</given-names></string-name> (<year>2011</year>). <article-title>Dose–finding designs: The role of convergence properties.</article-title> <source>Int. J Biostat.</source> <volume>7</volume>(<issue>1</issue>) <fpage>39</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2202/1557-4679.1298" xlink:type="simple">https://doi.org/10.2202/1557-4679.1298</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2873999">MR2873999</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_052">
<label>[52]</label><mixed-citation publication-type="journal"><string-name><surname>Oron</surname>, <given-names>A. P.</given-names></string-name>, <string-name><surname>Souter</surname>, <given-names>M. J.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>2022</year>). <article-title>Understanding research methods: Up-and-down designs for dose-finding</article-title>. <source>Anesthesiology</source> <volume>137</volume>(<issue>2</issue>) <fpage>137</fpage>–<lpage>150</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_053">
<label>[53]</label><mixed-citation publication-type="journal"><string-name><surname>Pace</surname>, <given-names>N. L.</given-names></string-name> and <string-name><surname>Stylianou</surname>, <given-names>M. P.</given-names></string-name> (<year>2007</year>). <article-title>Advances in and Limitations of Up-and-down Methodology: A Précis of Clinical Use, Study Design, and Dose Estimation in Anesthesia Research</article-title>. <source>Anesthesiology</source> <volume>107</volume>(<issue>1</issue>) <fpage>144</fpage>–<lpage>152</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_054">
<label>[54]</label><mixed-citation publication-type="journal"><string-name><surname>Parasuraman</surname>, <given-names>S.</given-names></string-name> (<year>2011</year>). <article-title>Toxicological screening</article-title>. <source>Journal of Pharmacology and Pharmacotherapeutics</source> <volume>2</volume>(<issue>2</issue>) <fpage>74</fpage>–<lpage>79</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.4103/0976-500X.81895" xlink:type="simple">https://doi.org/10.4103/0976-500X.81895</ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_055">
<label>[55]</label><mixed-citation publication-type="journal"><string-name><surname>Paul</surname>, <given-names>R. K.</given-names></string-name>, <string-name><surname>Rosenberger</surname>, <given-names>W. F.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>2004</year>). <article-title>Quantile estimation following non-parametric phase I clinical trials with ordinal response</article-title>. <source>Statistics in Medicine</source> <volume>23</volume>(<issue>16</issue>) <fpage>2483</fpage>–<lpage>2495</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_056">
<label>[56]</label><mixed-citation publication-type="journal"><string-name><surname>Robbins</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Monro</surname>, <given-names>S.</given-names></string-name> (<year>1951</year>). <article-title>A stochastic approximation method</article-title>. <source>Ann. Math. Statistics</source> <volume>22</volume> <fpage>400</fpage>–<lpage>407</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=MR0042668">MR0042668 (13,144j)</ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_057">
<label>[57]</label><mixed-citation publication-type="journal"><string-name><surname>Shi</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Khalij</surname>, <given-names>L.</given-names></string-name>, <string-name><surname>Gautrelet</surname>, <given-names>C.</given-names></string-name>, <string-name><surname>Shi</surname>, <given-names>C.</given-names></string-name> and <string-name><surname>Benasciutti</surname>, <given-names>D.</given-names></string-name> (<year>2024</year>). <article-title>Two-phase optimized experimental design for fatigue limit testing</article-title>. <source>Probabilistic Engineering Mechanics</source> <volume>75</volume> <fpage>103551</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.probengmech.2023.103551" xlink:type="simple">https://doi.org/10.1016/j.probengmech.2023.103551</ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_058">
<label>[58]</label><mixed-citation publication-type="journal"><string-name><surname>Silvapulle</surname>, <given-names>M. J.</given-names></string-name> (<year>1981</year>). <article-title>On the existence of maximum likelihood estimators for the binomial response models</article-title>. <source>Journal of the Royal Statistical Society. Series B (Methodological)</source> <fpage>310</fpage>–<lpage>313</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0637943">MR0637943</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_059">
<label>[59]</label><mixed-citation publication-type="journal"><string-name><surname>Sørensen</surname>, <given-names>C. B.</given-names></string-name>, <string-name><surname>Adams</surname>, <given-names>T. B.</given-names></string-name>, <string-name><surname>Pedersen</surname>, <given-names>E. R.</given-names></string-name>, <string-name><surname>Nielsen</surname>, <given-names>J.</given-names></string-name> and <string-name><surname>Schmidt</surname>, <given-names>J. H.</given-names></string-name> (<year>2023</year>). <article-title>AMTAS<inline-formula id="j_nejsds74_ineq_149"><alternatives><mml:math>
<mml:msup>
<mml:mrow/>
<mml:mrow>
<mml:mi mathvariant="italic">T</mml:mi>
<mml:mi mathvariant="italic">M</mml:mi>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${^{TM}}$]]></tex-math></alternatives></inline-formula> and user-operated smartphone research application audiometry—An evaluation study</article-title>. <source>PLOS ONE</source> <volume>18</volume>(<issue>9</issue>) <fpage>0291412</fpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_060">
<label>[60]</label><mixed-citation publication-type="journal"><string-name><surname>Stylianou</surname>, <given-names>M.</given-names></string-name> and <string-name><surname>Flournoy</surname>, <given-names>N.</given-names></string-name> (<year>2002</year>). <article-title>Dose Finding Using the Biased Coin Up-and-down Design and Isotonic Regression</article-title>. <source>Biometrics</source> <volume>58</volume>(<issue>1</issue>) <fpage>171</fpage>–<lpage>177</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/j.0006-341X.2002.00171.x" xlink:type="simple">https://doi.org/10.1111/j.0006-341X.2002.00171.x</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1891376">MR1891376</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_061">
<label>[61]</label><mixed-citation publication-type="journal"><string-name><surname>Takano</surname>, <given-names>T.</given-names></string-name>, <string-name><surname>Yoshinari</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Sakurai</surname>, <given-names>K.</given-names></string-name> and <string-name><surname>Ueda</surname>, <given-names>T.</given-names></string-name> (<year>2024</year>). <article-title>Cyclic Fatigue Properties of Titanium Alloys for Application in Dental Implants</article-title>. <source>The Bulletin of Tokyo Dental College</source> <comment>2023-0025</comment>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_062">
<label>[62]</label><mixed-citation publication-type="book"><string-name><surname>Taleb</surname>, <given-names>N. N.</given-names></string-name> (<year>2001</year>) <source>Fooled by Randomness</source>. <publisher-name>Random House</publisher-name>, <publisher-loc>New York</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_063">
<label>[63]</label><mixed-citation publication-type="journal"><string-name><surname>Treutwein</surname>, <given-names>B.</given-names></string-name> (<year>1995</year>). <article-title>Minireview: adaptive psychophysical procedures</article-title>. <source>Vision Res.</source> <volume>35</volume> <fpage>2503</fpage>–<lpage>2522</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_064">
<label>[64]</label><mixed-citation publication-type="journal"><string-name><surname>Tsutakawa</surname>, <given-names>R. K.</given-names></string-name> (<year>1967</year>). <article-title>Asymptotic Properties of the Block Up-and-down Method in Bio-assay</article-title>. <source>Ann. Math. Stat.</source> <volume>38</volume> <fpage>1822</fpage>–<lpage>1828</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aoms/1177698615" xlink:type="simple">https://doi.org/10.1214/aoms/1177698615</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0217951">MR0217951</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_065">
<label>[65]</label><mixed-citation publication-type="journal"><string-name><surname>von Békésy</surname>, <given-names>G.</given-names></string-name> (<year>1947</year>). <article-title>A new audiometer</article-title>. <source>Acta Oto-Laryngol.</source> <volume>35</volume> <fpage>411</fpage>–<lpage>422</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_066">
<label>[66]</label><mixed-citation publication-type="journal"><string-name><surname>Watson</surname>, <given-names>A. B.</given-names></string-name> and <string-name><surname>Pelli</surname>, <given-names>D. G.</given-names></string-name> (<year>1983</year>). <article-title>QUEST: A Bayesian adaptive psychometric method</article-title>. <source>Perception &amp; Psychophysics</source> <volume>33</volume> <fpage>113</fpage>–<lpage>120</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_067">
<label>[67]</label><mixed-citation publication-type="journal"><string-name><surname>Wetherill</surname>, <given-names>G. B.</given-names></string-name> (<year>1963</year>). <article-title>Sequential estimation of quantal response curves</article-title>. <source>J Roy. Stat. Soc. B</source> <volume>25</volume> <fpage>1</fpage>–<lpage>48</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_068">
<label>[68]</label><mixed-citation publication-type="journal"><string-name><surname>Wetherill</surname>, <given-names>G. B.</given-names></string-name> and <string-name><surname>Levitt</surname>, <given-names>H.</given-names></string-name> (<year>1965</year>). <article-title>Sequential estimation of points on a psychometric function</article-title>. <source>Brit. J Math. Stat. Psych.</source> <volume>18</volume> <fpage>1</fpage>–<lpage>10</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_069">
<label>[69]</label><mixed-citation publication-type="journal"><string-name><surname>Woolf</surname>, <given-names>B.</given-names></string-name> (<year>1955</year>). <article-title>On estimating the relation between blood group and disease</article-title>. <source>Annals of Human Genetics</source> <volume>19</volume> <fpage>251</fpage>–<lpage>253</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds74_ref_070">
<label>[70]</label><mixed-citation publication-type="journal"><string-name><surname>Wu</surname>, <given-names>C. F. J.</given-names></string-name> (<year>1985</year>). <article-title>Efficient Sequential Designs with Binary Data</article-title>. <source>Journal of the American Statistical Association</source> <volume>80</volume>(<issue>392</issue>) <fpage>974</fpage>–<lpage>984</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0819603">MR0819603</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds74_ref_071">
<label>[71]</label><mixed-citation publication-type="journal"><string-name><surname>Zhao</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Li</surname>, <given-names>X.</given-names></string-name>, <string-name><surname>Tang</surname>, <given-names>N.</given-names></string-name>, <string-name><surname>Jiang</surname>, <given-names>X.</given-names></string-name>, <string-name><surname>Guo</surname>, <given-names>Z.</given-names></string-name> and <string-name><surname>Lin</surname>, <given-names>H.</given-names></string-name> (<year>2018</year>). <article-title>Dielectric properties of fluoronitriles/CO2 and SF6/N2 mixtures as a possible SF6-substitute gas</article-title>. <source>IEEE Transactions on Dielectrics and Electrical Insulation</source> <volume>25</volume>(<issue>4</issue>) <fpage>1332</fpage>–<lpage>1339</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1109/TDEI.2018.007139" xlink:type="simple">https://doi.org/10.1109/TDEI.2018.007139</ext-link>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
