<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="discussion">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">NEJSDS</journal-id>
<journal-title-group><journal-title>The New England Journal of Statistics in Data Science</journal-title></journal-title-group>
<issn pub-type="ppub">2693-7166</issn><issn-l>2693-7166</issn-l>
<publisher>
<publisher-name>New England Statistical Society</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">NEJSDS4C</article-id>
<article-id pub-id-type="doi">10.51387/23-NEJSDS4C</article-id>
<article-categories>
<subj-group subj-group-type="heading"><subject>Commentary and/or Historical Perspective</subject></subj-group>
<subj-group subj-group-type="area"><subject>Statistical Methodology</subject></subj-group>
</article-categories>
<title-group>
<article-title>Discussion of: Four Types of Frequentism and Their Interplay with Bayesianism, by J. Berger<xref ref-type="fn" rid="j_nejsds4c_fn_001"><sup>✩</sup></xref></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Rousseau</surname><given-names>Judith</given-names></name><email xlink:href="mailto:rousseau@ceremade.dauphine.fr">rousseau@ceremade.dauphine.fr</email><xref ref-type="aff" rid="j_nejsds4c_aff_001"/>
</contrib>
<aff id="j_nejsds4c_aff_001">Department of Statistics, <institution>University of Oxford</institution>, <country>UK</country>. E-mail address: <email xlink:href="mailto:rousseau@ceremade.dauphine.fr">rousseau@ceremade.dauphine.fr</email></aff>
</contrib-group>
<author-notes>
<fn id="j_nejsds4c_fn_001"><label>✩</label>
<p>Main article: <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.51387/22-NEJSDS4">https://doi.org/10.51387/22-NEJSDS4</ext-link>.</p></fn>
</author-notes>
<pub-date pub-type="ppub"><year>2023</year></pub-date><pub-date pub-type="epub"><day>4</day><month>9</month><year>2023</year></pub-date><volume>1</volume><issue>2</issue><fpage>145</fpage><lpage>146</lpage><history><date date-type="accepted"><day>16</day><month>8</month><year>2022</year></date></history>
<permissions><copyright-statement>© 2023 New England Statistical Society</copyright-statement><copyright-year>2023</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions><related-article related-article-type="commentary-article" ext-link-type="doi" xlink:href="https://doi.org/10.51387/22-NEJSDS4" id="j_nejsds4c_ppc_001"/><funding-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100010661">Horizon 2020</funding-source><award-id>834175</award-id></award-group><funding-statement>The author has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 834175). </funding-statement></funding-group>
</article-meta>
</front>
<body>
<p>Jim Berger proposes an interesting review of different ways of addressing the problem of error reporting from a frequentist point of view and their connections to Bayesian ways of thinking. In a way this paper echoes Neyman (1977) [<xref ref-type="bibr" rid="j_nejsds4c_ref_002">2</xref>] – at least as far as testing is concerned. Jim Berger has repeatedly made major contributions on the questioning of what makes a relevant measure of uncertainty or reported error and again this article is thought provoking.</p>
<p>Interestingly Neyman in Neyman (1977) [<xref ref-type="bibr" rid="j_nejsds4c_ref_002">2</xref>] justifies (or advocate) the empirical frequentist criteria or error measures, although the Neyman–Pearson is defined as a procedural frequentist approach (in Jim Berger’s terminology). If I agree with Jim Berger’s point that the justification of the Neyman-Pearson procedure from an empirical frequentist point of view is not fully convincing, I don’t quite agree with his arguments. This might be due to the interpretation of the definition of empirical frequentism and a difficulty for me is making sense to this definition which is quite vague.</p>
<p>Let us consider the type I error in a test of a simple null hypothesis versus a simple alternative hypothesis (or not simple, it does not really matter although in the latter the definition of type I and type II errors can be debatable). As in the paper, consider a series of tests with nominal type I error <italic>α</italic> and power <italic>β</italic>. One problem with reporting the type I error only is that it provides a very partial picture of the error (it provides no information if the true distribution is not in the null). Following the example of Section 3.2.1 of the paper and recalling that <inline-formula id="j_nejsds4c_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[$\alpha ={P_{{H_{0i}}({\mathcal{R}_{i}})}}$]]></tex-math></alternatives></inline-formula> is the type I error for each experiment. Hence it only makes sense to report it when <inline-formula id="j_nejsds4c_ineq_002"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${H_{0i}}$]]></tex-math></alternatives></inline-formula> holds (i.e. <inline-formula id="j_nejsds4c_ineq_003"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\theta _{i}}={\theta _{0i}}$]]></tex-math></alternatives></inline-formula>) in which case we have 
<disp-formula id="j_nejsds4c_eq_001">
<label>(1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">→</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \frac{1}{N}{\sum \limits_{i=1}^{N}}{1_{{\theta _{i}}={\theta _{0i}}}}(\alpha -{1_{{x_{i}}\in {\mathcal{R}_{i}}}})\to 0\]]]></tex-math></alternatives>
</disp-formula> 
in probability as soon as the experiments are independent (or more generally as soon as a weak law of large numbers is valid). This is what is suggested in Neyman (1977) [<xref ref-type="bibr" rid="j_nejsds4c_ref_002">2</xref>], pages 108–109. What is not satisfying in (1) is that the reported error only makes sense when non observable events (<inline-formula id="j_nejsds4c_ineq_004"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\theta _{i}}={\theta _{0i}}$]]></tex-math></alternatives></inline-formula>) occur. However since 
<disp-formula id="j_nejsds4c_eq_002">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:munder>
<mml:mrow>
<mml:mo movablelimits="false">lim inf</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munder><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \underset{N}{\liminf }\frac{1}{N}{\sum \limits_{i=1}^{N}}{1_{{\theta _{i}}={\theta _{0i}}}}(\alpha -{1_{{x_{i}}\in {\mathcal{R}_{i}}}})\ge 0,\]]]></tex-math></alternatives>
</disp-formula> 
the type I error <italic>α</italic> can still be viewed as valid from an empirical frequentist view point, but obviously is much less interesting in the latter inequality. It appears very limited as an accuracy measure. In the theory of minimax estimation the typical risk function for a test is the sum of the type I and type II errors: <inline-formula id="j_nejsds4c_ineq_005"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi></mml:math><tex-math><![CDATA[$\alpha +1-\beta $]]></tex-math></alternatives></inline-formula>. Interestingly this quantity suffers from the same drawback as the type I error: 
<disp-formula id="j_nejsds4c_eq_003">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∉</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \frac{1}{N}{\sum \limits_{i=1}^{N}}[{1_{{\theta _{i}}={\theta _{0i}}}}(\alpha -{1_{{x_{i}}\in {\mathcal{R}_{i}}}})+{1_{{\theta _{i}}={\theta _{1i}}}}(1-\beta -{1_{{x_{i}}\notin {\mathcal{R}_{i}}}})]=0,\]]]></tex-math></alternatives>
</disp-formula> 
and needs a reporting strategy which depends on non observables for a long run justification, although the following inequality holds true: 
<disp-formula id="j_nejsds4c_eq_004">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mo stretchy="false">≥</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">]</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo stretchy="false">≥</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∉</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="script">R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">o</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}\alpha +1-\beta & \ge \frac{1}{N}{\sum \limits_{i=1}^{N}}[{1_{{\theta _{i}}={\theta _{0i}}}}\alpha +{1_{{\theta _{i}}={\theta _{1i}}}}(1-\beta )]\\ {} & \ge \frac{1}{N}{\sum \limits_{i=1}^{N}}{1_{{\theta _{i}}={\theta _{0i}}}}{1_{{x_{i}}\in {\mathcal{R}_{i}}}}+{1_{{\theta _{i}}={\theta _{1i}}}}{1_{{x_{i}}\notin {\mathcal{R}_{i}}}}+o(1).\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
The above inequality clearly shows the limit of reporting <inline-formula id="j_nejsds4c_ineq_006"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi></mml:math><tex-math><![CDATA[$\alpha +1-\beta $]]></tex-math></alternatives></inline-formula>: in the long run if the proportion of null and of alternatives are of the same order then 
<disp-formula id="j_nejsds4c_eq_005">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo fence="true" stretchy="false">]</mml:mo>
<mml:mo stretchy="false">≈</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \frac{1}{N}{\sum \limits_{i=1}^{N}}[{1_{{\theta _{i}}={\theta _{0i}}}}\alpha +{1_{{\theta _{i}}={\theta _{1i}}}}(1-\beta )]\approx \frac{\alpha +1-\beta }{2},\]]]></tex-math></alternatives>
</disp-formula> 
which is significantly smaller than <inline-formula id="j_nejsds4c_ineq_007"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi></mml:math><tex-math><![CDATA[$\alpha +1-\beta $]]></tex-math></alternatives></inline-formula>. The same holds if <inline-formula id="j_nejsds4c_ineq_008"><alternatives><mml:math>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo stretchy="false">≈</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi></mml:math><tex-math><![CDATA[$\alpha \approx 1-\beta $]]></tex-math></alternatives></inline-formula> and actually reporting <inline-formula id="j_nejsds4c_ineq_009"><alternatives><mml:math>
<mml:mo movablelimits="false">max</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">α</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">β</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\max (\alpha ,1-\beta )$]]></tex-math></alternatives></inline-formula> is closer to the lower bound. The same reasoning holds for the multiple testing problem under the Bonferroni correction. Again it is not clear to me that the problem with reporting <italic>α</italic> comes from the reported error not being empirical, but rather that it only makes sense when the null is true, which is often interpreted wrongly as: <italic>it only makes sense when the null is rejected.</italic></p>
<p>One of the issues I have with the notion of empirical frequentism is, as I said, that it is quite vague: to which extent need the experiments be related or close to replicates? should the <inline-formula id="j_nejsds4c_ineq_010"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\theta _{i}}$]]></tex-math></alternatives></inline-formula> in the various experiments be considered as deterministic or random? Empirical frequentist justicification of a reported error depends on how we answer these questions. In particular the posterior risk can be validated under the assumption that the <inline-formula id="j_nejsds4c_ineq_011"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">θ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\theta _{i}}$]]></tex-math></alternatives></inline-formula> are random and come from the (same) prior distribution. The targets suggested in Eqs. (3.1), (3.3) or (3.7) of the present paper clearly aim at giving an error measure when the null is rejected (i.e. given that the null is rejected) but this inevitably requires to model at least the probability that the null is verified and typically also the distribution under the alternative, which somewhat involves a prior distribution, say <inline-formula id="j_nejsds4c_ineq_012"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">π</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${\pi _{0}}$]]></tex-math></alternatives></inline-formula> and/or <inline-formula id="j_nejsds4c_ineq_013"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${f_{1}}$]]></tex-math></alternatives></inline-formula>. Reporting error in a statistical test has long been a subject of debate, and much more than in other inference problems (estimation, confidence/credible regions etc) is still largely unresolved. There has been recent growing interests on <italic>E</italic>-values, as measures of accuracy in a testing procedure, as in Shafer (2021) [<xref ref-type="bibr" rid="j_nejsds4c_ref_003">3</xref>] or Grunwald et al. (2020) for instance, which are strongly related to Bayes factors (BF) (see Grunwald et al. (2020) [<xref ref-type="bibr" rid="j_nejsds4c_ref_001">1</xref>]). A difficulty with Bayes factors or <italic>E</italic> values – although some new results and propositions are made towards answering it for the latter – is that their scale is not known. Jim Berger shows that they do not enjoy an empirical frequentist justification either. But what about the log Bayes factors? In the safe test approach, Grunwald et al. (2020) [<xref ref-type="bibr" rid="j_nejsds4c_ref_001">1</xref>] relates <inline-formula id="j_nejsds4c_ineq_014"><alternatives><mml:math>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi></mml:math><tex-math><![CDATA[$\log E$]]></tex-math></alternatives></inline-formula> (which can be viewed as a log-BF) to the Kullback–Leibler divergence between <italic>the alternative</italic> and the null (roughly speaking), when studied under the alternative, which gives an empirical frequentist justification to <inline-formula id="j_nejsds4c_ineq_015"><alternatives><mml:math>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi></mml:math><tex-math><![CDATA[$\log E$]]></tex-math></alternatives></inline-formula>. The fact that <italic>E</italic> (or BF) does not have an empirical frequentist justification while <inline-formula id="j_nejsds4c_ineq_016"><alternatives><mml:math>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mi mathvariant="italic">E</mml:mi></mml:math><tex-math><![CDATA[$\log E$]]></tex-math></alternatives></inline-formula> could have, makes me wonder about the usefulness of considering empirical frequentist justifications.</p>
</body>
<back>
<ref-list id="j_nejsds4c_reflist_001">
<title>References</title>
<ref id="j_nejsds4c_ref_001">
<label>[1]</label><mixed-citation publication-type="chapter"> <string-name><surname>Grünwald</surname>, <given-names>P.</given-names></string-name>, <string-name><surname>de Heide</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Koolen</surname>, <given-names>W. M.</given-names></string-name> (<year>2020</year>). <chapter-title>Safe testing</chapter-title>. In <source>2020 Information Theory and Applications Workshop (ITA)</source> <fpage>1</fpage>–<lpage>54</lpage>. <publisher-name>IEEE</publisher-name>.</mixed-citation>
</ref>
<ref id="j_nejsds4c_ref_002">
<label>[2]</label><mixed-citation publication-type="journal"> <string-name><surname>Neyman</surname>, <given-names>J.</given-names></string-name> (<year>1977</year>). <article-title>Frequentist probability and frequentist statistics</article-title>. <source>Synthese</source> <volume>36</volume> <fpage>97</fpage>–<lpage>131</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/BF00485695" xlink:type="simple">https://doi.org/10.1007/BF00485695</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0652325">MR0652325</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds4c_ref_003">
<label>[3]</label><mixed-citation publication-type="journal"> <string-name><surname>Shafer</surname>, <given-names>G.</given-names></string-name> <etal>et al.</etal> (<year>2021</year>). <article-title>Testing by betting: A strategy for statistical and scientific communication</article-title>. <source>Journal of the Royal Statistical Society: Series A (Statistics in Society)</source> <volume>184</volume>(<issue>2</issue>) <fpage>407</fpage>–<lpage>431</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssa.12647" xlink:type="simple">https://doi.org/10.1111/rssa.12647</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4255905">MR4255905</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
