<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">NEJSDS</journal-id>
<journal-title-group><journal-title>The New England Journal of Statistics in Data Science</journal-title></journal-title-group>
<issn pub-type="ppub">2693-7166</issn><issn-l>2693-7166</issn-l>
<publisher>
<publisher-name>New England Statistical Society</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">NEJSDS101</article-id>
<article-id pub-id-type="doi">10.51387/26-NEJSDS101</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Methodology Article</subject></subj-group><subj-group subj-group-type="area">
<subject>Spatial and Environmental Statistics</subject></subj-group></article-categories>
<title-group>
<article-title>Extreme Value Modeling with Generalized Pareto Distributions for Rounded Data</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Ma</surname><given-names>Sai</given-names></name><email xlink:href="mailto:sai.ma@uconn.edu">sai.ma@uconn.edu</email><xref ref-type="aff" rid="j_nejsds101_aff_001"/>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-4401-7296</contrib-id>
<name><surname>Yan</surname><given-names>Jun</given-names></name><email xlink:href="mailto:jun.yan@uconn.edu">jun.yan@uconn.edu</email><xref ref-type="aff" rid="j_nejsds101_aff_002"/><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-6551-6249</contrib-id>
<name><surname>Zhang</surname><given-names>Xuebin</given-names></name><email xlink:href="mailto:xuebinzhang23@uvic.ca">xuebinzhang23@uvic.ca</email><xref ref-type="aff" rid="j_nejsds101_aff_003"/>
</contrib>
<aff id="j_nejsds101_aff_001">Department of Statistics, <institution>University of Connecticut</institution>, Storrs, CT, <country>USA</country>. E-mail address: <email xlink:href="mailto:sai.ma@uconn.edu">sai.ma@uconn.edu</email></aff>
<aff id="j_nejsds101_aff_002">Department of Statistics, <institution>University of Connecticut</institution>, Storrs, CT, <country>USA</country>. E-mail address: <email xlink:href="mailto:jun.yan@uconn.edu">jun.yan@uconn.edu</email></aff>
<aff id="j_nejsds101_aff_003">Pacific Climate Impacts Consortium, <institution>University of Victoria, Victoria</institution>, BC, <country>Canada</country>. E-mail address: <email xlink:href="mailto:xuebinzhang23@uvic.ca">xuebinzhang23@uvic.ca</email></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2026</year></pub-date><pub-date pub-type="epub"><day>5</day><month>3</month><year>2026</year></pub-date><volume content-type="ahead-of-print">0</volume><issue>0</issue><fpage>1</fpage><lpage>12</lpage><supplementary-material id="S1" content-type="document" xlink:href="nejsds101_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>The Supplementary Material presented additional simulation results confirming MLE-IC’s robustness under extreme rounding conditions and extended power comparisons. It also provided supplementary precipitation data analysis across 18 eastern Washington stations, showing that MLE-IC consistently selected lower thresholds and yielded better model fit and return-level estimates than MLE-N.</p>
</caption>
</supplementary-material><history><date date-type="accepted"><day>3</day><month>2</month><year>2026</year></date></history>
<permissions><copyright-statement>© 2026 New England Statistical Society</copyright-statement><copyright-year>2026</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>In extreme value analysis, the impact of rounding in data, a form of quantization, on statistical inferences beyond point estimation has not been comprehensively studied. This paper addresses these challenges by considering rounded data as interval-censored. The maximum likelihood estimators of the model parameters tailored to account for interval censoring are asymptotically unbiased and efficient. Further, we adapt classic goodness-of-fit tests, such as the Anderson-Darling test, for rounded data based on the maximum likelihood estimator. The resulting tests have appropriate sizes and considerable power. One application of such tests is threshold selection for the peak over threshold approach in extreme value analysis. The efficacy of our estimation approach and the goodness-of-fit tests are demonstrated through a simulation study involving data rounded from generalized Pareto distributions. Applying this method to precipitation data from 18 stations in eastern Washington, an area with typically low precipitation and expecting a significant rounding effect, we observe narrower interval estimates of return levels.</p>
</abstract>
<kwd-group>
<label>Keywords and phrases</label>
<kwd>Discretized continuous distribution</kwd>
<kwd>Interval-censored</kwd>
<kwd>Quantized data</kwd>
<kwd>Threshold selection</kwd>
</kwd-group>
<funding-group><funding-statement>J. Yan’s research was partially supported by the NSF grant DMS 1521730.</funding-statement></funding-group>
</article-meta>
</front>
<body>
<sec id="j_nejsds101_s_001">
<label>1</label>
<title>Introduction</title>
<p>Accurate extreme value modeling is essential in fields such as climate science and risk assessment, where return level estimation informs infrastructure planning and policy decisions. However, when data are rounded (i.e., quantized to a grid), standard statistical inference can become biased, leading to unreliable conclusions [<xref ref-type="bibr" rid="j_nejsds101_ref_012">12</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_013">13</xref>]. One especially delicate area in the automated threshold selection procedure [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>], where a sequence of goodness-of-fit tests are used to assess whether exceedances above a candidate threshold follow a generalized Pareto distribution (GPD). The procedure identifies the lowest threshold at which the GPD is statistically acceptable while controlling the False Discovery Rate (FDR) [<xref ref-type="bibr" rid="j_nejsds101_ref_016">16</xref>]. Because the selected threshold directly affects parameter estimation and return-level predictions, any distortion in the testing step can propagate into serious inferential error.</p>
<p>A practical workaround is jittering, a multiple-imputation method. In each imputation, every rounded value is perturbed within its rounding interval, the resulting data is analyzed as continuous data, and the results are pooled across imputations. This method is easy to implement and can work adequately when rounding is minor. However, it has drawbacks. It may induce bias in point estimates (as our simulations show). Its interval estimates depend on combining within- and between-imputation variances [<xref ref-type="bibr" rid="j_nejsds101_ref_028">28</xref>], which many users may not employ. And for goodness-of-fit tests, aggregating statistics across imputations (e.g., via medians) has been shown not to reliably reproduce the null distribution [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>, Supplement, Figure S5]. Therefore, while jittering is a convenient workaround, it does not fully resolve the distortions caused by quantization, particularly for threshold choice and tail inference.</p>
<p>The impact of data rounding on extreme value analysis has been widely studied. Ignoring it leads to distorted sizes of goodness-of-fit tests [<xref ref-type="bibr" rid="j_nejsds101_ref_012">12</xref>] and poor performance in parameter estimation [<xref ref-type="bibr" rid="j_nejsds101_ref_013">13</xref>]. Solutions treating rounded data as interval-censored have emerged in applications dealing with rounded durations and count-based extreme value problems [<xref ref-type="bibr" rid="j_nejsds101_ref_024">24</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_027">27</xref>]. Both studies use a discretized GPD, obtained by rounding a continuous GPD [<xref ref-type="bibr" rid="j_nejsds101_ref_018">18</xref>] to integers. One applied to dry-spell analysis in days [<xref ref-type="bibr" rid="j_nejsds101_ref_024">24</xref>], while the other modeled hospital congestion events based on daily emergency room visits [<xref ref-type="bibr" rid="j_nejsds101_ref_027">27</xref>]. While these methods leverage interval censoring to account for rounding effects, they do not generalize to continuous variables rounded at finer levels, such as precipitation recorded to the nearest 0.1 or 0.01 inches. No study has systematically examined the estimation of extreme precipitation under these rounding levels, and existing approaches provide no statistically valid goodness-of-fit test that maintains correct size under rounding.</p>
<p>Statistical methods for inference under rounding have been widely studied [<xref ref-type="bibr" rid="j_nejsds101_ref_017">17</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_029">29</xref>]. A common assumption is that rounding errors follow a uniform distribution over a symmetric interval with length equal to the rounding precision and are independent of both the rounded and true values. Early work recognized that rounded data require a distinct likelihood function, treating them as interval-censored [<xref ref-type="bibr" rid="j_nejsds101_ref_020">20</xref>], and later research developed likelihood-based estimation methods for specific distributions [<xref ref-type="bibr" rid="j_nejsds101_ref_015">15</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_033">33</xref>]. For serially dependent data, where the full likelihood is intractable, composite likelihood approaches have been introduced for quantized time series models [<xref ref-type="bibr" rid="j_nejsds101_ref_005">5</xref>], with consistency and asymptotic normality established [<xref ref-type="bibr" rid="j_nejsds101_ref_036">36</xref>]. Despite these advances, the adoption of statistical methods for rounded data in extreme value modeling remains limited among practitioners, with parameter estimation and goodness-of-fit testing under rounding receiving little attention until recently. While prior work examined how symmetric and asymmetric rounding affect estimation accuracy [<xref ref-type="bibr" rid="j_nejsds101_ref_029">29</xref>], systematic investigations into valid goodness-of-fit tests and their performance remain lacking, particularly for threshold-based extreme value analysis.</p>
<p>We provide a well-tested toolset for parameter estimation and goodness-of-fit testing under rounding error. The parameters are estimated by maximizing the true likelihood of the observed rounded data, constructed using interval censoring, which coincides with the continuous-data likelihood when no rounding error is present. For goodness-of-fit tests, we treat rounded data as discrete and adapt standard tests originally designed for continuous data, with p-values obtained through parametric bootstrap. This work makes three key contributions. First, we examine bias correction in parameter estimation under realistic rounding levels in precipitation data. Second, we study valid goodness-of-fit tests under rounding, demonstrating that tests adapted from continuous data, particularly the Anderson–Darling test, have higher power. Third, we show that applying a valid goodness-of-fit test directly impacts automated threshold selection, leading to different results in extreme value analysis. In extensive simulations, the MLEs of the GPD parameters appeared unbiased, goodness-of-fit tests held their sizes, and the adapted tests had substantial power. Applying this method to threshold selection in the POT framework for 18 eastern Washington stations resulted in different selection outcomes, leading to large differences in return level estimation.</p>
<p>The rest of the paper is organized as follows. Section <xref rid="j_nejsds101_s_002">2</xref> introduces the problem setup and presents estimation based on the true likelihood. Section <xref rid="j_nejsds101_s_003">3</xref> adapts classic goodness-of-fit tests for the GPD to handle rounded data. A simulation study in Section <xref rid="j_nejsds101_s_006">4</xref> evaluates the performance of the MLE and the goodness-of-fit tests. Section <xref rid="j_nejsds101_s_011">5</xref> applies the methods to automated threshold selection in extreme value analysis for precipitation at 18 sites in eastern Washington State. Section <xref rid="j_nejsds101_s_012">6</xref> concludes with a discussion.</p>
</sec>
<sec id="j_nejsds101_s_002">
<label>2</label>
<title>Likelihood Estimation</title>
<p>In extreme value theory, threshold exceedances are asymptotically modeled by GPD under broad conditions. This fundamental result [<xref ref-type="bibr" rid="j_nejsds101_ref_026">26</xref>] provides the theoretical foundation for using the GPD in threshold-based extreme value modeling. Without loss of generality, consider a GPD with location zero and distribution function 
<disp-formula id="j_nejsds101_eq_001">
<label>(2.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfenced separators="" open="{" close="">
<mml:mrow>
<mml:mtable columnspacing="10.0pt" equalrows="false" columnlines="none" equalcolumns="false" columnalign="left left">
<mml:mtr>
<mml:mtd class="array">
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mfenced separators="" open="[" close="]">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>+</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo stretchy="false">≠</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="array">
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">exp</mml:mo>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:mo>−</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ F(x;\sigma ,\xi )=\left\{\begin{array}{l@{\hskip10.0pt}l}1-{\left[1+\frac{\xi x}{\sigma }\right]^{-1/\xi }},\hspace{1em}& \xi \ne 0;\\ {} 1-\exp \left(-\frac{x}{\sigma }\right),\hspace{1em}& \xi =0,\end{array}\right.\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_001"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi>∞</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\sigma \in [0,\infty )$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_002"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi>∞</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi>∞</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\xi \in (-\infty ,\infty )$]]></tex-math></alternatives></inline-formula> are scale and shape parameters, respectively. The support of the distribution is <inline-formula id="j_nejsds101_ineq_003"><alternatives><mml:math>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$x\ge 0$]]></tex-math></alternatives></inline-formula> if <inline-formula id="j_nejsds101_ineq_004"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo stretchy="false">≥</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\xi \ge 0$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_005"><alternatives><mml:math>
<mml:mn>0</mml:mn>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi></mml:math><tex-math><![CDATA[$0\le x\le -\sigma /\xi $]]></tex-math></alternatives></inline-formula> if <inline-formula id="j_nejsds101_ineq_006"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\xi \lt 0$]]></tex-math></alternatives></inline-formula>. Random samples <inline-formula id="j_nejsds101_ineq_007"><alternatives><mml:math>
<mml:mi mathvariant="bold">X</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{X}=({X_{1}},\dots ,{X_{n}})$]]></tex-math></alternatives></inline-formula> of size <italic>n</italic> are drawn from <inline-formula id="j_nejsds101_ineq_008"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>. Without rounding, we would observe <inline-formula id="j_nejsds101_ineq_009"><alternatives><mml:math>
<mml:mi mathvariant="bold">x</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{x}=({x_{1}},\dots ,{x_{n}})$]]></tex-math></alternatives></inline-formula>. With rounding, however, we only observe a rounded version of <bold>x</bold> subject to a known rounding level <inline-formula id="j_nejsds101_ineq_010"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\delta \gt 0$]]></tex-math></alternatives></inline-formula>. Each observed data point is rounded to the nearest multiple of <italic>δ</italic>. The observed rounded sample <inline-formula id="j_nejsds101_ineq_011"><alternatives><mml:math><mml:mstyle mathvariant="bold">
<mml:msup>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:mstyle>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\mathbf{{x^{\ast }}}=({x_{1}^{\ast }},\dots ,{x_{n}^{\ast }})$]]></tex-math></alternatives></inline-formula> satisfies <inline-formula id="j_nejsds101_ineq_012"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo fence="true" stretchy="false">⌈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo fence="true" stretchy="false">⌉</mml:mo></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}=\delta \lceil {x_{i}}/\delta -0.5\rceil $]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds101_ineq_013"><alternatives><mml:math>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">n</mml:mi></mml:math><tex-math><![CDATA[$i=1,2,\dots ,n$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds101_ineq_014"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">⌈</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo fence="true" stretchy="false">⌉</mml:mo></mml:math><tex-math><![CDATA[$\lceil t\rceil $]]></tex-math></alternatives></inline-formula> is the ceiling integer of <italic>t</italic>, that is, the least integer greater than or equal to <italic>t</italic>. Our task is to estimate <italic>σ</italic> and <italic>ξ</italic> based on the observed data <inline-formula id="j_nejsds101_ineq_015"><alternatives><mml:math><mml:mstyle mathvariant="bold">
<mml:msup>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:mstyle></mml:math><tex-math><![CDATA[$\mathbf{{x^{\ast }}}$]]></tex-math></alternatives></inline-formula>.</p>
<p>When data are subject to rounding, the observed values follow a discretized version of the underlying continuous distribution. In the context of extreme value analysis, threshold exceedances of unrounded data are modeled by GPD, and the same principle applies when data are quantized. That is, the rounded data should follow a discretized GPD [<xref ref-type="bibr" rid="j_nejsds101_ref_024">24</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_027">27</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_018">18</xref>]. This forms the foundation of our likelihood construction. However, naive approaches that ignore this fact and treat rounded data as continuous can lead to substantial bias in parameter estimation [<xref ref-type="bibr" rid="j_nejsds101_ref_013">13</xref>]. These works highlight the impact of rounding but do not provide solutions, leaving the validity of statistical inference unresolved. Our approach explicitly incorporates discretization into the likelihood function, ensuring that estimation and goodness-of-fit tests remain valid under rounding effects.</p>
<p>The contribution of observation <inline-formula id="j_nejsds101_ineq_016"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}$]]></tex-math></alternatives></inline-formula> to the likelihood is 
<disp-formula id="j_nejsds101_eq_002">
<label>(2.2)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mo movablelimits="false">Pr</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">X</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd">
<mml:mo>=</mml:mo>
<mml:mspace width="0.1667em"/>
</mml:mtd>
<mml:mtd class="align-even">
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mfenced separators="" open="[" close="]">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>−</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& \Pr ({X_{i}}\in [{({x_{i}^{\ast }}-\delta /2)^{+}},{x_{i}^{\ast }}+\delta /2))\\ {} =\hspace{0.1667em}& F\left({x_{i}^{\ast }}+\frac{\delta }{2}\right)-F\left[{\left({x_{i}^{\ast }}-\frac{\delta }{2}\right)^{+}}\right],\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_017"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mo movablelimits="false">max</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${t^{+}}=\max (0,t)$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_018"><alternatives><mml:math>
<mml:mo fence="true" stretchy="false">[</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$[{({x_{i}^{\ast }}-\delta /2)^{+}},{x_{i}^{\ast }}+\delta /2)$]]></tex-math></alternatives></inline-formula> is a half-open interval. Note that <inline-formula id="j_nejsds101_ineq_019"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}+\delta /2$]]></tex-math></alternatives></inline-formula> can be beyond the support of the distribution if <inline-formula id="j_nejsds101_ineq_020"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\xi \lt 0$]]></tex-math></alternatives></inline-formula>, but it has no influence to the MLE since <inline-formula id="j_nejsds101_ineq_021"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:math><tex-math><![CDATA[$F({x_{i}^{\ast }}+\delta /2)=1$]]></tex-math></alternatives></inline-formula> in this case. The loglikelihood function is thus 
<disp-formula id="j_nejsds101_eq_003">
<label>(2.3)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msup>
<mml:mrow>
<mml:mi>ℓ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>;</mml:mo><mml:mstyle mathvariant="bold">
<mml:msup>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:mstyle>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">n</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mfenced separators="" open="{" close="}">
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>+</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mfenced separators="" open="[" close="]">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced separators="" open="(" close=")">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>−</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mo>+</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {\ell ^{\ast }}(\sigma ,\xi ;\mathbf{{x^{\ast }}})={\sum \limits_{i=1}^{n}}\log \left\{F\left({x_{i}^{\ast }}+\frac{\delta }{2}\right)-F\left[{\left({x_{i}^{\ast }}-\frac{\delta }{2}\right)^{+}}\right]\right\}.\]]]></tex-math></alternatives>
</disp-formula> 
The maximizer of the loglikelihood (<xref rid="j_nejsds101_eq_003">2.3</xref>), <inline-formula id="j_nejsds101_ineq_022"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula> is the MLE of <inline-formula id="j_nejsds101_ineq_023"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>. As long as <inline-formula id="j_nejsds101_ineq_024"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mo>−</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\xi \gt -0.5$]]></tex-math></alternatives></inline-formula>, the MLE is asymptotically unbiased and normally distributed [<xref ref-type="bibr" rid="j_nejsds101_ref_010">10</xref>], with variance being the inverse of the Fisher information matrix [<xref ref-type="bibr" rid="j_nejsds101_ref_031">31</xref>].</p>
<p>The likelihood contribution in (<xref rid="j_nejsds101_eq_002">2.2</xref>) is the key to construct the correct likelihood based on the rounded data. A naive treatment is to ignore the rounding, pretending that the rounded data are continuous observations. In that case, the likelihood contribution of <inline-formula id="j_nejsds101_ineq_025"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}$]]></tex-math></alternatives></inline-formula> is <inline-formula id="j_nejsds101_ineq_026"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$f(x;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>, where <italic>f</italic> is the probability density function of <inline-formula id="j_nejsds101_ineq_027"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(x;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>. As <inline-formula id="j_nejsds101_ineq_028"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo stretchy="false">→</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\delta \to 0$]]></tex-math></alternatives></inline-formula>, the contribution in (<xref rid="j_nejsds101_eq_002">2.2</xref>) divided by <italic>δ</italic> converges to <inline-formula id="j_nejsds101_ineq_029"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$f({x_{i}^{\ast }};\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds101_ineq_030"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}\gt 0$]]></tex-math></alternatives></inline-formula> and to <inline-formula id="j_nejsds101_ineq_031"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$f(0;\sigma ,\xi )/2$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds101_ineq_032"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[${x_{i}^{\ast }}=0$]]></tex-math></alternatives></inline-formula>. The resulting estimator, hereafter referred to as MLE-N, is similar to the true MLE only when <italic>δ</italic> is small relative to the scale of the distribution. As will be demonstrated in the simulation study, when <italic>δ</italic> increases, the bias of the MLE-N increases while the true MLE remains unbiased. Further, for large samples, the variance of the true MLE can be reasonably well estimated by the inverse of the Fisher information matrix.</p>
</sec>
<sec id="j_nejsds101_s_003">
<label>3</label>
<title>Goodness-of-Fit Test</title>
<p>Let <inline-formula id="j_nejsds101_ineq_033"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F_{\delta }}(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> be the discretized version of <inline-formula id="j_nejsds101_ineq_034"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> with rounding level <inline-formula id="j_nejsds101_ineq_035"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\delta \gt 0$]]></tex-math></alternatives></inline-formula>. For a given parameter vector <inline-formula id="j_nejsds101_ineq_036"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_037"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F_{\delta }}(t;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> is the distribution function of a discrete random variable with support <inline-formula id="j_nejsds101_ineq_038"><alternatives><mml:math>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$t\in \{0,\delta ,2\delta ,\dots \}$]]></tex-math></alternatives></inline-formula>; at each point in the support, we have <inline-formula id="j_nejsds101_ineq_039"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">t</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${F_{\delta }}(t;\sigma ,\xi )=F(t+\delta /2;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>. The goodness-of-fit test is to test 
<disp-formula id="j_nejsds101_eq_004">
<label>(3.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>:</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mspace width="2.5pt"/>
<mml:mtext>is a random sample from distribution function</mml:mtext>
<mml:mspace width="2.5pt"/>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mphantom>
<mml:mi mathvariant="italic">H</mml:mi>
<mml:mn>0</mml:mn>
<mml:mo>:</mml:mo></mml:mphantom>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">δ</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mtext>for some</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& {H_{0}}:{\mathbf{x}^{\ast }}\hspace{2.5pt}\text{is a random sample from distribution function}\hspace{2.5pt}\\ {} & \phantom{H0:}{F_{\delta }}(\cdot |\sigma ,\xi )\hspace{2.5pt}\text{for some}\hspace{2.5pt}(\sigma ,\xi ).\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
That is, we are testing that the data come from a discretized version of a GPD distribution with unknown parameters.</p>
<sec id="j_nejsds101_s_004">
<label>3.1</label>
<title>Chi-Squared Test</title>
<p>The first test to be considered is the chi-squared (CS) test [<xref ref-type="bibr" rid="j_nejsds101_ref_025">25</xref>], which applies to both continuous and discrete data. Suppose that the data are grouped into <italic>k</italic> bins. The testing statistic is 
<disp-formula id="j_nejsds101_eq_005">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mstyle displaystyle="true">
<mml:mo largeop="true" movablelimits="false">∑</mml:mo></mml:mstyle>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:munderover><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">O</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>−</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ {\chi ^{2}}={\sum \limits_{i=1}^{k}}\frac{{({O_{i}}-{E_{i}})^{2}}}{{E_{i}}},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_040"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">O</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${O_{i}}$]]></tex-math></alternatives></inline-formula> is the observed count and <inline-formula id="j_nejsds101_ineq_041"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${E_{i}}$]]></tex-math></alternatives></inline-formula> is the expected count in the <italic>i</italic>th bin, <inline-formula id="j_nejsds101_ineq_042"><alternatives><mml:math>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">k</mml:mi></mml:math><tex-math><![CDATA[$i=1,\dots ,k$]]></tex-math></alternatives></inline-formula>. In practice, each <inline-formula id="j_nejsds101_ineq_043"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${E_{i}}$]]></tex-math></alternatives></inline-formula> is usually set to be no smaller than 5. It is critical to note that the expected count <inline-formula id="j_nejsds101_ineq_044"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">E</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${E_{i}}$]]></tex-math></alternatives></inline-formula> is calculated based on the MLE <inline-formula id="j_nejsds101_ineq_045"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula> of the parameters of the hypothesized GPD. This MLE should be the true MLE based on interval censoring instead of the MLE-N.</p>
<p>To assess the significance of the test, the null distribution of the testing statistic is needed, which is not as simple as many think. Only when the parameters are estimated by minimizing the test statistic does the test statistic under <inline-formula id="j_nejsds101_ineq_046"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${H_{0}}$]]></tex-math></alternatives></inline-formula> follows a <inline-formula id="j_nejsds101_ineq_047"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\chi _{k-p-1}^{2}}$]]></tex-math></alternatives></inline-formula> distribution [<xref ref-type="bibr" rid="j_nejsds101_ref_014">14</xref>], where <inline-formula id="j_nejsds101_ineq_048"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn></mml:math><tex-math><![CDATA[$p=2$]]></tex-math></alternatives></inline-formula> is the number of parameters in the GPD case. The MLE in general is not the same as that estimate. Therefore, the null distribution of the testing static is between <inline-formula id="j_nejsds101_ineq_049"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\chi _{k-p-1}^{2}}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_050"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>−</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${\chi _{k-1}^{2}}$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds101_ref_007">7</xref>]. We approximate the p-value of the observed statistic through parametric bootstrap, where each bootstrap sample is generated from the fitted distribution with parameters <inline-formula id="j_nejsds101_ineq_051"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds101_ref_032">32</xref>, Chapter 21]. This approach leverages distributional assumptions to create resamples, allowing for more efficient inference in cases where the assumption is appropriate and potentially improving accuracy for small samples or structured problems. See Appendix <xref rid="j_nejsds101_app_001">A</xref> for detailed steps.</p>
<p>In our numerical studies, the bins were chosen so that the fitted relative frequency for each bin is about 0.1.</p>
</sec>
<sec id="j_nejsds101_s_005">
<label>3.2</label>
<title>Tests Adapted from the Continuous Case</title>
<p>The Kolmogorov–Smirnov (KS) [<xref ref-type="bibr" rid="j_nejsds101_ref_021">21</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_030">30</xref>], Cramér–von Mieses (CvM) [<xref ref-type="bibr" rid="j_nejsds101_ref_011">11</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_035">35</xref>], and Anderson–Darling (AD) [<xref ref-type="bibr" rid="j_nejsds101_ref_001">1</xref>] tests are more powerful than the CS test for continuous data. The test statistics of these tests are obtained by comparing the empirical distribution function with a fitted parametric distribution function. The fitted parameter values should be the true MLE instead of the MLE-N. With a continuous null distribution, there is no tie in the sample, and the distribution of these test statistic does not depend on the hypothesized distribution. For a discrete hypothesized distribution, the calculation of the test statistic remains the same, but the null distribution of the test statistic does depend on the particular hypothesized distribution [<xref ref-type="bibr" rid="j_nejsds101_ref_009">9</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_008">8</xref>]. When the hypothesized discrete distribution contains no unknown parameters, these tests have been adapted [<xref ref-type="bibr" rid="j_nejsds101_ref_002">2</xref>] and implemented in popular software packages [<xref ref-type="bibr" rid="j_nejsds101_ref_003">3</xref>]. In our case, since the parameters are estimated, the p-value returned from the software cannot be used. Again, we use parametric bootstrap to approximate the p-value of each of the KS, CvM, and AD test; see Appendix <xref rid="j_nejsds101_app_001">A</xref>.</p>
<p>Since the observed data are rounded, the fitted distribution is discrete with probability mass function support points spaced at intervals of <italic>δ</italic>. Consequently, the test statistics depend on <italic>δ</italic> through the fitted distribution. However, <italic>δ</italic> does not explicitly appear in their expressions, as they are computed based on the empirical distribution of the discretized data. The limiting distributions of the test statistics, which are known for continuous data, are affected by the discrete nature of the fitted distribution. When <italic>δ</italic> is small relative to the data scale, the discretized empirical process closely approximates the continuous one, making the limiting distributions of the test statistics nearly the same as in the continuous case. However, for larger <italic>δ</italic>, deviations from the continuous limiting distributions may become more pronounced. The impact of <italic>δ</italic> on the empirical size and power of the tests is evaluated in the simulation study.</p>
<p>As to be shown in the simulation study, these tests have higher power than the CS test in this application as we conjectured.</p>
</sec>
</sec>
<sec id="j_nejsds101_s_006">
<label>4</label>
<title>Numerical Study</title>
<p>A numerical study was carried out to validate the performance of the proposed estimator and the goodness-of-fit tests. The true parameters <inline-formula id="j_nejsds101_ineq_052"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> were set to be <inline-formula id="j_nejsds101_ineq_053"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\sigma \in \{0.3,3\}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_054"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mo>−</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\xi \in \{-0.1,0,0.1\}$]]></tex-math></alternatives></inline-formula>. These settings were chosen to cover the estimated values of <italic>σ</italic> and <italic>ξ</italic> from the Eastern Washington State dataset, which are approximately 0.3 and 0.1, respectively. The rounding level <italic>δ</italic> was set to be 0, 0.01 to 0.1, where <inline-formula id="j_nejsds101_ineq_055"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\delta =0$]]></tex-math></alternatives></inline-formula> meant no rounding, and <inline-formula id="j_nejsds101_ineq_056"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.01</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\delta \in \{0.01,0.1\}$]]></tex-math></alternatives></inline-formula> match those the observed rounding levels in the dataset. In total, we had <inline-formula id="j_nejsds101_ineq_057"><alternatives><mml:math>
<mml:mn>2</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo>×</mml:mo>
<mml:mn>3</mml:mn>
<mml:mo>=</mml:mo>
<mml:mn>18</mml:mn></mml:math><tex-math><![CDATA[$2\times 3\times 3=18$]]></tex-math></alternatives></inline-formula> settings. For each setting, we generated 1000 datasets with sample size <inline-formula id="j_nejsds101_ineq_058"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>500</mml:mn></mml:math><tex-math><![CDATA[$n=500$]]></tex-math></alternatives></inline-formula>. The parametric bootstrap always has 1000 bootstrap replicates.</p>
<p>For comparison, the MLE-N obtained with the rounding ignored was also applied. In the sequel, the true MLE based on interval censoring is denoted as MLE-IC, while the naive MLE is denoted as MLE-N. The maximum likelihood estimates for both methods were obtained using the <monospace>Optim()</monospace> function in <monospace>Julia</monospace>, employing the default Nelder-Mead algorithm. [<xref ref-type="bibr" rid="j_nejsds101_ref_023">23</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_019">19</xref>]. The MLE-N used the moment estimation as the initial value while MLE-IC used the MLE-N estimation as the initial value. The estimation for all the datasets converged. As shown in Figure S5 (Q-Q plots) in Supplementary Materials of [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>], although the median AD statistic’s quantiles of jittering are much closer than no adjustment, they still deviated significantly from the 45-degree line. Since the jittering method did not have a good fit, we only include this method in the point estimation.</p>
<sec id="j_nejsds101_s_007">
<label>4.1</label>
<title>Estimation</title>
<fig id="j_nejsds101_fig_001">
<label>Figure 1</label>
<caption>
<p>Boxplots of MLE-N, MLE-IC, and jittering methods. The upper panels show the estimated scale parameters <inline-formula id="j_nejsds101_ineq_059"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\sigma }$]]></tex-math></alternatives></inline-formula> and the lower panels show the estimated shape parameters <inline-formula id="j_nejsds101_ineq_060"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\xi }$]]></tex-math></alternatives></inline-formula>.</p>
</caption>
<graphic xlink:href="nejsds101_g001.jpg"/>
</fig>
<p>Figure <xref rid="j_nejsds101_fig_001">1</xref> presents boxplots comparing the estimators from MLE-N, MLE-IC, and jittering methods for different rounding levels (<italic>δ</italic>), scaling parameters (<italic>σ</italic>), and location parameters (<italic>ξ</italic>). Notably, MLE-N and jittering methods exhibit bias when <inline-formula id="j_nejsds101_ineq_061"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi></mml:math><tex-math><![CDATA[$\delta /\sigma $]]></tex-math></alternatives></inline-formula> is large. For example, in the case of <inline-formula id="j_nejsds101_ineq_062"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\sigma =0.3$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_063"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.1$]]></tex-math></alternatives></inline-formula>, both <inline-formula id="j_nejsds101_ineq_064"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\sigma }$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_065"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\xi }$]]></tex-math></alternatives></inline-formula> from MLE-N and jittering methods show greater bias compared to MLE-IC. Furthermore, the magnitude of this bias in MLE-N and jittering methods systematically increases with increasing <italic>δ</italic>. These findings align with previous studies [<xref ref-type="bibr" rid="j_nejsds101_ref_013">13</xref>]. As expected from its asymptotic properties, MLE-IC method exhibits negligible bias across all tested <italic>δ</italic>, <italic>σ</italic>, and <italic>ξ</italic> values. This observation aligns with recent literature [<xref ref-type="bibr" rid="j_nejsds101_ref_024">24</xref>]. The supplementary materials (Figure S-1) provide an additional more extreme case with <inline-formula id="j_nejsds101_ineq_066"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )=(0.5,0.2)$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_067"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.5$]]></tex-math></alternatives></inline-formula>. In this more extreme scenario, the bias of MLE-N and jittering methods is even more pronounced, which further confirms the robustness of MLE-IC method.</p>
<p>We can also observe the mean squared errors (RMSE) in Figure <xref rid="j_nejsds101_fig_001">1</xref>. The pattern of the RMSE of the MLE-N and jittering methods in response to <italic>δ</italic>, <italic>σ</italic>, and <italic>ξ</italic> is similar to pattern of the magnitude of the bias in Figure <xref rid="j_nejsds101_fig_001">1</xref>. This is because the mean squared errors of the MLE-N and jittering methods are dominated by their biases. Conversely, the MLE-IC’s RMSE is dominated by variance, which exhibits a minimal increase with increasing <italic>δ</italic>. This is intuitive since higher rounding levels imply less information, but the increase in variance here is negligible compared to MLE-N.</p>
<fig id="j_nejsds101_fig_002">
<label>Figure 2</label>
<caption>
<p>The estimated and empirical standard error of MLE-IC. The upper panels show the standard error of estimated scale parameter <italic>σ</italic> and the lower panels show the standard error of estimated shape parameter <italic>ξ</italic>.</p>
</caption>
<graphic xlink:href="nejsds101_g002.jpg"/>
</fig>
<p>We also investigated the estimated and empirical standard errors of MLE-IC method, as shown in Figure <xref rid="j_nejsds101_fig_002">2</xref>. For all settings, the average estimated standard errors of MLE-IC closely matched the empirical standard errors. This suggests that the uncertainty associated with MLE-IC, critical for statistical inference, can be accurately estimated by inverting the Fisher information matrix for the sample size <inline-formula id="j_nejsds101_ineq_068"><alternatives><mml:math>
<mml:mi mathvariant="italic">n</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>500</mml:mn></mml:math><tex-math><![CDATA[$n=500$]]></tex-math></alternatives></inline-formula> were used in this study.</p>
</sec>
<sec id="j_nejsds101_s_008">
<label>4.2</label>
<title>Goodness-of-Fit Test</title>
<p>We assessed the size and the power of the goodness-of-fit test. The size of a test is the maximum probability of rejecting the null hypothesis when it is true. The power of a test is the probability of rejecting the null hypothesis when it is false. The 4 goodness-of-fit test methods presented in Section <xref rid="j_nejsds101_s_003">3</xref> (CS, KS, CvM, and AD) were performed on each setting in Section <xref rid="j_nejsds101_s_007">4.1</xref>. For all four tests, we also performed a version where the fitted distribution was based on MLE-N instead of the true MLE-IC. This version helps to show how seriously wrong the tests based on MLE-N can be.</p>
<sec id="j_nejsds101_s_009">
<label>4.2.1</label>
<title>Size</title>
<fig id="j_nejsds101_fig_003">
<label>Figure 3</label>
<caption>
<p>Q-Q plots of the p-values from 1000 replicates by the MLE-N method. Samples were generated by GPD with <inline-formula id="j_nejsds101_ineq_069"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )=(0.3,0.1)$]]></tex-math></alternatives></inline-formula>, and no rounding (first row), <inline-formula id="j_nejsds101_ineq_070"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.01</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.01$]]></tex-math></alternatives></inline-formula> (second row), and <inline-formula id="j_nejsds101_ineq_071"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.1$]]></tex-math></alternatives></inline-formula> (third row).</p>
</caption>
<graphic xlink:href="nejsds101_g003.jpg"/>
</fig>
<fig id="j_nejsds101_fig_004">
<label>Figure 4</label>
<caption>
<p>Q-Q plots of the p-values from 1000 replicates by the MLE-IC method. Samples were generated by GPD with <inline-formula id="j_nejsds101_ineq_072"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )=(0.3,0.1)$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_073"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.01</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.01$]]></tex-math></alternatives></inline-formula> (first row), and <inline-formula id="j_nejsds101_ineq_074"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.1$]]></tex-math></alternatives></inline-formula> (second row).</p>
</caption>
<graphic xlink:href="nejsds101_g004.jpg"/>
</fig>
<p>Figure <xref rid="j_nejsds101_fig_003">3</xref> and <xref rid="j_nejsds101_fig_004">4</xref> show the Q-Q plots of the 1000 p-values from tests for the settings with <inline-formula id="j_nejsds101_ineq_075"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\sigma ,\xi )=(0.3,0.1)$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_076"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.01</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\delta \in \{0,0.01,0.1\}$]]></tex-math></alternatives></inline-formula>. The results from other settings convey the same messages and, hence, are not shown. Since the data were indeed generated from GPD distributions before being rounded, we expect that the p-values from the 1000 replicates to be uniformly distributed over <inline-formula id="j_nejsds101_ineq_077"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(0,1)$]]></tex-math></alternatives></inline-formula> and, consequently, that the Q-Q plots of the p-values to be around the 45 degree line in the unit square. A large deviation from the 45 degree line means that the null-distribution of the test statistics being used in the test is invalid and that the test would not hold its size. The expected Q-Q plot pattern is observed for four tests — CS, KS, CvM, and AD — when the fitted distribution were evaluated with MLE-IC. This is true regardless of the rounding level <italic>δ</italic>. For the MLE-N, when there is no rounding or rounding level is 0.01, the Q-Q plots are still around the 45 degree line. However, when the rounding level is 0.1, the Q-Q plots from MLE-N have a large deviation from the 45 degree line.</p>
</sec>
<sec id="j_nejsds101_s_010">
<label>4.2.2</label>
<title>Power</title>
<p>For the four tests that performed as desired under <inline-formula id="j_nejsds101_ineq_078"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${H_{0}}$]]></tex-math></alternatives></inline-formula>, we compared their powers when the data were not generated under <inline-formula id="j_nejsds101_ineq_079"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${H_{0}}$]]></tex-math></alternatives></inline-formula>. In particular, we generated data from distributions by truncating a hybrid uniform-GPD (HUGPD), which is uniform below a threshold <italic>μ</italic> with probability <italic>p</italic> and follows a GPD with distribution <inline-formula id="j_nejsds101_ineq_080"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> above <italic>μ</italic>. The density function of this distribution is given in Equation (<xref rid="j_nejsds101_eq_006">4.1</xref>) 
<disp-formula id="j_nejsds101_eq_006">
<label>(4.1)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mfenced separators="" open="{" close="">
<mml:mrow>
<mml:mtable columnspacing="10.0pt" equalrows="false" columnlines="none" equalcolumns="false" columnalign="left left">
<mml:mtr>
<mml:mtd class="array">
<mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">μ</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">≤</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="array">
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd class="array">
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ g(x;\mu ,\sigma ,\xi )=\left\{\begin{array}{l@{\hskip10.0pt}l}\frac{p}{\mu },\hspace{1em}& 0\lt x\le \mu ;\\ {} (1-p)f(x-\mu ;\sigma ,\xi ),\hspace{1em}& x\gt \mu ,\end{array}\right.\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_081"><alternatives><mml:math>
<mml:mi mathvariant="italic">f</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$f(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> is the density of a GPD with distribution function <inline-formula id="j_nejsds101_ineq_082"><alternatives><mml:math>
<mml:mi mathvariant="italic">F</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$F(\cdot ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula> in Equation (<xref rid="j_nejsds101_eq_001">2.1</xref>). To ensure continuity at <italic>μ</italic>, only one of <italic>σ</italic> and <italic>ξ</italic> can be free for given <inline-formula id="j_nejsds101_ineq_083"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\mu ,p)$]]></tex-math></alternatives></inline-formula>; we set <inline-formula id="j_nejsds101_ineq_084"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo mathvariant="normal" stretchy="false">/</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$\sigma =p\mu /(1-p)$]]></tex-math></alternatives></inline-formula>. The density of a truncated HUGPD with truncation point <italic>a</italic> is 
<disp-formula id="j_nejsds101_eq_007">
<label>(4.2)</label><alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">a</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>·</mml:mo>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">a</mml:mi>
<mml:mo fence="true" stretchy="false">}</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">a</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ g(x|x\gt a)=\frac{g(x)\cdot I(\{x\gt a\})}{1-G(a)},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_085"><alternatives><mml:math>
<mml:mi mathvariant="italic">G</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$G(x)$]]></tex-math></alternatives></inline-formula> is the distribution function of the HUGPD and <inline-formula id="j_nejsds101_ineq_086"><alternatives><mml:math>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$I(\cdot )$]]></tex-math></alternatives></inline-formula> is an indicator function.</p>
<fig id="j_nejsds101_fig_005">
<label>Figure 5</label>
<caption>
<p>The probability density function of HUGPD <inline-formula id="j_nejsds101_ineq_087"><alternatives><mml:math>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>;</mml:mo>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$g(x-\mu ;\sigma ,\xi )$]]></tex-math></alternatives></inline-formula>, where <inline-formula id="j_nejsds101_ineq_088"><alternatives><mml:math>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\mu =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_089"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.075</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1.2</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\sigma \in \{0.075,0.3,1.2\}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_090"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\xi =0.1$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_091"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.8</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$p\in \{0.8,0.5,0.2\}$]]></tex-math></alternatives></inline-formula>.</p>
</caption>
<graphic xlink:href="nejsds101_g005.jpg"/>
</fig>
<p>In our simulation study, we set <inline-formula id="j_nejsds101_ineq_092"><alternatives><mml:math>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\mu =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_093"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\xi =0.1$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_094"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.8</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.2</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$p\in \{0.8,0.5,0.2\}$]]></tex-math></alternatives></inline-formula>, with corresponding <inline-formula id="j_nejsds101_ineq_095"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0.075</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.3</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>1.2</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\sigma \in \{0.075,0.3,1.2\}$]]></tex-math></alternatives></inline-formula>. See Figure <xref rid="j_nejsds101_fig_005">5</xref> for the density functions of the three settings. The truncation point <italic>a</italic> were selected from 0.3 to 0 with step 0.05. As the truncation point decreases, the distribution shows more and more deviation from a GPD, and the power of any reasonably good test should increase. The sample size was set as 250 or 500 and the rounding level was set to be <inline-formula id="j_nejsds101_ineq_096"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.01</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$\delta \in \{0,0.01,0.1\}$]]></tex-math></alternatives></inline-formula>. For each setting, 1000 datasets were generated. For each dataset, the four goodness-of-fit tests were applied, and <inline-formula id="j_nejsds101_ineq_097"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${H_{0}}$]]></tex-math></alternatives></inline-formula> was rejected with significance level 0.05.</p>
<fig id="j_nejsds101_fig_006">
<label>Figure 6</label>
<caption>
<p>Power of <inline-formula id="j_nejsds101_ineq_098"><alternatives><mml:math>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">a</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$g(x|x\gt a)$]]></tex-math></alternatives></inline-formula> for four tests applied to the HUGPD, with <inline-formula id="j_nejsds101_ineq_099"><alternatives><mml:math>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\mu =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_100"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\sigma =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_101"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\xi =0.1$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_102"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$p=0.5$]]></tex-math></alternatives></inline-formula>. Sample size is 250 and 500.</p>
</caption>
<graphic xlink:href="nejsds101_g006.jpg"/>
</fig>
<p>Figure <xref rid="j_nejsds101_fig_006">6</xref> shows the empirical power of <inline-formula id="j_nejsds101_ineq_103"><alternatives><mml:math>
<mml:mi mathvariant="italic">g</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo stretchy="false">|</mml:mo>
<mml:mi mathvariant="italic">x</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">a</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$g(x|x\gt a)$]]></tex-math></alternatives></inline-formula> for four statistical tests—AD, CvM, KS, and CS—applied to the HUGPD, with <inline-formula id="j_nejsds101_ineq_104"><alternatives><mml:math>
<mml:mi mathvariant="italic">μ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\mu =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_105"><alternatives><mml:math>
<mml:mi mathvariant="italic">σ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.3</mml:mn></mml:math><tex-math><![CDATA[$\sigma =0.3$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_106"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\xi =0.1$]]></tex-math></alternatives></inline-formula>, and <inline-formula id="j_nejsds101_ineq_107"><alternatives><mml:math>
<mml:mi mathvariant="italic">p</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.5</mml:mn></mml:math><tex-math><![CDATA[$p=0.5$]]></tex-math></alternatives></inline-formula>. The AD and CvM tests outperform the others, with the KS test following and the CS test showing the least power. As expected, for a constant sample size, all tests’ power increases when the truncation point moves from 0.3 to 0, indicating a greater deviation from the GPD. At a truncation point of 0.3, the powers of all tests approximate 0.05, suggesting that they maintain their nominal size. Particularly at a sample size of 500 and a truncation point of 0, the AD and CvM tests reach almost <inline-formula id="j_nejsds101_ineq_108"><alternatives><mml:math>
<mml:mn>100</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$100\% $]]></tex-math></alternatives></inline-formula> power across all rounding levels, whereas the KS test’s power drops to about <inline-formula id="j_nejsds101_ineq_109"><alternatives><mml:math>
<mml:mn>60</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$60\% $]]></tex-math></alternatives></inline-formula> at a rounding level of 0.1, and the CS test’s power remains low for all rounding levels. The power of all tests decreases as <italic>δ</italic> increases with a fixed truncation point and a sample size of 500, reflecting the expected data loss with higher <italic>δ</italic>. However, the decrease in power from <inline-formula id="j_nejsds101_ineq_110"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\delta =0$]]></tex-math></alternatives></inline-formula> to <inline-formula id="j_nejsds101_ineq_111"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mn>0.01</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mn>0.1</mml:mn></mml:math><tex-math><![CDATA[$\delta \in 0.01,0.1$]]></tex-math></alternatives></inline-formula> is minimal for the AD and CvM tests. Moreover, increasing the sample size from 250 to 500 enhances the power for all tests. In conclusion, the AD and CvM tests demonstrate superior performance in the settings of this study. Two additional plots from other settings are shown in Figure S-2 and S-3 in the Supplementary Material, where the GPD portion is the upper 20% and 80% of the mixed distribution. The AD and CvM tests consistently display the strongest sensitivity to deviations, while KS and CS tests show lower power. Doubling the sample size from 250 to 500 markedly increases power, confirming that the ranking of test strengths remains stable under more extreme scenarios.</p>
</sec>
</sec>
</sec>
<sec id="j_nejsds101_s_011">
<label>5</label>
<title>Application to Annual Maximum of Daily Precipitations</title>
<p>The proposed method was applied to model daily precipitation data from 18 monitoring stations in eastern Washington State, covering the period from 1969 to 2018. This region is significantly drier (approximately 80–85% of days with no measurable precipitation) than the western part of the state due to the “rain shadow” effect of the Cascade Mountains. Since the simulation study suggested that the difference between MLE-IC and MLE-N is more pronounced in areas with less precipitation, this dataset provides an ideal setting to evaluate the impact of properly accounting for rounding. Daily precipitation data were obtained from the Global Historical Climatology Network [<xref ref-type="bibr" rid="j_nejsds101_ref_022">22</xref>]. Given that most precipitation occurs in winter, we restricted the analysis to winter months (November through March), resulting in a total of 7512 winter days from 1969 to 2018. For each site, we tested 15 candidate thresholds, taking the 70th to 98th percentiles in increments of 2 percent. Although zero precipitation values are included in the dataset, the lowest candidate threshold at each station is greater than zero. Precipitation amounts are recorded to the nearest hundredth of an inch, meaning <inline-formula id="j_nejsds101_ineq_112"><alternatives><mml:math>
<mml:mi mathvariant="italic">δ</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0.01</mml:mn></mml:math><tex-math><![CDATA[$\delta =0.01$]]></tex-math></alternatives></inline-formula>. Missing data were ignored in the analysis. See Section 2 in the Supplementary Material for additional details of the analysis.</p>
<p>The automated threshold selection procedure [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>] relies critically on the goodness-of-fit test for the GPD at each candidate threshold. It uses the ForwardStop procedure [<xref ref-type="bibr" rid="j_nejsds101_ref_016">16</xref>] to control the FDR in the sequential testing of ordered null hypotheses; see steps in Appendix <xref rid="j_nejsds101_app_002">B</xref>. However, rounding effects were not accounted for in the test statistic, which our method resolves. We focus on the AD test, which demonstrated the highest power in the simulation study. The results of the sequential AD tests lead to three possible outcomes: (1) no threshold is selected by either MLE-IC or MLE-N; (2) a threshold is selected by MLE-IC but not by MLE-N; or (3) a threshold is selected by both methods, but MLE-IC chooses a lower threshold than MLE-N. The second and third cases highlight the advantage of MLE-IC in selecting thresholds with more exceedances, leading to higher efficiency in statistical inference.</p>
<p>To illustrate the differences from the two tests, consider two stations, one at Chewelah and the other at Ice Harbor Dam. The average yearly winter precipitation was 29.13 inches in Chewelah and 14.99 inches in Ice Harbor Dam. The total number of winter precipitation days was 2533 and 2778, respectively. Table S-2 in the Supplementary Material summarizes the candidate thresholds and the corresponding number of exceedances at the two sites. Figure S-5 in the Supplementary Material shows the p-values at the 15 candidate thresholds at the two sites before and after the ForwardStop adjustment using the AD test [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>]. At Chewelah, the tests based on MLE-N selects threshold 1.75 with 135 exceedances; the tests based on MLE-IC selects 0.25, which is the 80th percentile, as the threshold with 1596 exceedances. The number of exceedances from MLE-IC is almost 12 times of that from MLE-N. At Ice Harbor Dam, the tests based on MLE-N rejected all candidate thresholds while the tests based on MLE-IC selects threshold 0.51 with 430 exceedances. The threshold selected by the tests based on MLE-IC makes return level estimation possible.</p>
<fig id="j_nejsds101_fig_007">
<label>Figure 7</label>
<caption>
<p>GPD parameter estimates and 95% confidence intervals. The MLE-N and MLE-IC are obviously different in both point estimates and uncertainty.</p>
</caption>
<graphic xlink:href="nejsds101_g007.jpg"/>
</fig>
<p>The statistical inference results are largely affected by the threshold. Figure <xref rid="j_nejsds101_fig_007">7</xref> shows the parameter estimates <inline-formula id="j_nejsds101_ineq_113"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\sigma }$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_114"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\xi }$]]></tex-math></alternatives></inline-formula> and their <inline-formula id="j_nejsds101_ineq_115"><alternatives><mml:math>
<mml:mn>95</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$95\% $]]></tex-math></alternatives></inline-formula> confidence intervals for the two sites if a threshold was selected. The <inline-formula id="j_nejsds101_ineq_116"><alternatives><mml:math>
<mml:mn>95</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$95\% $]]></tex-math></alternatives></inline-formula> confidence intervals of <inline-formula id="j_nejsds101_ineq_117"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\sigma }$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_118"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\xi }$]]></tex-math></alternatives></inline-formula> are constructed as <inline-formula id="j_nejsds101_ineq_119"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo>±</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">z</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0.25</mml:mn>
</mml:mrow>
</mml:msub><mml:mover accent="false">
<mml:mrow>
<mml:mtext>SD</mml:mtext>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="true">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\sigma }\pm {z_{0.25}}\widehat{\text{SD}(\hat{\sigma })}$]]></tex-math></alternatives></inline-formula> and <inline-formula id="j_nejsds101_ineq_120"><alternatives><mml:math><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo>±</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">z</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0.025</mml:mn>
</mml:mrow>
</mml:msub><mml:mover accent="false">
<mml:mrow>
<mml:mtext>SD</mml:mtext>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="true">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\hat{\xi }\pm {z_{0.025}}\widehat{\text{SD}(\hat{\xi })}$]]></tex-math></alternatives></inline-formula>, respectively, where <inline-formula id="j_nejsds101_ineq_121"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">z</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0.025</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${z_{0.025}}$]]></tex-math></alternatives></inline-formula> is the <inline-formula id="j_nejsds101_ineq_122"><alternatives><mml:math>
<mml:mn>97.5</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$97.5\% $]]></tex-math></alternatives></inline-formula> upper quantile of the standard normal distribution and <inline-formula id="j_nejsds101_ineq_123"><alternatives><mml:math><mml:mover accent="false">
<mml:mrow>
<mml:mtext>SD</mml:mtext>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="true">ˆ</mml:mo></mml:mover></mml:math><tex-math><![CDATA[$\widehat{\text{SD}(\cdot )}$]]></tex-math></alternatives></inline-formula> is the standard error of estimate. At Chewelah, with a lower threshold and more exceedances, the parameters are estimated with a much narrower confidence intervals by MLE-IC than those by MLE-N. At Ice Harbor Dam, the tests based on MLE-IC shows that the tail of the annual maximum daily can be modeled by a GPD, allowing estimation of return levels which would otherwise be impossible if MLE-N were used. Figure S-7 in the Supplementary Material presents the estimated 25-, 50-, 100-, and 200-year return levels along with <inline-formula id="j_nejsds101_ineq_124"><alternatives><mml:math>
<mml:mn>95</mml:mn>
<mml:mi mathvariant="normal">%</mml:mi></mml:math><tex-math><![CDATA[$95\% $]]></tex-math></alternatives></inline-formula> confidence intervals constructed from profile likelihood for the two sites. The return level estimates from the two methods at Chewelah are very different, with the confidence interval based on MLE-N about 10 times wider than that based on MLE-IC. The return level estimates at Ice Harbor Dam are smaller than those at Chewelah as expected, with similarly narrow confidence intervals. In the Supplementary Material, the time series plots (Figure S-4 in the Supplementary Material) with thresholds display the precipitations of the two stations over time, and the Q-Q plots (Figure S-6 in the Supplementary Material) show that MLE-IC has a better fit than MLE-N.</p>
<p>We performed an analysis of the return levels for each of the 18 sites; the selected thresholds and the corresponding number of exceedances are summarized in Table S-1 in the Supplementary Material. Among all 18 stations in eastern Washington, 9 stations had thresholds selected via MLE-IC, but only 2 stations had threshold selected via MLE-N. The stations with a threshold selected via MLE-N are a subset of those via MLE-IC. At stations where both methods yielded a threshold, the numbers of exceedances from MLE-IC are much greater than those from MLE-N, which has important implications on inferences on the GPD parameters and the return levels. Although [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>] reported that the jittering method can also fix some issues, it is an ad-hoc, partial fix while the MLE-IC provides a complete and desired solution. If the same analysis were done on all the sites analyzed in [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>], we would expect that some dry sites with no threshold selected may have a threshold selected, and some sites with a high threshold selected may have a lower threshold selected. Consequently, return levels at many sites may be estimated much more accurately.</p>
</sec>
<sec id="j_nejsds101_s_012">
<label>6</label>
<title>Discussion</title>
<p>Bias in parameter estimation and over-rejection in goodness-of-fit tests have been documented as consequences of rounding error in extreme value analyses [<xref ref-type="bibr" rid="j_nejsds101_ref_012">12</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_013">13</xref>] but without satisfying solutions. Our MLE based on interval censoring and goodness-of-fit tests adapted from continuous distributions to discrete distributions provide a solid, feasible approach to the problem. The inferences based on the asymptotic normality of the MLE appear to be valid for the sample size investigated. The method has broad applications in extreme value analyses of precipitation or temperature, potentially leading to significantly different results than those obtained otherwise. When the generalized extreme value (GEV) distribution is used, or when some parameters incorporate covariates, the interval censoring framework can be applied as well. This extension may be particularly relevant in settings where block maxima are subject to rounding.</p>
<p>The correction of the method to the naive approach depends on the rounding level relative to the scale parameter. A change in the measurement unit, for example, from cm to inch, would result in the same scaling effect on the estimated scale parameter of GPD accordingly, but the estimated shape parameter and the p-value of the goodness-of-fit should remain the same. The method requires that the rounding level <italic>δ</italic> is known and can handle datasets with multiple rounding levels, such as cases where later data are recorded with higher precision. Although the AD test for quantized data has the highest power, it is much more computing intensive than the CS test, especially when <italic>δ</italic> is small relative to <italic>σ</italic> and <inline-formula id="j_nejsds101_ineq_125"><alternatives><mml:math>
<mml:mi mathvariant="italic">ξ</mml:mi>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mn>0</mml:mn></mml:math><tex-math><![CDATA[$\xi \gt 0$]]></tex-math></alternatives></inline-formula>, as this increases the number of discrete support points. A faster alternative would be of interest. Additionally, in real data applications, the annual maximum of daily precipitations may have serial dependence, which could affect the accuracy of the proposed methods. Addressing serial dependence merits further research.</p>
<p>The impact of correcting the bias with MLE-IC is greatest at locations with lower precipitation. Our illustration focused on 18 eastern Washington stations, which are known to be much drier than those to the west of the mountains. In batch studies [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>], where a large number of individual sites are analyzed one by one, we expect more pronounced differences if MLE-IC were used instead of MLE-N, in terms of the number of stations with a threshold selected, the number of exceedances, and the resulting point and interval estimates of various return levels. Since threshold selection is beyond the scope of this paper, we did not study its uncertainty. Bayesian methods could provide a natural approach for handling this [<xref ref-type="bibr" rid="j_nejsds101_ref_006">6</xref>, <xref ref-type="bibr" rid="j_nejsds101_ref_034">34</xref>].</p>
</sec>
</body>
<back>
<app-group>
<app id="j_nejsds101_app_001"><label>Appendix A</label>
<title>Goodness-of-Fit Test with Parametric Bootstrap</title>
<p>The parametric bootstrap procedure for a goodness-of-fit test with testing statistics <italic>S</italic> that depends on some estimator of the parameters are summarized as follows. 
<list>
<list-item id="j_nejsds101_li_001">
<label>1.</label>
<p>Obtain estimates of GPD parameters <inline-formula id="j_nejsds101_ineq_126"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula> from the data <inline-formula id="j_nejsds101_ineq_127"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbf{x}^{\ast }}$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_002">
<label>2.</label>
<p>Calculate test statistic <italic>S</italic> based on <inline-formula id="j_nejsds101_ineq_128"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_003">
<label>3.</label>
<p>For <inline-formula id="j_nejsds101_ineq_129"><alternatives><mml:math>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo stretchy="false">∈</mml:mo>
<mml:mo fence="true" stretchy="false">{</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mo fence="true" stretchy="false">}</mml:mo></mml:math><tex-math><![CDATA[$i\in \{1,\dots ,B\}$]]></tex-math></alternatives></inline-formula>, where <italic>B</italic> is a large number representing the number of bootstrap replicates:</p>
<list>
<list-item id="j_nejsds101_li_004">
<label>(a)</label>
<p>Generate a sample <inline-formula id="j_nejsds101_ineq_130"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbf{x}^{(i)}}$]]></tex-math></alternatives></inline-formula> of size <italic>n</italic> from a GPD with fitted parameter <inline-formula id="j_nejsds101_ineq_131"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal">,</mml:mo><mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$(\hat{\sigma },\hat{\xi })$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_005">
<label>(b)</label>
<p>Round <inline-formula id="j_nejsds101_ineq_132"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbf{x}^{(i)}}$]]></tex-math></alternatives></inline-formula> to <inline-formula id="j_nejsds101_ineq_133"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbf{x}^{(i)\ast }}$]]></tex-math></alternatives></inline-formula> with rounding level <italic>δ</italic>.</p>
</list-item>
<list-item id="j_nejsds101_li_006">
<label>(c)</label>
<p>Obtain GPD parameter <inline-formula id="j_nejsds101_ineq_134"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$({\hat{\sigma }^{(i)}},{\hat{\xi }^{(i)}})$]]></tex-math></alternatives></inline-formula> based on bootstrap sample <inline-formula id="j_nejsds101_ineq_135"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
<mml:mo>∗</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\mathbf{x}^{(i)\ast }}$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_007">
<label>(d)</label>
<p>Calculate the test statistic <inline-formula id="j_nejsds101_ineq_136"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${S^{(i)}}$]]></tex-math></alternatives></inline-formula> based on <inline-formula id="j_nejsds101_ineq_137"><alternatives><mml:math>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">σ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">ξ</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$({\hat{\sigma }^{(i)}},{\hat{\xi }^{(i)}})$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
</list>
</list-item>
<list-item id="j_nejsds101_li_008">
<label>4.</label>
<p>Approximate p-value by 
<disp-formula id="j_nejsds101_eq_008">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">ˆ</mml:mo></mml:mover>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="true">
<mml:mfrac>
<mml:mrow>
<mml:mn>0.5</mml:mn>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo largeop="false" movablelimits="false">∑</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">b</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">b</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo mathvariant="normal">&gt;</mml:mo>
<mml:mi mathvariant="italic">S</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">B</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:mo mathvariant="normal">,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[ \hat{p}=\frac{0.5+{\textstyle\textstyle\sum _{b=1}^{B}}I({S^{(b)}}\gt S)}{B+1},\]]]></tex-math></alternatives>
</disp-formula> 
where <inline-formula id="j_nejsds101_ineq_138"><alternatives><mml:math>
<mml:mi mathvariant="italic">I</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mo>·</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[$I(\cdot )$]]></tex-math></alternatives></inline-formula> the indicator function.</p>
</list-item>
</list>
</p></app>
<app id="j_nejsds101_app_002"><label>Appendix B</label>
<title>ForwardStop in Ordered Hypotheses Testing</title>
<p>ForwardStop is used to control the false discovery rate at level <italic>α</italic>. It has demonstrated resilience to various data distributions, including those with correlated p-values, which is relevant to our threshold selection process[<xref ref-type="bibr" rid="j_nejsds101_ref_016">16</xref>]. Consider a sequence of ordered hypothesis <inline-formula id="j_nejsds101_ineq_139"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${H_{0}^{(i)}}$]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_140"><alternatives><mml:math>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">L</mml:mi></mml:math><tex-math><![CDATA[$i=1,\dots ,L$]]></tex-math></alternatives></inline-formula>. They are ordered in the sense that if <inline-formula id="j_nejsds101_ineq_141"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${H_{0}^{(i)}}$]]></tex-math></alternatives></inline-formula> is rejected, then all <inline-formula id="j_nejsds101_ineq_142"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">j</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${H_{0}^{(j)}}$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds101_ineq_143"><alternatives><mml:math>
<mml:mi mathvariant="italic">j</mml:mi>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi></mml:math><tex-math><![CDATA[$j\lt i$]]></tex-math></alternatives></inline-formula> will be rejected. In the context of threshold selection with candidate thresholds <inline-formula id="j_nejsds101_ineq_144"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:mo stretchy="false">⋯</mml:mo>
<mml:mo mathvariant="normal">&lt;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${u_{1}}\lt \cdots \lt {u_{L}}$]]></tex-math></alternatives></inline-formula> [<xref ref-type="bibr" rid="j_nejsds101_ref_004">4</xref>], the sequence of null hypotheses are 
<disp-formula id="j_nejsds101_eq_009">
<alternatives><mml:math display="block">
<mml:mtable displaystyle="true" columnalign="right left" columnspacing="0pt">
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mi mathvariant="italic">i</mml:mi>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>:</mml:mo>
<mml:mspace width="2.5pt"/>
<mml:mtext>the distribution of the observations exceeding</mml:mtext>
<mml:mspace width="2.5pt"/>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mspace width="2.5pt"/>
<mml:mtext>follow</mml:mtext>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd class="align-odd"/>
<mml:mtd class="align-even">
<mml:mtext>a GPD.</mml:mtext>
</mml:mtd>
</mml:mtr>
</mml:mtable></mml:math><tex-math><![CDATA[\[\begin{aligned}{}& {H_{0}^{(i)}}:\hspace{2.5pt}\text{the distribution of the observations exceeding}\hspace{2.5pt}{u_{i}}\hspace{2.5pt}\text{follow}\\ {} & \text{a GPD.}\end{aligned}\]]]></tex-math></alternatives>
</disp-formula> 
That is, if the <inline-formula id="j_nejsds101_ineq_145"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">i</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${u_{i}}$]]></tex-math></alternatives></inline-formula> is rejected as a threshold, then any threshold lower than <inline-formula id="j_nejsds101_ineq_146"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${u_{j}}$]]></tex-math></alternatives></inline-formula> would be rejected.</p>
<p>The detailed steps of calculating ForwardStop adjusted p-values are as follows.</p>
<list>
<list-item id="j_nejsds101_li_009">
<label>1.</label>
<p>Obtain the p-values <inline-formula id="j_nejsds101_ineq_147"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${p_{1}},\dots ,{p_{L}}$]]></tex-math></alternatives></inline-formula> of the <italic>L</italic> hypotheses.</p>
</list-item>
<list-item id="j_nejsds101_li_010">
<label>2.</label>
<p>Arrange the p-values in reverse order <inline-formula id="j_nejsds101_ineq_148"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo></mml:math><tex-math><![CDATA[${p^{\prime }_{1}}={p_{L}},\dots $]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_149"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${p^{\prime }_{L}}={p_{1}}$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_011">
<label>3.</label>
<p>For <inline-formula id="j_nejsds101_ineq_150"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">L</mml:mi></mml:math><tex-math><![CDATA[$k=1,\dots ,L$]]></tex-math></alternatives></inline-formula>, calculate the corresponding <inline-formula id="j_nejsds101_ineq_151"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${q^{\prime }_{1}},\dots ,{q^{\prime }_{L}}$]]></tex-math></alternatives></inline-formula> by <inline-formula id="j_nejsds101_ineq_152"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo><mml:mstyle displaystyle="false">
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mstyle>
<mml:msubsup>
<mml:mrow>
<mml:mo largeop="false" movablelimits="false">∑</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>−</mml:mo>
<mml:mo movablelimits="false">log</mml:mo>
<mml:mo mathvariant="normal" fence="true" stretchy="false">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>−</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal" fence="true" stretchy="false">)</mml:mo></mml:math><tex-math><![CDATA[${q_{k}}=\frac{1}{k}{\textstyle\sum _{j=1}^{k}}-\log (1-{p^{\prime }_{k}})$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
<list-item id="j_nejsds101_li_012">
<label>4.</label>
<p>Arrange the p-values <inline-formula id="j_nejsds101_ineq_153"><alternatives><mml:math>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${q^{\prime }_{1}},\dots ,{q^{\prime }_{L}}$]]></tex-math></alternatives></inline-formula> in reverse order to <inline-formula id="j_nejsds101_ineq_154"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo></mml:math><tex-math><![CDATA[${q_{1}}={q^{\prime }_{L}},\dots $]]></tex-math></alternatives></inline-formula>, <inline-formula id="j_nejsds101_ineq_155"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">L</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>′</mml:mo>
</mml:mrow>
</mml:msubsup></mml:math><tex-math><![CDATA[${q_{L}}={q^{\prime }_{1}}$]]></tex-math></alternatives></inline-formula>, and then <inline-formula id="j_nejsds101_ineq_156"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${q_{k}}$]]></tex-math></alternatives></inline-formula> is the corresponding ForwardStop adjusted p-value of <inline-formula id="j_nejsds101_ineq_157"><alternatives><mml:math>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="italic">p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi mathvariant="italic">k</mml:mi>
</mml:mrow>
</mml:msub></mml:math><tex-math><![CDATA[${p_{k}}$]]></tex-math></alternatives></inline-formula> for <inline-formula id="j_nejsds101_ineq_158"><alternatives><mml:math>
<mml:mi mathvariant="italic">k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mo>…</mml:mo>
<mml:mo mathvariant="normal">,</mml:mo>
<mml:mi mathvariant="italic">L</mml:mi></mml:math><tex-math><![CDATA[$k=1,\dots ,L$]]></tex-math></alternatives></inline-formula>.</p>
</list-item>
</list>
</app></app-group>
<ref-list id="j_nejsds101_reflist_001">
<title>References</title>
<ref id="j_nejsds101_ref_001">
<label>[1]</label><mixed-citation publication-type="journal"><string-name><surname>Anderson</surname>, <given-names>T. W.</given-names></string-name> and <string-name><surname>Darling</surname>, <given-names>D. A.</given-names></string-name> (<year>1952</year>). <article-title>Asymptotic Theory of Certain “Goodness of Fit” Criteria Based on Stochastic Processes</article-title>. <source>The Annals of Mathematical Statistics</source> <volume>23</volume>(<issue>2</issue>) <fpage>193</fpage>–<lpage>212</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aoms/1177729437" xlink:type="simple">https://doi.org/10.1214/aoms/1177729437</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0050238">MR0050238</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_002">
<label>[2]</label><mixed-citation publication-type="journal"><string-name><surname>Arnold</surname>, <given-names>T. B.</given-names></string-name> and <string-name><surname>Emerson</surname>, <given-names>J. W.</given-names></string-name> (<year>2011</year>). <article-title>Nonparametric Goodness-of-fit Tests for Discrete Null Distributions</article-title>. <source>R Journal</source> <volume>3</volume>(<issue>2</issue>) <fpage>34</fpage>–<lpage>39</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_003">
<label>[3]</label><mixed-citation publication-type="other"><string-name><surname>Arnold</surname>, <given-names>T. B.</given-names></string-name> and <string-name><surname>Emerson</surname>, <given-names>J. W.</given-names></string-name> (2013). dgof: Discrete Goodness-of-Fit Tests. R package version 1.2. <uri>https://CRAN.R-project.org/package=dgof</uri>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_004">
<label>[4]</label><mixed-citation publication-type="journal"><string-name><surname>Bader</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Yan</surname>, <given-names>J.</given-names></string-name> and <string-name><surname>Zhang</surname>, <given-names>X.</given-names></string-name> (<year>2018</year>). <article-title>Automated Threshold Selection for Extreme Value Analysis via Ordered Goodness-of-fit Tests with Adjustment for False Discovery Rate</article-title>. <source>The Annals of Applied Statistics</source> <volume>12</volume>(<issue>1</issue>) <fpage>310</fpage>–<lpage>329</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/17-AOAS1092" xlink:type="simple">https://doi.org/10.1214/17-AOAS1092</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3773395">MR3773395</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_005">
<label>[5]</label><mixed-citation publication-type="journal"><string-name><surname>Bai</surname>, <given-names>Z.</given-names></string-name>, <string-name><surname>Zheng</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Zhang</surname>, <given-names>B.</given-names></string-name> and <string-name><surname>Hu</surname>, <given-names>G.</given-names></string-name> (<year>2009</year>). <article-title>Statistical Analysis for Rounded Data</article-title>. <source>Journal of Statistical Planning and Inference</source> <volume>139</volume>(<issue>8</issue>) <fpage>2526</fpage>–<lpage>2542</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.jspi.2008.11.018" xlink:type="simple">https://doi.org/10.1016/j.jspi.2008.11.018</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2523645">MR2523645</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_006">
<label>[6]</label><mixed-citation publication-type="journal"><string-name><surname>Behrens</surname>, <given-names>C. N.</given-names></string-name>, <string-name><surname>Lopes</surname>, <given-names>H. F.</given-names></string-name> and <string-name><surname>Gamerman</surname>, <given-names>D.</given-names></string-name> (<year>2004</year>). <article-title>Bayesian Analysis of Extreme Events with Threshold Estimation</article-title>. <source>Statistical Modelling</source> <volume>4</volume>(<issue>3</issue>) <fpage>227</fpage>–<lpage>244</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1191/1471082X04st075oa" xlink:type="simple">https://doi.org/10.1191/1471082X04st075oa</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2062102">MR2062102</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_007">
<label>[7]</label><mixed-citation publication-type="journal"><string-name><surname>Chernoff</surname>, <given-names>H.</given-names></string-name> and <string-name><surname>Lehmann</surname>, <given-names>E.</given-names></string-name> (<year>1954</year>). <article-title>The Use of Maximum Likelihood Estimates in <inline-formula id="j_nejsds101_ineq_159"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\chi ^{2}}$]]></tex-math></alternatives></inline-formula> Tests for Goodness of Fit</article-title>. <source>The Annals of Mathematical Statistics</source> <volume>25</volume>(<issue>3</issue>) <fpage>579</fpage>–<lpage>586</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1214/aoms/1177728726" xlink:type="simple">https://doi.org/10.1214/aoms/1177728726</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0065109">MR0065109</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_008">
<label>[8]</label><mixed-citation publication-type="journal"><string-name><surname>Choulakian</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Lockhart</surname>, <given-names>R. A.</given-names></string-name> and <string-name><surname>Stephens</surname>, <given-names>M. A.</given-names></string-name> (<year>1994</year>). <article-title>Cramér-von Mises Statistics for Discrete Distributions</article-title>. <source>Canadian Journal of Statistics</source> <volume>22</volume>(<issue>1</issue>) <fpage>125</fpage>–<lpage>137</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2307/3315828" xlink:type="simple">https://doi.org/10.2307/3315828</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1271450">MR1271450</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_009">
<label>[9]</label><mixed-citation publication-type="journal"><string-name><surname>Conover</surname>, <given-names>W. J.</given-names></string-name> (<year>1972</year>). <article-title>A Kolmogorov Goodness-of-fit Test for Discontinuous Distributions</article-title>. <source>Journal of the American Statistical Association</source> <volume>67</volume>(<issue>339</issue>) <fpage>591</fpage>–<lpage>596</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0391375">MR0391375</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_010">
<label>[10]</label><mixed-citation publication-type="journal"><string-name><surname>Cramér</surname>, <given-names>H.</given-names></string-name> (<year>1946</year>). <article-title>A Contribution to The Theory of Statistical Estimation</article-title>. <source>Scandinavian Actuarial Journal</source> <volume>1946</volume>(<issue>1</issue>) <fpage>85</fpage>–<lpage>94</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/03461238.1946.10419631" xlink:type="simple">https://doi.org/10.1080/03461238.1946.10419631</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0017505">MR0017505</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_011">
<label>[11]</label><mixed-citation publication-type="journal"><string-name><surname>Cramér</surname>, <given-names>H.</given-names></string-name> (<year>1928</year>). <article-title>On the Composition of Elementary Errors</article-title>. <source>Scandinavian Actuarial Journal</source> <volume>1928</volume>(<issue>1</issue>) <fpage>13</fpage>–<lpage>74</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_012">
<label>[12]</label><mixed-citation publication-type="journal"><string-name><surname>Deidda</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Puliga</surname>, <given-names>M.</given-names></string-name> (<year>2006</year>). <article-title>Sensitivity of Goodness-of-fit Statistics to Rainfall Data Rounding off</article-title>. <source>Physics and Chemistry of the Earth, Parts A/B/C</source> <volume>31</volume>(<issue>18</issue>) <fpage>1240</fpage>–<lpage>1251</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_013">
<label>[13]</label><mixed-citation publication-type="journal"><string-name><surname>Deidda</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Puliga</surname>, <given-names>M.</given-names></string-name> (<year>2009</year>). <article-title>Performances of Some Parameter Estimators of the Generalized Pareto Distribution over Rounded-off Samples</article-title>. <source>Physics and Chemistry of the Earth, Parts A/B/C</source> <volume>34</volume>(<issue>10–12</issue>) <fpage>626</fpage>–<lpage>634</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_014">
<label>[14]</label><mixed-citation publication-type="journal"><string-name><surname>Fisher</surname>, <given-names>R. A.</given-names></string-name> (<year>1922</year>). <article-title>On the Interpretation of <inline-formula id="j_nejsds101_ineq_160"><alternatives><mml:math>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="italic">χ</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup></mml:math><tex-math><![CDATA[${\chi ^{2}}$]]></tex-math></alternatives></inline-formula> from Contingency Tables, and the Calculation of P</article-title>. <source>Journal of the Royal Statistical Society</source> <volume>85</volume>(<issue>1</issue>) <fpage>87</fpage>–<lpage>94</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_015">
<label>[15]</label><mixed-citation publication-type="journal"><string-name><surname>Giesbrecht</surname>, <given-names>F.</given-names></string-name> and <string-name><surname>Kempthorne</surname>, <given-names>O.</given-names></string-name> (<year>1976</year>). <article-title>Maximum Likelihood Estimation in the Three-parameter Lognormal Distribution</article-title>. <source>Journal of the Royal Statistical Society: Series B (Methodological)</source> <volume>38</volume>(<issue>3</issue>) <fpage>257</fpage>–<lpage>264</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0652563">MR0652563</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_016">
<label>[16]</label><mixed-citation publication-type="journal"><string-name><surname>G’Sell</surname>, <given-names>M. G.</given-names></string-name>, <string-name><surname>Wager</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Chouldechova</surname>, <given-names>A.</given-names></string-name> and <string-name><surname>Tibshirani</surname>, <given-names>R.</given-names></string-name> (<year>2016</year>). <article-title>Sequential Selection Procedures and False Discovery Rate Control</article-title>. <source>Journal of the Royal Statistical Society: Series B (Statistical Methodology)</source> <volume>78</volume>(<issue>2</issue>) <fpage>423</fpage>–<lpage>444</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssb.12122" xlink:type="simple">https://doi.org/10.1111/rssb.12122</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3454203">MR3454203</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_017">
<label>[17]</label><mixed-citation publication-type="journal"><string-name><surname>Heitjan</surname>, <given-names>D. F.</given-names></string-name> (<year>1989</year>). <article-title>Inference from Grouped Continuous Data: A Review</article-title>. <source>Statistical Science</source> <volume>4</volume> <fpage>164</fpage>–<lpage>179</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_018">
<label>[18]</label><mixed-citation publication-type="journal"><string-name><surname>Hitz</surname>, <given-names>A. S.</given-names></string-name>, <string-name><surname>Davis</surname>, <given-names>R. A.</given-names></string-name> and <string-name><surname>Samorodnitsky</surname>, <given-names>G.</given-names></string-name> (<year>2024</year>). <article-title>Discrete Extremes</article-title>. <source>Journal of Data Science</source> <volume>22</volume>(<issue>4</issue>) <fpage>524</fpage>–<lpage>536</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_019">
<label>[19]</label><mixed-citation publication-type="journal"><string-name><surname>Jalbert</surname>, <given-names>J.</given-names></string-name>, <string-name><surname>Farmer</surname>, <given-names>M.</given-names></string-name>, <string-name><surname>Gobeil</surname>, <given-names>G.</given-names></string-name> and <string-name><surname>Roy</surname>, <given-names>P.</given-names></string-name> (<year>2024</year>). <article-title>Extremes.jl: Extreme Value Analysis in Julia</article-title>. <source>Journal of Statistical Software</source> <volume>109</volume>(<issue>6</issue>) <fpage>1</fpage>–<lpage>35</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18637/jss.v109.i06" xlink:type="simple">https://doi.org/10.18637/jss.v109.i06</ext-link>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_020">
<label>[20]</label><mixed-citation publication-type="journal"><string-name><surname>Kempthorne</surname>, <given-names>O.</given-names></string-name> (<year>1966</year>). <article-title>Some Aspects of Experimental Inference</article-title>. <source>Journal of the American Statistical Association</source> <volume>61</volume>(<issue>313</issue>) <fpage>11</fpage>–<lpage>34</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0195211">MR0195211</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_021">
<label>[21]</label><mixed-citation publication-type="journal"><string-name><surname>Kolmogorov</surname>, <given-names>A.</given-names></string-name> (<year>1933</year>). <article-title>Sulla Determinazione Empirica di una Legge di Distribuzione</article-title>. <source>Giornale dell’Istituto Italiano degli Attuari</source> <volume>4</volume> <fpage>83</fpage>–<lpage>91</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_022">
<label>[22]</label><mixed-citation publication-type="journal"><string-name><surname>Menne</surname>, <given-names>M. J.</given-names></string-name>, <string-name><surname>Durre</surname>, <given-names>I.</given-names></string-name>, <string-name><surname>Vose</surname>, <given-names>R. S.</given-names></string-name>, <string-name><surname>Gleason</surname>, <given-names>B. E.</given-names></string-name> and <string-name><surname>Houston</surname>, <given-names>T. G.</given-names></string-name> (<year>2012</year>). <article-title>An Overview of the Global Historical Climatology Network-daily Database</article-title>. <source>Journal of Atmospheric and Oceanic Technology</source> <volume>29</volume>(<issue>7</issue>) <fpage>897</fpage>–<lpage>910</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_023">
<label>[23]</label><mixed-citation publication-type="journal"><string-name><surname>Mogensen</surname>, <given-names>P.</given-names></string-name> and <string-name><surname>Riseth</surname>, <given-names>A.</given-names></string-name> (<year>2018</year>). <article-title>Optim: A Mathematical Optimization Package For Julia</article-title>. <source>Journal of Open Source Software</source> <volume>3</volume>(<issue>24</issue>).</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_024">
<label>[24]</label><mixed-citation publication-type="journal"><string-name><surname>Pasari</surname>, <given-names>Z.</given-names></string-name> and <string-name><surname>Cindri</surname>, <given-names>K.</given-names></string-name> (<year>2019</year>). <article-title>Generalised Pareto Distribution: Impact of Rounding on Parameter Estimation</article-title>. <source>Theoretical and Applied Climatology</source> <volume>136</volume>(<issue>1</issue>) <fpage>417</fpage>–<lpage>427</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_025">
<label>[25]</label><mixed-citation publication-type="journal"><string-name><surname>Pearson</surname>, <given-names>K.</given-names></string-name> (<year>1900</year>). <article-title>On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is such that it can be Reasonably Supposed to Have Arisen from Random Sampling</article-title>. <source>The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science</source> <volume>50</volume>(<issue>302</issue>) <fpage>157</fpage>–<lpage>175</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_026">
<label>[26]</label><mixed-citation publication-type="journal"><string-name><surname>Pickands</surname>, <given-names>J.</given-names></string-name> (<year>1975</year>). <article-title>Statistical Inference Using Extreme Order Statistics</article-title>. <source>The Annals of Statistics</source> <volume>3</volume>(<issue>1</issue>) <fpage>119</fpage>–<lpage>131</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0423667">MR0423667</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_027">
<label>[27]</label><mixed-citation publication-type="journal"><string-name><surname>Ranjbar</surname>, <given-names>S.</given-names></string-name>, <string-name><surname>Cantoni</surname>, <given-names>E.</given-names></string-name>, <string-name><surname>Chavez-Demoulin</surname>, <given-names>V.</given-names></string-name>, <string-name><surname>Marra</surname>, <given-names>G.</given-names></string-name>, <string-name><surname>Radice</surname>, <given-names>R.</given-names></string-name> and <string-name><surname>Jaton-Ogay</surname>, <given-names>K.</given-names></string-name> (<year>2022</year>). <article-title>Modelling the Extremes of Seasonal Viruses and Hospital Congestion: The Example of Flu in a Swiss Hospital</article-title>. <source>Journal of the Royal Statistical Society: Series C (Applied Statistics)</source> <volume>71</volume>(<issue>4</issue>) <fpage>884</fpage>–<lpage>905</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1111/rssc.12559" xlink:type="simple">https://doi.org/10.1111/rssc.12559</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=4470824">MR4470824</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_028">
<label>[28]</label><mixed-citation publication-type="book"><string-name><surname>Rubin</surname>, <given-names>D. B.</given-names></string-name> (<year>2004</year>) <source>Multiple Imputation for Nonresponse in Surveys</source> <volume>81</volume>. <publisher-name>John Wiley &amp; Sons</publisher-name>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2117498">MR2117498</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_029">
<label>[29]</label><mixed-citation publication-type="journal"><string-name><surname>SchneeweiSS</surname>, <given-names>H.</given-names></string-name>, <string-name><surname>Komlos</surname>, <given-names>J.</given-names></string-name> and <string-name><surname>Ahmad</surname>, <given-names>A. S.</given-names></string-name> (<year>2010</year>). <article-title>Symmetric and Asymmetric Rounding: A Review and Some New Results</article-title>. <source>AStA Advances in Statistical Analysis</source> <volume>94</volume>(<issue>3</issue>) <fpage>247</fpage>–<lpage>271</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s10182-010-0125-2" xlink:type="simple">https://doi.org/10.1007/s10182-010-0125-2</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2733174">MR2733174</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_030">
<label>[30]</label><mixed-citation publication-type="journal"><string-name><surname>Smirnov</surname>, <given-names>N.</given-names></string-name> (<year>1939</year>). <article-title>On the Estimation of the Discrepancy Between Empirical Curves of Distribution for Two Independent Samples</article-title>. <source>Bulletin Mathématique de l’Université de Moscou</source> <volume>2</volume> <fpage>3</fpage>–<lpage>14</lpage>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0002062">MR0002062</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_031">
<label>[31]</label><mixed-citation publication-type="journal"><string-name><surname>Smith</surname>, <given-names>R. L.</given-names></string-name> (<year>1985</year>). <article-title>Maximum Likelihood Estimation in a Class of Nonregular Cases</article-title>. <source>Biometrika</source> <volume>72</volume>(<issue>1</issue>) <fpage>67</fpage>–<lpage>90</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1093/biomet/72.1.67" xlink:type="simple">https://doi.org/10.1093/biomet/72.1.67</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=0790201">MR0790201</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_032">
<label>[32]</label><mixed-citation publication-type="book"><string-name><surname>Tibshirani</surname>, <given-names>R. J.</given-names></string-name> and <string-name><surname>Efron</surname>, <given-names>B.</given-names></string-name> (<year>1993</year>) <source>An Introduction to the Bootstrap</source>. <publisher-name>Chapman &amp; Hall</publisher-name>, <publisher-loc>New York</publisher-loc>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/978-1-4899-4541-9" xlink:type="simple">https://doi.org/10.1007/978-1-4899-4541-9</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=1270903">MR1270903</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_033">
<label>[33]</label><mixed-citation publication-type="journal"><string-name><surname>Vardeman</surname>, <given-names>S. B.</given-names></string-name> and <string-name><surname>Lee</surname>, <given-names>C. q. q. S.</given-names></string-name> (<year>2005</year>). <article-title>Likelihood-based Statistical Estimation from Quantized Data</article-title>. <source>IEEE Transactions on Instrumentation and Measurement</source> <volume>54</volume>(<issue>1</issue>) <fpage>409</fpage>–<lpage>414</lpage>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_034">
<label>[34]</label><mixed-citation publication-type="journal"><string-name><surname>Villa</surname>, <given-names>C.</given-names></string-name> (<year>2017</year>). <article-title>Bayesian Estimation of the Threshold of a Generalised Pareto Distribution for Heavy-tailed Observations</article-title>. <source>Test</source> <volume>26</volume>(<issue>1</issue>) <fpage>95</fpage>–<lpage>118</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s11749-016-0501-7" xlink:type="simple">https://doi.org/10.1007/s11749-016-0501-7</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=3613607">MR3613607</ext-link></mixed-citation>
</ref>
<ref id="j_nejsds101_ref_035">
<label>[35]</label><mixed-citation publication-type="book"><string-name><surname>von Mises</surname>, <given-names>R.</given-names></string-name> (<year>1931</year>) <source>Wahrscheinlichkeitsrechnung und ihre Anwendung in der Statistik und Theoretischen Physik</source>. <publisher-name>Deuticke</publisher-name>, <publisher-loc>Leipzig und Wien</publisher-loc>.</mixed-citation>
</ref>
<ref id="j_nejsds101_ref_036">
<label>[36]</label><mixed-citation publication-type="journal"><string-name><surname>Zhang</surname>, <given-names>B.</given-names></string-name>, <string-name><surname>Liu</surname>, <given-names>T.</given-names></string-name> and <string-name><surname>Bai</surname>, <given-names>Z.</given-names></string-name> (<year>2010</year>). <article-title>Analysis of Rounded Data from Dependent Sequences</article-title>. <source>Annals of the Institute of Statistical Mathematics</source> <volume>62</volume>(<issue>6</issue>) <fpage>1143</fpage>–<lpage>1173</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s10463-009-0224-6" xlink:type="simple">https://doi.org/10.1007/s10463-009-0224-6</ext-link>. <ext-link ext-link-type="uri" xlink:href="https://mathscinet.ams.org/mathscinet-getitem?mr=2729157">MR2729157</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
