Stimatori della massima verosimiglianza per una distribuzione troncata


28

Considerare campioni indipendenti ottenuta da una variabile casuale che si presume seguire una distribuzione troncata (ad esempio un tronco distribuzione normale ) di nota (finito) valori minimo e massimo e ma di parametri ignoti e . Se seguisse una distribuzione non troncata, gli stimatori della massima verosimiglianza e per e da sarebbero la media del campioneNSXabμσ2Xμ^σ^2μσ2Sμ^=1NiSie la varianza di esempio . Tuttavia, per una distribuzione troncata, la varianza del campione definita in questo modo è limitata da quindi non è sempre uno stimatore coerente: per , non può convergere in probabilità in come va all'infinito. Quindi sembra che e non siano gli stimatori della massima verosimiglianza di e per una distribuzione troncata. Naturalmente, questo è prevedibile dal momento che eσ^2=1Ni(Siμ^)2(ba)2σ2>(ba)2σ2Nμ^σ^2μσ2μσ2 i parametri di una distribuzione normale troncata non sono la media e la varianza.

Quindi, quali sono gli stimatori della massima verosimiglianza di eμσ parameters of a truncated distribution of known minimum and maximum values?


Are you sure about your analysis? I think you are making an invalid assumption: for the truncated situation, the MLE of σ2 is no longer the sample variance (and, in general, the MLE of μ is no longer the sample mean)!
whuber

whuber: I know, this is precisely my question: what are the MLEs of σ2 and μ in the truncated case? Adding a sentence to insist on this.
a3nm

1
There isn't a closed form solution. All you can do is numerically minimize the log likelihood. But this is qualitatively no different than many other models, such as logistic regression, which also have no closed form solution.
whuber

whuber: If this is true, this is pretty disappointing. Do you have references about the lack of closed form solutions? Are there closed-form estimators that are not maximum likelihood but are at least consistent (and optionally unbiased?).
a3nm

1
@whuber: Can you at least simplify your samples into sufficient statistics so that the minimization is fast?
Neil G

Risposte:


29

Consider any location-scale family determined by a "standard" distribution F,

ΩF={F(μ,σ):xF(xμσ)σ>0}.

Assuming F differentiable we readily find that the PDFs are 1σf((xμ)/σ)dx.

Truncating these distributions to restrict their support between a and b, a<b, means that the PDFs are replaced by

f(μ,σ;a,b)(x)=f(xμσ)dxσC(μ,σ,a,b),axb

(and are zero for all other values of x) where C(μ,σ,a,b)=F(μ,σ)(b)F(μ,σ)(a) is the normalizing factor needed to ensure that f(μ,σ;a,b) integrates to unity. (Note that C is identically 1 in the absence of truncation.) The log likelihood for iid data xi therefore is

Λ(μ,σ)=i[logf(xiμσ)logσlogC(μ,σ,a,b)].

Critical points (including any global minima) are found where either σ=0 (a special case I will ignore here) or the gradient vanishes. Using subscripts to denote derivatives, we may formally compute the gradient and write the likelihood equations as

0=Λμ=i[fμ(xiμσ)f(xiμσ)Cμ(μ,σ,a,b)C(μ,σ,a,b)]0=Λσ=i[fσ(xiμσ)σ2f(xiμσ)1σCσ(μ,σ,a,b)C(μ,σ,a,b)]

Because a and b are fixed, drop them from the notation and write nCμ(μ,σ,a,b)/C(μ,σ,a,b) as A(μ,σ) and nCσ(μ,σ,a,b)/C(μ,σ,a,b) as B(μ,σ). (With no truncation, both functions would be identically zero.) Separating the terms involving the data from the rest gives

A(μ,σ)=ifμ(xiμσ)f(xiμσ)σ2B(μ,σ)nσ=ifσ(xiμσ)f(xiμσ)

By comparing these to the no-truncation situation it is evident that

  • Any sufficient statistics for the original problem are sufficient for the truncated problem (because the right hand sides have not changed).

  • Our ability to find closed-form solutions relies on the tractability of A and B. If these do not involve μ and σ in simple ways, we cannot hope to obtain closed-form solutions in general.

For the case of a normal family, C(μ,σ,a,b) of course is given by the cumulative normal PDF, which is a difference of error functions: there is no chance that a closed-form solution can be obtained in general. However, there are only two sufficient statistics (the sample mean and variance will do) and the CDF is as smooth as can be, so numerical solutions will be relatively easy to obtain.


Thanks a lot for this very detailed answer! I'm not sure I get what fμ, fσ , Cμ, and Cσ are, could you define them? Also, it's obvious but to be precise maybe you could say that your expression for the pdf is for x[a,b] (and the pdf is zero outside of that). Thanks again!
a3nm

1
The usual longer notation is Cμ=μC(μ,σ,a,b), etc: as announced, it is a derivative. I will make the second change you suggest because it's an important clarification, thanks.
whuber

Also, since your answer is more general than the one I expected, I edited my question to insist less on the case of normal distributions. Thanks again for your effort.
a3nm

1
It was easier to explain at this level of generality compared to focusing on the Normal distributions! Computing the derivatives and showing the precise form of the CDF are unnecessary distractions (although useful when you start actually coding the numerical solution).
whuber

1
Thanks for fixing! You missed one of them; could you review my edit?
a3nm
Utilizzando il nostro sito, riconosci di aver letto e compreso le nostre Informativa sui cookie e Informativa sulla privacy.
Licensed under cc by-sa 3.0 with attribution required.