Re: Beta distribution



"doherjo1@xxxxxxxxx" <doherjo1@xxxxxxxxx> wrote in
news:1193840038.646278.316380@xxxxxxxxxxxxxxxxxxxxxxxxxxx:

How do I approach regression if my raw data exhibits the profile of a
beta or gamma distribution. For example, if I have sales data that is
bounded on the left tail at zero and has a heavy right tail. I
usually use a log-transformation, but is there an approach using OLSR
that incorporates these distribution vs. the normal?

If you are making decisions based on the "raw data" then you missed the key
point in class about the normality assumption applying only to the
residuals, ... rather than to the marginal distribution of the dependent
variable. You should do the analysis without any transformation (ordinary
least squares) and then examine the residuals. Only then would you have a
basis for choosing another analysis. The estimates will be much easier to
interpret if you avoid unnecessary transformations.

The estimates from the OLS method will still be unbiased, but there may be
errors in the inferential tests (confidence intervals or F tests) that need
to be assessed by other methods. Most of the analytic/inferential results
will be resistant to skewness (in the residuals), especially if they can
reasonably be expected to be only right-tailed. The usual statement is that
transformations (log or rank) may be needed to avoid inferential errors
brought on by heteroscedasticity. (Neither sort of transformation will help
if some regions of the predictor space have left skewed and other right
skewed dependent variables.)

--
David Winsemius
.



Relevant Pages

  • Re: Gumbel curve fitting
    ... > I am modelling the extreme tail of a distribution, ... > the fit is very poor. ... > If I use a method by fitting the distribution at the tail say using ...
    (sci.stat.math)
  • Re: test of normality
    ... I am not an expert statistician, but I NEVER made a mistake alike. ... skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable. ... a distribution has positive skew if the higher tail is longer and negative skew if the lower tail is longer (confusing the two is a common ...
    (sci.stat.math)
  • Re: Area under the curve transformation
    ... > analysis using this this transformation and still could not understand ... the AUC is referred to cumulative distribution function ... Using the CDF is one way to create random numbers ... 'raw' scores are then transformed to the T-scores by using ...
    (sci.stat.math)
  • Re: What is this rv?
    ... is the distribution of Z. ... "transformation of vars" as the transformation is not specified. ... makes X and Y normal, non-zero mean, stat indep rvs (the Tau is what ... at the pdfs they have derived, the true phase and the true phase + pi ...
    (sci.stat.edu)
  • Re: Linear Regression - Assessment of predictor importance
    ... When I compare the range or ... variability of my parameters, I can only do that when all parameters ... the choice of 'transformation' or metric ... spread out the bulk of *one* distribution, but not the other, ...
    (sci.stat.consult)