Re: Approximate solution to linear regression



On 17 Jun., 21:46, "vincen...@xxxxxxxxx" <datashap...@xxxxxxxxx>
wrote:
Problem can have 40,000 variables, most of them highly correlated.
More variables than observations in some cases.


I haven't looked over your solution in great detail, but I would
suggest that:

1) Group your variables into clusters, based on their correlations
2) Construct an ensemble of regression models, each based on just one
exemplar from each cluster
3) Weight each model conservatively (because you have so many
variables); e.g. equal weighting.

Best regards,

Stefan W. Christensen

.



Relevant Pages

  • Re: Approximate solution to linear regression
    ... Group your variables into clusters, ... Construct an ensemble of regression models, ... variables); e.g. equal weighting. ... published examples of the validity of the procedure, ...
    (sci.stat.consult)
  • Re: Approximate solution to linear regression
    ... Construct an ensemble of regression models, ... variables); e.g. equal weighting. ... and where you would have to defend the validity of the procedure, ... ever have of its correctness (do you have faith in the correctness of ...
    (sci.stat.consult)