Consistent estimation of residual variance with random forest Out-Of-Bag errors

Publication date: Available online 1 April 2019
Source: Statistics & Probability Letters
Author(s): Burim Ramosaj, Markus Pauly

Abstract

The issue of estimating residual variance in regression models with unknown and eventually complex link-function is still an open problem. Predictions of such outcomes are usually conducted by black-box procedures without seriously restricting the link-function class. However, quantifying uncertainty by means of residual variance estimators is of primary interest in many practical applications, e.g. as a primary step towards the construction of prediction intervals. Here, we consider this issue for the random forest. Therein, the functional relationship between covariates and response variable is modeled by a weighted sum of the latter. The dependence structure is, however, involved in the weights that are constructed during the tree construction process making the model complex in mathematical analysis. Restricting to L2-consistent random forest models, we provide random forest based residual variance estimators and prove their consistency.
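
The abstract does not spell out the estimators themselves, so the following is only a minimal sketch of the general idea of an Out-Of-Bag residual variance estimate, under stated assumptions. It uses bagged regression trees on a single covariate as a simplified stand-in for a random forest (no feature subsampling), and it takes the plain mean of squared Out-Of-Bag residuals as the estimate. The function names `fit_tree`, `predict_tree`, and `oob_residual_variance` are hypothetical and are not taken from the paper.

```python
import math
import random


def fit_tree(xs, ys, min_leaf=5):
    """Fit a 1-D regression tree; a leaf is a float, a node is (threshold, left, right)."""
    n = len(xs)
    mean = sum(ys) / n
    if n < 2 * min_leaf:
        return mean
    order = sorted(range(n), key=lambda i: xs[i])
    sx = [xs[i] for i in order]
    sy = [ys[i] for i in order]
    total = sum(sy)
    best_score, best_k = None, None
    left_sum = 0.0
    for k in range(1, n):
        left_sum += sy[k - 1]
        # Skip splits that leave a too-small child or fall between tied x values.
        if k < min_leaf or n - k < min_leaf or sx[k - 1] == sx[k]:
            continue
        # Maximizing this score is equivalent to minimizing the squared error of the split.
        score = left_sum ** 2 / k + (total - left_sum) ** 2 / (n - k)
        if best_score is None or score > best_score:
            best_score, best_k = score, k
    if best_k is None:
        return mean
    threshold = (sx[best_k - 1] + sx[best_k]) / 2
    left = fit_tree(sx[:best_k], sy[:best_k], min_leaf)
    right = fit_tree(sx[best_k:], sy[best_k:], min_leaf)
    return (threshold, left, right)


def predict_tree(tree, x):
    """Walk down the tree until a leaf value is reached."""
    while isinstance(tree, tuple):
        threshold, left, right = tree
        tree = left if x <= threshold else right
    return tree


def oob_residual_variance(xs, ys, n_trees=100, min_leaf=5, seed=0):
    """Mean squared Out-Of-Bag residual of a bagged tree ensemble (assumed estimator)."""
    rng = random.Random(seed)
    n = len(xs)
    pred_sum = [0.0] * n
    pred_cnt = [0] * n
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap sample
        in_bag = set(idx)
        tree = fit_tree([xs[i] for i in idx], [ys[i] for i in idx], min_leaf)
        # Each tree only predicts the observations it did not see during fitting.
        for i in range(n):
            if i not in in_bag:
                pred_sum[i] += predict_tree(tree, xs[i])
                pred_cnt[i] += 1
    resid_sq = [
        (ys[i] - pred_sum[i] / pred_cnt[i]) ** 2
        for i in range(n)
        if pred_cnt[i] > 0
    ]
    if not resid_sq:
        raise ValueError("no observation was ever out of bag; use more trees")
    return sum(resid_sq) / len(resid_sq)


# Simulated example: y = sin(2*pi*x) + noise with true residual variance 0.25.
demo_rng = random.Random(42)
demo_x = [demo_rng.random() for _ in range(300)]
demo_y = [math.sin(2 * math.pi * x) + demo_rng.gauss(0, 0.5) for x in demo_x]
print(oob_residual_variance(demo_x, demo_y, n_trees=60))
```

Because each Out-Of-Bag residual contains both the noise term and the ensemble's own prediction error, this simple average tends to sit somewhat above the true residual variance in finite samples; the gap shrinks only if the ensemble's prediction error vanishes, which is where the L2-consistency assumption in the abstract comes in.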