Stacey Ronaghan
1 min read · Oct 17, 2019

Hi Dylan Smith,

For Random Forest regression models, feature importance values are computed from variance reduction (both Spark and scikit-learn measure this using mean squared error).
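To make the variance-reduction idea concrete, here is a minimal sketch in plain Python. It is not the Spark or scikit-learn implementation; the function names `mse` and `variance_reduction` and the toy values are made up for illustration. It scores a single split by how much the weighted mean squared error of the child nodes drops compared with the parent node.

```python
def mse(values):
    """Mean squared error around the node mean (the variance of the node)."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def variance_reduction(parent, left, right):
    """Parent MSE minus the size-weighted MSE of the two child nodes."""
    n = len(parent)
    weighted_children = (len(left) / n) * mse(left) + (len(right) / n) * mse(right)
    return mse(parent) - weighted_children


# Hypothetical target values at a node, and one candidate split of them.
parent = [1.0, 2.0, 9.0, 10.0]
left, right = [1.0, 2.0], [9.0, 10.0]

reduction = variance_reduction(parent, left, right)
print(reduction)
```

Roughly speaking, a feature's importance then comes from accumulating these reductions over every split that uses the feature, across all trees in the forest.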

For Random Forest classification models, Gini impurity (or, alternatively, entropy) is used instead.
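For reference, both classification impurity measures can be sketched in a few lines of plain Python. This is an illustrative sketch only, not library code; the helper names `gini` and `entropy` and the toy labels are assumptions for the example.

```python
from collections import Counter
from math import log2


def gini(labels):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy (in bits) of the class proportions."""
    n = len(labels)
    return -sum((count / n) * log2(count / n) for count in Counter(labels).values())


# Hypothetical class labels at two nodes: one evenly mixed, one pure.
mixed = ["a", "a", "b", "b"]
pure = ["a", "a", "a", "a"]

print(gini(mixed), entropy(mixed))
print(gini(pure), entropy(pure))
```

Both measures are zero for a pure node and largest for an evenly mixed one, so a split is scored by how much it lowers the size-weighted impurity of the children, in the same way as variance reduction for regression.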

Best,

Stacey
