What's the difference between the DeltaCritDecisionSplit property and Gini's Diversity Index?
I'm implementing Random Forests code to select the most important predictors for my application. The TreeBagger webinar shows two ways of estimating predictor importance (DeltaCritDecisionSplit and OOBPermutedVarDeltaError). Is DeltaCritDecisionSplit the same as the Gini diversity index (used by predictorImportance)? If not, how do they differ?
Answers (2)
Ilya
on 19 Jan 2012
Yes, the DeltaCritDecisionSplit property of TreeBagger is the equivalent of the predictorImportance method for an ensemble produced by the fitensemble function. It is obtained by summing the impurity gain over all splits on a given predictor. Gini is the default impurity measure for classification trees.
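A minimal sketch of computing both importance estimates (assumes the Statistics Toolbox of that era, ca. R2011b; in later releases the property was renamed, e.g. to OOBPermutedPredictorDeltaError, and fitensemble was superseded by fitcensemble):

```matlab
% Sketch: impurity-based vs. permutation-based predictor importance
load fisheriris                       % meas: 150x4 predictors, species: class labels

% Bagged classification trees; 'oobvarimp','on' is required for the
% permutation-based measure
b = TreeBagger(100, meas, species, 'Method', 'classification', ...
               'oobvarimp', 'on');

% Impurity-based importance: Gini gain summed over all splits on each
% predictor, then averaged over trees
impGini = b.DeltaCritDecisionSplit;

% Permutation-based importance from the out-of-bag observations
impPerm = b.OOBPermutedVarDeltaError;

% The equivalent impurity-based measure for a fitensemble model
ens = fitensemble(meas, species, 'Bag', 100, 'Tree', ...
                  'Type', 'classification');
impEns = predictorImportance(ens);
```

The two vectors impGini and impEns estimate the same quantity; impPerm measures something different (loss increase when a predictor's out-of-bag values are permuted), so its values are on another scale.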
Ilya
on 19 Jan 2012
Predictor importance estimates for every tree in an ensemble are added together, and the sum is then divided by the number of trees. This means the estimates are comparable only if the two ensembles are composed of trees of roughly the same depth (that is, trees using roughly the same number of splits). Boosted trees by default use stumps (one-split trees), so many predictors may never be split on; bagged trees by default are deep, and most predictors get many splits.
In general, comparing predictor importance estimates across ensembles of different types does not produce anything useful. The estimates only tell you which predictors are important for that particular ensemble.
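A sketch of the pitfall described above, using the impurity-based measure on two ensemble types (assumes the same-era Statistics Toolbox API; dataset and learner defaults are illustrative):

```matlab
% Sketch: the same importance measure on boosted stumps vs. deep bagged trees
load ionosphere                       % X: 351x34 predictors, Y: class labels

% Boosted trees (stumps by default): only a handful of predictors are
% ever split on, so most importances come out zero
boosted  = fitensemble(X, Y, 'AdaBoostM1', 100, 'Tree');
impBoost = predictorImportance(boosted);

% Bagged deep trees: most predictors are split on many times
bagged = fitensemble(X, Y, 'Bag', 100, 'Tree', 'Type', 'classification');
impBag = predictorImportance(bagged);

% The two vectors are on different scales and rank predictors
% differently; compare importances only within a single ensemble
[~, rankBoost] = sort(impBoost, 'descend');
[~, rankBag]   = sort(impBag, 'descend');
```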