A 20min presentation on the why and how of variable selection with just a touch of feature creation.
Feature and Variable Selection in Classification
1. Feature and Variable Selection in Classification
Aaron Karper
University of Bern
2. Why?
Why not use all the features?
Interpretability
Overfitting
Computational Complexity
[Figure: training error and test error plotted against model complexity]
3. What are the options?
Ranking
Measure relevance for each feature separately.
The good: fast.
The bad: the XOR problem.
5. What are the options?
Xor problem
[Figure: the class is the XOR of two features, so each feature on its own carries no information about the class, while the pair determines it perfectly.]
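The XOR failure is easy to reproduce. Below is a minimal sketch of ranking, assuming scikit-learn and NumPy (neither is named in the slides): each feature is scored on its own with mutual information, and on XOR data both relevant features score near zero.

import numpy as np
from sklearn.feature_selection import mutual_info_classif

rng = np.random.default_rng(0)
x1 = rng.integers(0, 2, size=1000)     # binary feature 1
x2 = rng.integers(0, 2, size=1000)     # binary feature 2
y = x1 ^ x2                            # class label: XOR of the two features
noise = rng.normal(size=1000)          # an irrelevant feature, for contrast

X = np.column_stack([x1, x2, noise])
scores = mutual_info_classif(
    X, y, discrete_features=[True, True, False], random_state=0)
print(dict(zip(["x1", "x2", "noise"], scores.round(3))))
# x1 and x2 each score roughly zero: per-feature ranking cannot see that the
# pair determines the class perfectly.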
6. What are the options?
Filters
Walk the feature subset space, evaluating a proxy measure; the classifier is trained only on the final subset.
The good: flexibility.
The bad: suboptimal performance.
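A rough sketch of a filter, assuming a simple relevance-minus-redundancy proxy over subsets; the helper names merit and greedy_filter are illustrative, not taken from the slides. No classifier is trained during the search.

import numpy as np

def merit(X, y, subset):
    """Proxy score: mean |corr(feature, y)| minus mean |corr| between chosen features."""
    relevance = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if len(subset) < 2:
        return relevance
    redundancy = np.mean([abs(np.corrcoef(X[:, i], X[:, j])[0, 1])
                          for i in subset for j in subset if i < j])
    return relevance - redundancy

def greedy_filter(X, y, k):
    """Greedy forward walk in feature-subset space, guided only by the proxy."""
    chosen = []
    while len(chosen) < k:
        remaining = [j for j in range(X.shape[1]) if j not in chosen]
        chosen.append(max(remaining, key=lambda j: merit(X, y, chosen + [j])))
    return chosen

The selected subset is then handed to whatever classifier you like, which is where the flexibility comes from, and also why the result can be suboptimal for that particular classifier.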
7. What are the options?
Wrappers
Walk the feature subset space, training the classifier on each candidate subset.
The good: accuracy.
The bad: slow training.
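A wrapper can reuse the same greedy walk, but the proxy is replaced by the classifier itself. A sketch, again assuming scikit-learn; greedy_wrapper, logistic regression, and 5-fold cross-validation are illustrative choices, not prescribed by the slides.

from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def greedy_wrapper(X, y, k):
    """Greedy forward walk where every candidate subset is scored by the classifier itself."""
    chosen = []
    while len(chosen) < k:
        remaining = [j for j in range(X.shape[1]) if j not in chosen]

        def cv_score(j):
            clf = LogisticRegression(max_iter=1000)
            return cross_val_score(clf, X[:, chosen + [j]], y, cv=5).mean()

        chosen.append(max(remaining, key=cv_score))
    return chosen

The selection criterion is the classifier's own cross-validated accuracy, which is why wrappers tend to be accurate, but every candidate subset costs a full training run.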
8. What are the options?
Embedded methods
Integrate feature selection into the classifier.
The good: accuracy and training time.
The bad: lacks flexibility.
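As a concrete illustration of an embedded method, a sketch assuming scikit-learn and synthetic data: L1-regularized logistic regression drives the coefficients of unhelpful features to exactly zero while it trains, so selection and fitting happen in a single pass.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Synthetic data: 20 features, only 5 of them informative.
X, y = make_classification(n_samples=500, n_features=20, n_informative=5,
                           random_state=0)
clf = LogisticRegression(penalty="l1", solver="liblinear", C=0.1)
clf.fit(X, y)
kept = np.flatnonzero(clf.coef_[0] != 0)   # features that survived the L1 penalty
print("kept features:", kept)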
9. What should I use?
What is the best one?
Accuracy-wise: embedded or wrapper.
Complexity-wise: ranking, filters.
Why not both?
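One way to combine them, sketched with scikit-learn (an assumption, not named in the slides): a cheap univariate filter prunes most features first, and an embedded L1 model does the fine-grained selection on the survivors.

from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

X, y = make_classification(n_samples=500, n_features=200, n_informative=10,
                           random_state=0)
pipe = make_pipeline(
    SelectKBest(f_classif, k=50),                                 # cheap filter step
    LogisticRegression(penalty="l1", solver="liblinear", C=0.5),  # embedded step
)
pipe.fit(X, y)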
10. Examples
Probabilistic feature selection
For a model p(c|x) ∝ p(c) p(x|c).
Can be retrofitted with p(c) = p(M) p(c|M) for a model M.
More degrees of freedom spread the model thin.
Standard optimizations apply.
[Figure: probability assigned to possible data, for a specific model versus a widely spread model]
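The figure's argument can be spelled out with the marginal likelihood; this is the standard Bayesian Occam's razor derivation, written in my own notation rather than taken from the slides:

% Marginal likelihood of model M: the parameters are integrated out.
p(x \mid M) = \int p(x \mid \theta, M)\, p(\theta \mid M)\, d\theta,
\qquad \sum_{x} p(x \mid M) = 1.

Because the sum over all possible data is fixed at one, a model with many degrees of freedom must spread that probability mass over many datasets and so assigns less of it to the data actually observed; a more specific model concentrates its mass and wins when it happens to fit.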
12. Examples
Probabilistic feature selection
Akaike information criterion: every additional variable needs to explain e times as much data.
Bayesian information criterion: unused parameters are marginalized.
Minimum description length: prefer the feature set that gives the shortest combined encoding of model and data.
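A small sketch of applying the criteria, assuming scikit-learn and a synthetic dataset; the helper criteria() and the chosen subsets are illustrative. With AIC = 2k - 2 ln L and BIC = k ln n - 2 ln L, an extra variable only pays for itself under AIC if it raises the log-likelihood by more than 1, i.e. makes the data e times more likely, which is the rule of thumb on the slide.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=10, n_informative=3,
                           random_state=0)

def criteria(cols):
    """AIC and BIC for a logistic model fitted on the given feature columns."""
    clf = LogisticRegression(max_iter=1000).fit(X[:, cols], y)
    p = clf.predict_proba(X[:, cols])[np.arange(len(y)), y]
    log_lik = np.sum(np.log(p))
    k, n = len(cols) + 1, len(y)           # +1 for the intercept
    return 2 * k - 2 * log_lik, k * np.log(n) - 2 * log_lik

for cols in ([0, 1, 2], list(range(10))):
    print(cols, criteria(cols))
# The larger subset only wins if its likelihood gain outweighs the per-variable
# penalty; BIC demands even more than AIC as n grows.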