FeatureSelection.get_X_reduced¶

FeatureSelection.get_X_reduced()¶

Restituisce il DataFrame contenente solo le feature finali selezionate.

Seleziona dal DataFrame originale (attributo db) solo le colonne presenti nell’attributo final_feat, che è stata popolata dalla fase di selezione. Questo rappresenta il set di feature dopo la feature selection.

Returns:: Un DataFrame contenente solo le colonne delle feature finali selezionate.
Return type:: pd.DataFrame

Note

Assicurarsi di aver eseguito i metodi dei filtri desiderati prima di chiamare questo metodo per ottenere un risultato significativo.

Dati utilizzati per gli esempi:

>>> df_test_filters

	feature_B	feature_C	target	feature_A1	feature_A2	sample_col
0	34.835708	Z	75.013312	-0.270712	-0.812137	train
1	3.086785	X	109.194174	0.104848	0.314544	train
2	42.384427	Y	96.287048	0.250528	0.751583	train
3	86.151493	X	264.905765	-0.925200	-2.775600	train
4	-1.707669	V	2.880829	0.567144	1.701431	train
5	-1.706848	V	2.318509	-1.040180	-3.120541	train
6	88.960641	X	273.054387	-0.153676	-0.461028	train
7	48.371736	V	101.779140	0.789852	2.369555	train
8	-13.473719	Z	-25.266714	-1.226216	-3.678648	train
9	37.128002	Y	73.118623	-0.948007	-2.844021	train
10	-13.170885	X	69.538553	-0.569654	-1.708962	train
11	-13.286488	Z	-30.168523	-0.977150	-2.931451	train
12	22.098114	W	54.445288	-0.770632	-2.311895	train
13	-85.664012	W	-171.324610	-0.033711	-0.101134	train
14	-76.245892	X	-48.581133	-1.032859	-3.098578	train
15	-18.114376	X	59.816750	1.142427	3.427282	train
16	-40.641556	Z	-85.182377	-0.609778	-1.829334	train
17	25.712367	Y	56.834657	1.469416	4.408249	train
18	-35.401204	Y	-77.550289	1.492679	4.478037	train
19	-60.615185	Y	-123.306439	0.707125	2.121376	train

Esempio:

>>> from cefeste.selection import FeatureSelection
>>> fs = FeatureSelection(
...  db=df_test_filters,
...  target_col='target',
...  sample_col='sample_col',
...  sample_train_value='train',
...  verbose=True
... )
>>> fs.run()
>>> fs.make_report()

	feat_name	result	drop_reason
0	feature_A1	drop	unexplanatory
1	feature_A2	drop	unexplanatory
2	feature_C	keep	NaN
3	feature_B	keep	NaN

>>> fs.get_X_reduced()

	feature_C	feature_B
0	Z	34.835708
1	X	3.086785
2	Y	42.384427
3	X	86.151493
4	V	-1.707669
5	V	-1.706848
6	X	88.960641
7	V	48.371736
8	Z	-13.473719
9	Y	37.128002
10	X	-13.170885
11	Z	-13.286488
12	W	22.098114
13	W	-85.664012
14	X	-76.245892
15	X	-18.114376
16	Z	-40.641556
17	Y	25.712367
18	Y	-35.401204
19	Y	-60.615185

FeatureSelection.get_X_reduced¶

Table of Contents

Previous topic

Next topic

This Page