FeatureSelection.get_X_original

FeatureSelection.get_X_original()

Restituisce il DataFrame originale contenente solo le feature inizialmente considerate.

Seleziona dal DataFrame originale (attributo db) solo le colonne presenti nell’attributo feat_to_check, ovvero le feature che sono state incluse all’inizio del processo di analisi/selezione.

Returns:

Un DataFrame contenente solo le colonne specificate in feat_to_check.

Return type:

pd.DataFrame

Dati utilizzati per gli esempi:

>>> db_test_filters
feature_Bfeature_Cfeature_A1feature_A2
034.835708Z-0.270712-0.812137
13.086785X0.1048480.314544
242.384427Y0.2505280.751583
386.151493X-0.925200-2.775600
4-1.707669V0.5671441.701431
5-1.706848V-1.040180-3.120541
688.960641X-0.153676-0.461028
748.371736V0.7898522.369555
8-13.473719Z-1.226216-3.678648
937.128002Y-0.948007-2.844021
10-13.170885X-0.569654-1.708962
11-13.286488Z-0.977150-2.931451
1222.098114W-0.770632-2.311895
13-85.664012W-0.033711-0.101134
14-76.245892X-1.032859-3.098578
15-18.114376X1.1424273.427282
16-40.641556Z-0.609778-1.829334
1725.712367Y1.4694164.408249
18-35.401204Y1.4926794.478037
19-60.615185Y0.7071252.121376

Esempio:

>>> from cefeste.selection import FeatureSelection
>>> fs = FeatureSelection(
...    db=df_test_filters,
...    verbose=True # Utile per vedere cosa succede
... )
>>> fs.find_correlated_features()
>>> fs.make_report()
feat_name result drop_reason
0 feature_A2 drop correlated
1 feature_A1 keep NaN
2 feature_C keep NaN
3 feature_B keep NaN
>>> fs.get_X_original()
feature_Bfeature_Cfeature_A1feature_A2
034.835708Z-0.270712-0.812137
13.086785X0.1048480.314544
242.384427Y0.2505280.751583
386.151493X-0.925200-2.775600
4-1.707669V0.5671441.701431
5-1.706848V-1.040180-3.120541
688.960641X-0.153676-0.461028
748.371736V0.7898522.369555
8-13.473719Z-1.226216-3.678648
937.128002Y-0.948007-2.844021
10-13.170885X-0.569654-1.708962
11-13.286488Z-0.977150-2.931451
1222.098114W-0.770632-2.311895
13-85.664012W-0.033711-0.101134
14-76.245892X-1.032859-3.098578
15-18.114376X1.1424273.427282
16-40.641556Z-0.609778-1.829334
1725.712367Y1.4694164.408249
18-35.401204Y1.4926794.478037
19-60.615185Y0.7071252.121376