Encoding
Method | Definition |
---|---|
vDataFrame[].cut | Discretizes the vColumn using the input list. |
vDataFrame[].decode | Encodes the vColumn using a user-defined encoding. |
vDataFrame[].discretize | Discretizes the vColumn using the input method. |
vDataFrame.get_dummies | Encodes the vColumn using the One-Hot Encoding algorithm. |
vDataFrame[].get_dummies | Encodes the vColumn using the One-Hot Encoding algorithm. |
vDataFrame[].label_encode | Encodes the vColumn using a bijection from the different categories to [0, n - 1] |
vDataFrame[].mean_encode | Encode the vColumn using the average of the response partitioned by the different vcolumn categories. |
Dealing with Missing Values
Method | Definition |
---|---|
vDataFrame.dropna | Filters the vDataFrame where the input vColumns are missing. |
vDataFrame[].dropna | Filters the vDataFrame where the vColumn is missing. |
vDataFrame.fillna | Fills the vColumns missing elements using specific rules. |
vDataFrame[].fillna | Fills the vColumn missing elements using specific rules. |
vDataFrame.merge_similar_names | Merges columns with similar names. |
Normalization and Global Outliers
Method | Definition |
---|---|
vDataFrame[].clip | Clips the vColumn. |
vDataFrame[].fill_outliers | Fills the vColumns outliers using the input method. |
vDataFrame.normalize | Normalizes the input vColumns using the input method. |
vDataFrame[].normalize | Normalizes the input vColumns using the input method. |
vDataFrame.outliers | Adds a new vColumns labeled with 0 and 1. 1 means that the record is a global outlier. |
Splitting into Train/Test
Method | Definition |
---|---|
vDataFrame.train_test_split | Creates 2 vDataFrame (train/test) which can be to use to evaluate a model. |
Data Types Conversion
Method | Definition |
---|---|
vDataFrame.astype | Converts the vColumns to the input types. |
vDataFrame[].astype | Converts the vColumn to the input type. |
vDataFrame.bool_to_int | Converts all the booleans vColumns to integers. |
Renaming
Method | Definition |
---|---|
vDataFrame[].rename | Renames the vColumn. |
Working with weights
Method | Definition |
---|---|
vDataFrame.add_duplicates | Duplicates the vDataFrame using the input weight. |
Complete Disjunctive Table
Method | Definition |
---|---|
vDataFrame.cdt | Returns the complete disjunctive table of the vDataFrame. |