Loading...

Data Preparation

6 mins

Joins

Combine datasets seamlessly using various join techniques to enrich and consolidate your data.

notebooks/data_prep/joins/index.html

2 mins

Duplicates

Identify and manage duplicate records, ensuring data integrity and accuracy.

notebooks/data_prep/duplicates/index.html

5 mins

Outliers

Detect and handle outliers in your dataset for robust and reliable analyses.

notebooks/data_prep/outliers/index.html

18 mins

Missing values

Address missing data through imputation and imporve the completeness of your dataset.

notebooks/data_prep/missing_values/index.html

5 mins

Encoding

Transform categorical data into numerical formats, a crucial step for machine learning models.

notebooks/data_prep/encoding/index.html

5 mins

Normalization

Standardize numerical features to a common scale, optimizing model performance.

notebooks/data_prep/normalization/index.html

4 mins

Decomposition

Break down complex data structures into simpler components, facilitating analysis and understanding.

notebooks/data_prep/decomposition/index.html

7 mins

Feature engineering

Create new informative features to enhance the predictive power of your models.

notebooks/data_prep/features_engineering/index.html