WassupAI

WassupAI

07
Dec
Interactive SkLearn Series - Feature Discretization

Interactive SkLearn Series - Feature Discretization

Sometimes continuous data works better as categories. Learn to use KBinsDiscretizer to transform continuous features into buckets, helping linear models handle non-linear relationships.
10 min read
07
Dec
Interactive SkLearn Series - Categorical Encoding

Interactive SkLearn Series - Categorical Encoding

ML models require numbers, not strings. We’ll convert categorical data into machine-readable formats using One-Hot Encoding for nominal data and Ordinal Encoding for ranked categories.
10 min read
07
Dec
Interactive SkLearn Series - Handling Missing Values

Interactive SkLearn Series - Handling Missing Values

Real-world data is rarely clean. Move beyond dropping rows by learning to impute missing values using simple strategies like the mean, or advanced multivariate techniques like KNN imputation.
11 min read
07
Dec
Interactive SkLearn Series - Feature Scaling

Interactive SkLearn Series - Feature Scaling

Algorithms like SVM and KNN are sensitive to scale. We’ll apply Standardization and Min-Max scaling to normalize features, ensuring no single variable dominates the model due to magnitude.
7 min read
07
Dec
Interactive SkLearn Series - Data Splitting Strategies

Interactive SkLearn Series - Data Splitting Strategies

Never test on training data. Learn to use train_test_split for validation, and explore stratified splitting to maintain class balance and time-series splitting for temporal data.
10 min read
07
Dec
Interactive SkLearn Series - Built-in Datasets

Interactive SkLearn Series - Built-in Datasets

Don't spend hours finding data. We'll use sklearn.datasets to load toy datasets for quick learning and fetch larger, real-world datasets to test your models on actual problems.
9 min read
07
Dec
Interactive SkLearn Series - The Estimator API

Interactive SkLearn Series - The Estimator API

The heart of sklearn lies in fit, predict, and transform. Master these three methods to unlock the entire library, treating every algorithm as a standardized "estimator" object.
9 min read
07
Dec
Interactive SkLearn Series - Data Representation

Interactive SkLearn Series - Data Representation

ML models expect specific structures. We'll cover feature matrices (X) and target vectors (y), and how to bridge the gap between Pandas DataFrames and NumPy arrays for seamless integration.
9 min read
07
Dec
Interactive NumPy Series - Customization

Interactive NumPy Series - Customization

Extend NumPy’s capabilities. We look at structured arrays for heterogeneous data (like C structs) and how to write custom ufuncs to apply your own logic with NumPy’s native speed.
9 min read
07
Dec
Interactive NumPy Series - Advanced Indexing Techniques

Interactive NumPy Series - Advanced Indexing Techniques

Tackle complex data access with ix_ for open meshes and einsum (Einstein Summation). This powerful mini-language allows you to express complex tensor contractions efficiently.
9 min read