interactive-sklearn-series

12
Dec
Interactive SkLearn Series - Anomaly Detection

Interactive SkLearn Series - Anomaly Detection

Identify the rare and the strange. We’ll apply Isolation Forests and One-Class SVMs to detect outliers, useful for fraud detection, network security, and cleaning training data.
9 min read
12
Dec
Interactive SkLearn Series - Manifold Learning

Interactive SkLearn Series - Manifold Learning

Visualize the impossible. We’ll use t-SNE and UMAP to map high-dimensional datasets into 2D or 3D space, revealing local structures and clusters that linear methods like PCA miss.
10 min read
12
Dec
Interactive SkLearn Series - Dimensionality Reduction

Interactive SkLearn Series - Dimensionality Reduction

Simplify complex data. We’ll use PCA to project high-dimensional data into fewer components, reducing noise and speeding up training while preserving variance.
9 min read
12
Dec
Interactive SkLearn Series - Clustering

Interactive SkLearn Series - Clustering

Discover hidden groups. We’ll use K-Means for centroid-based grouping, DBSCAN to find arbitrary shapes and outliers, and Hierarchical Clustering to build taxonomies.
8 min read
12
Dec
Interactive SkLearn Series - Voting and Stacking

Interactive SkLearn Series - Voting and Stacking

Two heads are better than one. We’ll use Voting Classifiers to average predictions and Stacking to train a meta-model that learns how to best combine the outputs of diverse base models.
8 min read
12
Dec
Interactive SkLearn Series - Boosting

Interactive SkLearn Series - Boosting

Reduce bias by learning from mistakes. We’ll implement AdaBoost and Gradient Boosting, and specifically look at Histogram-based Gradient Boosting for state-of-the-art speed on large data.
9 min read
12
Dec
Interactive SkLearn Series - Bagging

Interactive SkLearn Series - Bagging

Reduce variance by combining models. We’ll explore Random Forests and ExtraTrees, learning how they aggregate predictions from many decision trees to create a robust, stable estimator.
9 min read
07
Dec
Interactive SkLearn Series - Evaluation Metrics (Classification)

Interactive SkLearn Series - Evaluation Metrics (Classification)

Accuracy isn't everything. We’ll dive into Precision, Recall, and F1-Score for imbalanced datasets, and use ROC-AUC and Confusion Matrices to fully diagnose classifier performance.
10 min read
07
Dec
Interactive SkLearn Series - Nearest Neighbors

Interactive SkLearn Series - Nearest Neighbors

Predict based on proximity. We’ll use K-Nearest Neighbors (KNN) for classification, understanding how distance metrics work and why the "curse of dimensionality" affects performance.
7 min read
07
Dec
Interactive SkLearn Series - Tree-Based Models

Interactive SkLearn Series - Tree-Based Models

Mimic human decision-making. We’ll visualize tree structures, understand splitting criteria like Gini impurity, and learn how to prune trees to prevent overfitting.
9 min read