interactive-sklearn-series - WassupAI (Page 2)

12

Dec

Interactive SkLearn Series - Anomaly Detection

Identify the rare and the strange. We’ll apply Isolation Forests and One-Class SVMs to detect outliers, useful for fraud detection, network security, and cleaning training data.

12 Dec 2025

9 min read

12

Dec

Interactive SkLearn Series - Manifold Learning

Visualize the impossible. We’ll use t-SNE and UMAP to map high-dimensional datasets into 2D or 3D space, revealing local structures and clusters that linear methods like PCA miss.

12 Dec 2025

10 min read

12

Dec

Interactive SkLearn Series - Dimensionality Reduction

Simplify complex data. We’ll use PCA to project high-dimensional data into fewer components, reducing noise and speeding up training while preserving variance.

12 Dec 2025

9 min read

12

Dec

Interactive SkLearn Series - Clustering

Discover hidden groups. We’ll use K-Means for centroid-based grouping, DBSCAN to find arbitrary shapes and outliers, and Hierarchical Clustering to build taxonomies.

12 Dec 2025

8 min read

12

Dec

Interactive SkLearn Series - Voting and Stacking

Two heads are better than one. We’ll use Voting Classifiers to average predictions and Stacking to train a meta-model that learns how to best combine the outputs of diverse base models.

12 Dec 2025

8 min read

12

Dec

Interactive SkLearn Series - Boosting

Reduce bias by learning from mistakes. We’ll implement AdaBoost and Gradient Boosting, and specifically look at Histogram-based Gradient Boosting for state-of-the-art speed on large data.

12 Dec 2025

9 min read

12

Dec

Interactive SkLearn Series - Bagging

Reduce variance by combining models. We’ll explore Random Forests and ExtraTrees, learning how they aggregate predictions from many decision trees to create a robust, stable estimator.

12 Dec 2025

9 min read

07

Dec

Interactive SkLearn Series - Evaluation Metrics (Classification)

Accuracy isn't everything. We’ll dive into Precision, Recall, and F1-Score for imbalanced datasets, and use ROC-AUC and Confusion Matrices to fully diagnose classifier performance.

07 Dec 2025

10 min read

07

Dec

Interactive SkLearn Series - Nearest Neighbors

Predict based on proximity. We’ll use K-Nearest Neighbors (KNN) for classification, understanding how distance metrics work and why the "curse of dimensionality" affects performance.

07 Dec 2025

7 min read

07

Dec

Interactive SkLearn Series - Tree-Based Models

Mimic human decision-making. We’ll visualize tree structures, understand splitting criteria like Gini impurity, and learn how to prune trees to prevent overfitting.

07 Dec 2025

9 min read