1 |
Use ColumnTransformer to apply different preprocessing to different columns |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
2 |
Seven ways to select columns using ColumnTransformer |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
3 |
What is the difference between "fit" and "transform"? |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
4 |
Use "fit_transform" on training data, but "transform" (only) on testing/new data |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
5 |
Four reasons to use scikit-learn (not pandas) for ML preprocessing |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
6 |
Encode categorical features using OneHotEncoder or OrdinalEncoder |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
7 |
Handle unknown categories with OneHotEncoder by encoding them as zeros |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
8 |
Use Pipeline to chain together multiple steps |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
9 |
Add a missing indicator to encode "missingness" as a feature |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
10 |
Set a "random_state" to make your code reproducible |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
11 |
Impute missing values using KNNImputer or IterativeImputer |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
12 |
What is the difference between Pipeline and make_pipeline ? |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
13 |
Examine the intermediate steps in a Pipeline |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
14 |
HistGradientBoostingClassifier natively supports missing values |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
15 |
Three reasons not to use drop='first' with OneHotEncoder |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
16 |
Use cross_val_score and GridSearchCV on a Pipeline |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
17 |
Try RandomizedSearchCV if GridSearchCV is taking too long |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
18 |
Display GridSearchCV or RandomizedSearchCV results in a DataFrame |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
19 |
Important tuning parameters for LogisticRegression |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
20 |
Plot a confusion matrix |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
21 |
Compare multiple ROC curves in a single plot |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
22 |
Use the correct methods for each type of Pipeline |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
23 |
Display the intercept and coefficients for a linear model |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
24 |
Visualize a decision tree two different ways |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
25 |
Prune a decision tree to avoid overfitting |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
26 |
Use stratified sampling with train_test_split |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
27 |
Two ways to impute missing values for a categorical feature |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
28 |
Save a model or Pipeline using joblib |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
29 |
Vectorize two text columns in a ColumnTransformer |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
30 |
Four ways to examine the steps of a Pipeline |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
31 |
Shuffle your dataset when using cross_val_score |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
32 |
Use AUC to evaluate multiclass problems |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
33 |
Use FunctionTransformer to convert functions into transformers |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
34 |
Add feature selection to a Pipeline |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
35 |
Don't use .values when passing a pandas object to scikit-learn |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
36 |
Most parameters should be passed as keyword arguments |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
37 |
Create an interactive diagram of a Pipeline in Jupyter |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
38 |
Get the feature names output by a ColumnTransformer |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
39 |
Load a toy dataset into a DataFrame |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
40 |
Estimators only print parameters that have been changed |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
41 |
Drop the first category from binary features (only) with OneHotEncoder |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
42 |
Passthrough some columns and drop others in a ColumnTransformer |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
43 |
Use OrdinalEncoder instead of OneHotEncoder with tree-based models |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
44 |
Speed up GridSearchCV using parallel processing |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
45 |
Create feature interactions using PolynomialFeatures |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
46 |
Ensemble multiple models using VotingClassifer or VotingRegressor |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
47 |
Tune the parameters of a VotingClassifer or VotingRegressor |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
48 |
Access part of a Pipeline using slicing |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
49 |
Tune multiple models simultaneously with GridSearchCV |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |
50 |
Adapt this pattern to solve many Machine Learning problems |
data:image/s3,"s3://crabby-images/19c9a/19c9a07d0c2d64cb1900d1f14a939716f7a2be7c" alt="Watch video" |