Classification
Machine learning with scikit-learn
()
The classification challenge
()
k-Nearest Neighbors: Fit
k-Nearest Neighbors: Predict
Measuring model performance
()
Train/test split + computing accuracy
Overfitting and underfitting
Visualizing model complexity
Regression
Introduction to regression
()
Creating features
Building a linear regression model
Visualizing a linear regression model
The basics of linear regression
()
Fit and predict for regression
Regression performance
Cross-validation
()
Cross-validation for R-squared
Analyzing cross-validation metrics
Regularized regression
()
Regularized regression: Ridge
Lasso regression for feature importance
Fine-Tuning Your Model
How good is your model?
()
Assessing a diabetes prediction classifier
Logistic regression and the ROC curve
()
Building a logistic regression model
ROC AUC
Hyperparameter tuning
()
Hyperparameter tuning with GridSearchCV
Hyperparameter tuning with RandomizedSearchCV
Preprocessing and Pipelines
Preprocessing data
()
Creating dummy variables
Regression with categorical features
Handling missing data
()
Pipeline for song genre prediction: I
Pipeline for song genre prediction: II
Centering and scaling
()
Centering and scaling for regression
Centering and scaling for classification
Evaluating multiple models
()
Visualizing regression model performance
Predicting on the test set
Visualizing classification model performance
Pipeline for predicting song popularity
Congratulations
()
advertising_and_sales_clean.csv
(175 KB)
diabetes_clean.csv
(23 KB)
telecom_churn_clean.csv
(258 KB)
music_clean.csv
(102 KB)