A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Last update: Jan 05, 2023

Overview

Machine Learning Notebooks, 3rd edition

This project aims to teach you the fundamentals of Machine Learning in Python. It contains the example code and solutions to the exercises in the third edition of my O'Reilly book, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow.

Note: If you are looking for the second edition notebooks, check out ageron/handson-ml2. For the first edition, see ageron/handson-ml.

Quick Start

Want to play with these notebooks online without having to install anything?

Use any of the following services (I recommend Colab or Kaggle, since they offer free GPUs and TPUs).

WARNING: Please be aware that these services provide temporary environments: anything you do will be deleted after a while, so make sure you download any data you care about.

Just want to quickly look at some notebooks, without executing any code?

github.com's notebook viewer also works but it's not ideal: it's slower, the math equations are not always displayed correctly, and large notebooks often fail to open.

Want to run this project using a Docker image?

Read the Docker instructions.

Want to install this project on your own machine?

Start by installing Anaconda (or Miniconda), git, and if you have a TensorFlow-compatible GPU, install the GPU driver, as well as the appropriate version of CUDA and cuDNN (see TensorFlow's documentation for more details).

Next, clone this project by opening a terminal and typing the following commands (do not type the leading $ sign on each line; it only indicates that the line is a terminal command):

$ git clone https://github.com/ageron/handson-ml3.git
$ cd handson-ml3

Next, run the following commands:

$ conda env create -f environment.yml
$ conda activate homl3
$ python -m ipykernel install --user --name=python3

Finally, start Jupyter:

$ jupyter notebook
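
Once Jupyter is running, a quick sanity check in a new notebook cell can confirm that the environment is usable. The following is a minimal sketch and not part of the project: the helper names `report_environment` and `list_gpus` and the package list are assumptions chosen for illustration. The GPU query is skipped when TensorFlow cannot be imported.

```python
import sys
from importlib import metadata


def report_environment(packages=("scikit-learn", "tensorflow", "numpy", "pandas")):
    """Return a dict mapping each package name to its installed version (or None)."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # package is not installed in this environment
    return versions


def list_gpus():
    """Return the GPUs TensorFlow can see, or None if TensorFlow is not installed."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return [device.name for device in tf.config.list_physical_devices("GPU")]


print("Python", sys.version.split()[0])
for package, version in report_environment().items():
    print(f"{package}: {version or 'not installed'}")
print("GPUs:", list_gpus())
```

An empty GPU list simply means TensorFlow will run on the CPU, which is enough for most of the notebooks, only slower.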

If you need further instructions, read the detailed installation instructions.

FAQ

Which Python version should I use?

I recommend Python 3.8. If you follow the installation instructions above, that's the version you will get. Most code will work with other versions of Python 3, but some libraries do not support Python 3.9 or 3.10 yet, which is why I recommend Python 3.8.
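
To see which version a notebook kernel is actually running, something like the following can be pasted into a cell. It is only an illustrative sketch: `check_python_version` and `RECOMMENDED` are hypothetical names, and the messages are arbitrary.

```python
import sys

RECOMMENDED = (3, 8)  # the version recommended in this FAQ entry


def check_python_version(version_info=sys.version_info, recommended=RECOMMENDED):
    """Return a short message comparing the running Python to the recommended one."""
    major, minor = version_info[0], version_info[1]
    if major < 3:
        return "unsupported: Python 3 is required"
    if (major, minor) == recommended:
        return "ok: running the recommended version"
    return (
        f"warning: running {major}.{minor}, "
        f"recommended is {recommended[0]}.{recommended[1]}"
    )


print(check_python_version())
```

A warning here does not mean the code will fail, only that some libraries may not yet support that interpreter.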

I'm getting an error when I call load_housing_data()

Make sure you call fetch_housing_data() before you call load_housing_data(). If you're getting an HTTP error, make sure you're running the exact same code as in the notebook (copy/paste it if needed). If the problem persists, please check your network configuration.
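
The ordering matters because loading only reads a file that fetching is expected to have created. The sketch below illustrates that dependency using the standard library only. It is not the notebook's code: the signatures (`url` and `data_dir` parameters), the `housing.tgz` and `housing.csv` file names, and the use of `csv` instead of pandas are all assumptions made for the example.

```python
import csv
import tarfile
import urllib.request
from pathlib import Path


def fetch_housing_data(url, data_dir):
    """Download the tarball at `url` and extract it into `data_dir`."""
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    tarball_path = data_dir / "housing.tgz"
    urllib.request.urlretrieve(url, tarball_path)
    with tarfile.open(tarball_path) as tarball:
        tarball.extractall(path=data_dir)


def load_housing_data(data_dir):
    """Read housing.csv from `data_dir`; fail clearly if it was never fetched."""
    csv_path = Path(data_dir) / "housing.csv"
    if not csv_path.is_file():
        raise FileNotFoundError(
            f"{csv_path} not found: call fetch_housing_data() first"
        )
    with open(csv_path, newline="") as f:
        return list(csv.DictReader(f))  # one dict per row, values kept as strings
```

Calling `load_housing_data()` on a directory that was never populated raises `FileNotFoundError` with a message pointing back at the missing fetch step, which is the situation this FAQ entry describes.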

I'm getting an SSL error on macOS

You probably need to install the SSL certificates (see this StackOverflow question). If you downloaded Python from the official website, then run /Applications/Python\ 3.8/Install\ Certificates.command in a terminal (change 3.8 to whatever version you installed). If you installed Python using MacPorts, run sudo port install curl-ca-bundle in a terminal.
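
Before or after installing the certificates, it can help to see where your Python installation looks for them. The snippet below is a small diagnostic sketch; `describe_ca_locations` is a hypothetical helper name, and it relies only on `ssl.get_default_verify_paths()` from the standard library.

```python
import ssl


def describe_ca_locations():
    """Return where this Python's OpenSSL looks for CA certificates."""
    paths = ssl.get_default_verify_paths()
    return {
        "cafile": paths.cafile,  # CA bundle file in use, or None if it does not exist
        "capath": paths.capath,  # CA directory in use, or None if it does not exist
        "openssl_cafile": paths.openssl_cafile,  # compiled-in default bundle path
        "openssl_capath": paths.openssl_capath,  # compiled-in default directory path
    }


for key, value in describe_ca_locations().items():
    print(f"{key}: {value}")
```

If both `cafile` and `capath` are `None`, Python has no default certificate store to verify against, which is consistent with the SSL errors described here.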

I've installed this project locally. How do I update it to the latest version?

See INSTALL.md.

How do I update my Python libraries to the latest versions, when using Anaconda?

See INSTALL.md.

Contributors

I would like to thank everyone who contributed to this project, whether by providing useful feedback, filing issues, or submitting pull requests. Special thanks go to Haesun Park and Ian Beauregard, who reviewed every notebook and submitted many PRs, including help with some of the exercise solutions. Thanks as well to Steven Bunkley and Ziembla, who created the docker directory, and to GitHub user SuperYorio, who helped with some exercise solutions.

Owner

Aurélien Geron
