Azure Cloud Advocates at Microsoft are pleased to offer a 12-week, 24-lesson curriculum all about Machine Learning

Overview

GitHub license GitHub contributors GitHub issues GitHub pull-requests PRs Welcome

GitHub watchers GitHub forks GitHub stars

Machine Learning for Beginners - A Curriculum

๐ŸŒ Travel around the world as we explore Machine Learning by means of world cultures ๐ŸŒ

Azure Cloud Advocates at Microsoft are pleased to offer a 12-week, 24-lesson curriculum all about Machine Learning. In this curriculum, you will learn about what is sometimes called classic machine learning, using primarily Scikit-learn as a library and avoiding deep learning, which is covered in our forthcoming 'AI for Beginners' curriculum. Pair these lessons with our forthcoming 'Data Science for Beginners' curriculum, as well!

Travel with us around the world as we apply these classic techniques to data from many areas of the world. Each lesson includes pre- and post-lesson quizzes, written instructions to complete the lesson, a solution, an assignment and more. Our project-based pedagogy allows you to learn while building, a proven way for new skills to 'stick'.

โœ๏ธ Hearty thanks to our authors Jen Looper, Stephen Howell, Francesca Lazzeri, Tomomi Imura, Cassie Breviu, Dmitry Soshnikov, Chris Noring, Ornella Altunyan, and Amy Boyd

๐ŸŽจ Thanks as well to our illustrators Tomomi Imura, Dasani Madipalli, and Jen Looper

๐Ÿ™ Special thanks ๐Ÿ™ to our Microsoft Student Ambassador authors, reviewers and content contributors, notably Rishit Dagli, Muhammad Sakib Khan Inan, Rohan Raj, Alexandru Petrescu, Abhishek Jaiswal, Nawrin Tabassum, Ioan Samuila, and Snigdha Agarwal


Getting Started

Students, to use this curriculum, fork the entire repo to your own GitHub account and complete the exercises on your own or with a group:

  • Start with a pre-lecture quiz
  • Read the lecture and complete the activities, pausing and reflecting at each knowledge check.
  • Try to create the projects by comprehending the lessons rather than running the solution code; however that code is available in the /solution folders in each project-oriented lesson.
  • Take the post-lecture quiz
  • Complete the challenge
  • Complete the assignment
  • After completing a lesson group, visit the Discussion board and "learn out loud" by filling out the appropriate PAT rubric. A 'PAT' is a Progress Assessment Tool that is a rubric you fill out to further your learning. You can also react to other PATs so we can learn together.

For further study, we recommend following these Microsoft Learn modules and learning paths.

Teachers, we have included some suggestions on how to use this curriculum.


Meet the Team

Promo video

๐ŸŽฅ Click the image above for a video about the project and the folks who created it!


Pedagogy

We have chosen two pedagogical tenets while building this curriculum: ensuring that it is hands-on project-based and that it includes frequent quizzes. In addition, this curriculum has a common theme to give it cohesion.

By ensuring that the content aligns with projects, the process is made more engaging for students and retention of concepts will be augmented. In addition, a low-stakes quiz before a class sets the intention of the student towards learning a topic, while a second quiz after class ensures further retention. This curriculum was designed to be flexible and fun and can be taken in whole or in part. The projects start small and become increasingly complex by the end of the 12 week cycle. This curriculum also includes a postscript on real-world applications of ML, which can be used as extra credit or as a basis for discussion.

Find our Code of Conduct, Contributing, and Translation guidelines. We welcome your constructive feedback!

Each lesson includes:

  • optional sketchnote
  • optional supplemental video
  • pre-lecture warmup quiz
  • written lesson
  • for project-based lessons, step-by-step guides on how to build the project
  • knowledge checks
  • a challenge
  • supplemental reading
  • assignment
  • post-lecture quiz

A note about quizzes: All quizzes are contained in this app, for 50 total quizzes of three questions each. They are linked from within the lessons but the quiz app can be run locally; follow the instruction in the quiz-app folder.

Lesson Number Topic Lesson Grouping Learning Objectives Linked Lesson Author
01 Introduction to machine learning Introduction Learn the basic concepts behind machine learning lesson Muhammad
02 The History of machine learning Introduction Learn the history underlying this field lesson Jen and Amy
03 Fairness and machine learning Introduction What are the important philosophical issues around fairness that students should consider when building and applying ML models? lesson Tomomi
04 Techniques for machine learning Introduction What techniques do ML researchers use to build ML models? lesson Chris and Jen
05 Introduction to regression Regression Get started with Python and Scikit-learn for regression models lesson Jen
06 North American pumpkin prices ๐ŸŽƒ Regression Visualize and clean data in preparation for ML lesson Jen
07 North American pumpkin prices ๐ŸŽƒ Regression Build linear and polynomial regression models lesson Jen
08 North American pumpkin prices ๐ŸŽƒ Regression Build a logistic regression model lesson Jen
09 A Web App ๐Ÿ”Œ Web App Build a web app to use your trained model lesson Jen
10 Introduction to classification Classification Clean, prep, and visualize your data; introduction to classification lesson Jen and Cassie
11 Delicious Asian and Indian cuisines ๐Ÿœ Classification Introduction to classifiers lesson Jen and Cassie
12 Delicious Asian and Indian cuisines ๐Ÿœ Classification More classifiers lesson Jen and Cassie
13 Delicious Asian and Indian cuisines ๐Ÿœ Classification Build a recommender web app using your model lesson Jen
14 Introduction to clustering Clustering Clean, prep, and visualize your data; Introduction to clustering lesson Jen
15 Exploring Nigerian Musical Tastes ๐ŸŽง Clustering Explore the K-Means clustering method lesson Jen
16 Introduction to natural language processing โ˜•๏ธ Natural language processing Learn the basics about NLP by building a simple bot lesson Stephen
17 Common NLP Tasks โ˜•๏ธ Natural language processing Deepen your NLP knowledge by understanding common tasks required when dealing with language structures lesson Stephen
18 Translation and sentiment analysis โ™ฅ๏ธ Natural language processing Translation and sentiment analysis with Jane Austen lesson Stephen
19 Romantic hotels of Europe โ™ฅ๏ธ Natural language processing Sentiment analysis with hotel reviews, 1 lesson Stephen
20 Romantic hotels of Europe โ™ฅ๏ธ Natural language processing Sentiment analysis with hotel reviews 2 lesson Stephen
21 Introduction to time series forecasting Time series Introduction to time series forecasting lesson Francesca
22 โšก๏ธ World Power Usage โšก๏ธ - time series forecasting with ARIMA Time series Time series forecasting with ARIMA lesson Francesca
23 Introduction to reinforcement learning Reinforcement learning Introduction to reinforcement learning with Q-Learning lesson Dmitry
24 Help Peter avoid the wolf! ๐Ÿบ Reinforcement learning Reinforcement learning Gym lesson Dmitry
Postscript Real-World ML scenarios and applications ML in the Wild Interesting and revealing real-world applications of classical ML lesson Team

Offline access

You can run this documentation offline by using Docsify. Fork this repo, install Docsify on your local machine, and then in the root folder of this repo, type docsify serve. The website will be served on port 3000 on your localhost: localhost:3000.

PDFs

Find a pdf of the curriculum with links here

Help Wanted!

Would you like to contribute a translation? Please read our translation guidelines and add input here

Other Curricula

Our team produces other curricula! Check out:

Owner
Microsoft
Open source projects and samples from Microsoft
Microsoft
A toolkit for making real world machine learning and data analysis applications in C++

dlib C++ library Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real worl

Davis E. King 11.6k Jan 02, 2023
Programming assignments and quizzes from all courses within the Machine Learning Engineering for Production (MLOps) specialization offered by deeplearning.ai

Machine Learning Engineering for Production (MLOps) Specialization on Coursera (offered by deeplearning.ai) Programming assignments from all courses i

Aman Chadha 173 Jan 05, 2023
A GitHub action that suggests type annotations for Python using machine learning.

Typilus: Suggest Python Type Annotations A GitHub action that suggests type annotations for Python using machine learning. This action makes suggestio

40 Sep 18, 2022
A Python implementation of the Robotics Toolbox for MATLAB

Robotics Toolbox for Python A Python implementation of the Robotics Toolbox for MATLABยฎ GitHub repository Documentation Wiki (examples and details) Sy

Peter Corke 1.2k Jan 07, 2023
Flask app to predict daily radiation from the time series of Solcast from Islamabad, Pakistan

Solar-radiation-ISB-MLOps - Flask app to predict daily radiation from the time series of Solcast from Islamabad, Pakistan.

Abid Ali Awan 1 Dec 31, 2021
This machine-learning algorithm takes in data from the last 60 days and tries to predict tomorrow's price of any crypto you ask it.

Crypto-Currency-Predictor This machine-learning algorithm takes in data from the last 60 days and tries to predict tomorrow's price of any crypto you

Hazim Arafa 6 Dec 04, 2022
A Python library for choreographing your machine learning research.

A Python library for choreographing your machine learning research.

AI2 270 Jan 06, 2023
PyNNDescent is a Python nearest neighbor descent for approximate nearest neighbors.

PyNNDescent PyNNDescent is a Python nearest neighbor descent for approximate nearest neighbors. It provides a python implementation of Nearest Neighbo

Leland McInnes 699 Jan 09, 2023
This is a curated list of medical data for machine learning

Medical Data for Machine Learning This is a curated list of medical data for machine learning. This list is provided for informational purposes only,

Andrew L. Beam 5.4k Dec 26, 2022
This machine learning model was developed for House Prices

This machine learning model was developed for House Prices - Advanced Regression Techniques competition in Kaggle by using several machine learning models such as Random Forest, XGBoost and LightGBM.

serhat_derya 1 Mar 02, 2022
mlpack: a scalable C++ machine learning library --

a fast, flexible machine learning library Home | Documentation | Doxygen | Community | Help | IRC Chat Download: current stable version (3.4.2) mlpack

mlpack 4.2k Jan 01, 2023
Anomaly Detection and Correlation library

luminol Overview Luminol is a light weight python library for time series data analysis. The two major functionalities it supports are anomaly detecti

LinkedIn 1.1k Jan 01, 2023
Management of exclusive GPU access for distributed machine learning workloads

TensorHive is an open source tool for managing computing resources used by multiple users across distributed hosts. It focuses on granting

Paweล‚ Roล›ciszewski 131 Dec 12, 2022
Nevergrad - A gradient-free optimization platform

Nevergrad - A gradient-free optimization platform nevergrad is a Python 3.6+ library. It can be installed with: pip install nevergrad More installati

Meta Research 3.4k Jan 08, 2023
In this Repo a simple Sklearn Model will be trained and pushed to MLFlow

SKlearn_to_MLFLow In this Repo a simple Sklearn Model will be trained and pushed to MLFlow Install This Repo is based on poetry python3 -m venv .venv

1 Dec 13, 2021
Bottleneck a collection of fast, NaN-aware NumPy array functions written in C.

Bottleneck Bottleneck is a collection of fast, NaN-aware NumPy array functions written in C. As one example, to check if a np.array has any NaNs using

Python for Data 835 Dec 27, 2022
BentoML is a flexible, high-performance framework for serving, managing, and deploying machine learning models.

Model Serving Made Easy BentoML is a flexible, high-performance framework for serving, managing, and deploying machine learning models. Supports multi

BentoML 4.4k Jan 04, 2023
Adversarial Framework for (non-) Parametric Image Stylisation Mosaics

Fully Adversarial Mosaics (FAMOS) Pytorch implementation of the paper "Copy the Old or Paint Anew? An Adversarial Framework for (non-) Parametric Imag

Zalando Research 120 Dec 24, 2022
TensorFlow implementation of an arbitrary order Factorization Machine

This is a TensorFlow implementation of an arbitrary order (=2) Factorization Machine based on paper Factorization Machines with libFM. It supports: d

Mikhail Trofimov 785 Dec 21, 2022
Mortality risk prediction for COVID-19 patients using XGBoost models

Mortality risk prediction for COVID-19 patients using XGBoost models Using demographic and lab test data received from the HM Hospitales in Spain, I b

1 Jan 19, 2022