Decision tree is the most powerful and popular tool for classification and prediction

Last update: Jan 23, 2022

Overview

Diabetes Prediction Using Decision Tree

Introduction

Decision tree is the most powerful and popular tool for classification and prediction. A Decision tree is a flowchart like tree structure, where each internal node denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node (terminal node) holds a class label.

In this project we build a decsion tree to predict diabetes for Pima Indians dataset with variables such as age, blood, pressure etc

Major Steps

Load the required libraries
Load the data sets using Pandas
Divide the columns to two types of variables dependent and independent variables
Bulding Decision Tree using scikit-learn
Evaluvating the model or classifier
Creating a visual Decision Tree

Group Members

Reference

Decision Tree Classification on Diabetes-Dataset using Python : https://medium.com/@ananya_bt18/decision-tree-classification-on-diabetes-dataset-using-python-scikit-learn-package-f7be624c344e

Decision tree is the most powerful and popular tool for classification and prediction

Related tags

Overview

Diabetes Prediction Using Decision Tree

Introduction

Major Steps

Group Members

Reference

Owner

Arjun U

A handy tool for common machine learning models' hyper-parameter tuning.

A visual dataflow programming language for sklearn

Bottleneck a collection of fast, NaN-aware NumPy array functions written in C.

SageMaker Python SDK is an open source library for training and deploying machine learning models on Amazon SageMaker.

Simple Machine Learning Tool Kit

This repository contains the code to predict house price using Linear Regression Method

GRaNDPapA: Generator of Rad Names from Decent Paper Acronyms

Data Efficient Decision Making

Kalman filter library

Implementations of Machine Learning models, Regularizers, Optimizers and different Cost functions.

neurodsp is a collection of approaches for applying digital signal processing to neural time series

Given the names and grades for each student in a class N of students, store them in a nested list and print the name(s) of any student(s) having the second lowest grade.

ml4h is a toolkit for machine learning on clinical data of all kinds including genetics, labs, imaging, clinical notes, and more

K-means clustering is a method used for clustering analysis, especially in data mining and statistics.

Customers Segmentation with RFM Scores and K-means

Send rockets to Mars with artificial intelligence(Genetic algorithm) in python.

Drug prediction

Pandas-method-chaining is a plugin for flake8 that provides method chaining linting for pandas code

XGBoost + Optuna

PennyLane is a cross-platform Python library for differentiable programming of quantum computers