Mortgage-loan-prediction - Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities

Last update: Dec 26, 2021

Overview

MORTGAGE LOAN AQUISITION REQUIREMENT

This entire project encompasses both Data Analysis and Machine Learning. It was carefully structured and compiled for easy understanding.

Installation:

To run this notebook you can either install.

Download anaconda from anaconda site this have almost all dependencies pre-installed. Feel free to use any environment of choice

Dependencies:

Personal project | Mortgage loan elegibility prediction

The Home Mortgage Disclosure Act (HMDA) requires many financial institutions to maintain, report, and publicly disclose information about mortgages. These public data are important because:

- they help show whether lenders are serving the housing needs of their communities.
- help authourities to determine and fish out all predatory act of lending.
- they give public officials information that helps them make decisions and policies.
- They shed light on lending patterns that could be discriminatory. Eg. a reported increase in mortgage borrowing by blacks and Hispanics as of 1993.

On my Kaggle site My Homepage.

Goal for this Notebook:

Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities. This is aimed for those looking to get into the field Data Science or those who are already in the field and looking to solve a real world project with python.

This Notebook will teach the following:

Data Handling

Importing Data with Pandas
Cleaning Data
Exploring Data through Visualizations with Matplotlib
Doing predictive Analysis with various Machine Learning Algorithms

Data Analysis/Machine Learning

Supervised Machine learning Techniques: + RandomForestClassifier + StratifiedKfold ( 5 folds) + ETC

Valuation of the Analysis

K-folds cross validation to valuate results locally
Output the results from the IPython Notebook to Kaggle

Results obtained

Was able to derive excerpt insights to give pro recommendation to borrowers
Was able to predict applicant loan approval with 74% accuracy

Mortgage-loan-prediction - Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities

Related tags

Overview

MORTGAGE LOAN AQUISITION REQUIREMENT

Installation:

Dependencies:

Personal project | Mortgage loan elegibility prediction

Goal for this Notebook:

This Notebook will teach the following:

Data Handling

Data Analysis/Machine Learning

Valuation of the Analysis

Results obtained

Owner

Joachim

Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.

In this project, ETL pipeline is build on data warehouse hosted on AWS Redshift.

PipeChain is a utility library for creating functional pipelines.

Shot notebooks resuming the main functions of GeoPandas

Toolchest provides APIs for scientific and bioinformatic data analysis.

Visions provides an extensible suite of tools to support common data analysis operations

A python package which can be pip installed to perform statistics and visualize binomial and gaussian distributions of the dataset

A data parser for the internal syncing data format used by Fog of World.

Datashredder is a simple data corruption engine written in python. You can corrupt anything text, images and video.

Bigdata Simulation Library Of Dream By Sandman Books

Orchest is a browser based IDE for Data Science.

Python reader for Linked Data in HDF5 files

MetPy is a collection of tools in Python for reading, visualizing and performing calculations with weather data.

Detecting Underwater Objects (DUO)

Streamz helps you build pipelines to manage continuous streams of data

Numerical Analysis toolkit centred around PDEs, for demonstration and understanding purposes not production

Repository created with LinkedIn profile analysis project done

NumPy and Pandas interface to Big Data

Flexible HDF5 saving/loading and other data science tools from the University of Chicago

Statistical Rethinking course winter 2022