Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Last update: Jan 29, 2022

Overview

Mortgage-Application-Analysis

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables: age, income level, occupancy type, accepted, and debt-income ratio, Eliminating all the demographic bias except for age We picked 5 attributes from the Mortgage data set provided and created a separate *.csv file to avoid extra data loss from the null values of the attributes which we neglect in our model. We preprocessed the data to drop any null values of the applicants which might skew our datasets using the pandas library For the processing part, we had some classification data with controlled intervals. We used Ordinal encoding to convert those into numeric discrete data for training and testing our model. We also had one, unique string data attribute, which was encoded using One-hot encoding to extract numeric values for processing. With this clean data, we divided the data into two groups, 80% for validation and 20%, and trained our model to establish a correlation between mortgage application acceptance.

Using Matlab plot, we carried out data/representation/ visualization and found out, other than debt-to-income ratio, there isn’t any significant correlation between acceptance and other non-demographic factors After this visualization to establish our hypothesis, we trained our model using the data set we created., and evaluate the model we created we applied 4 types of algorithms to test it out: We used the Logistic Regression model to create a line the best fit for log-odds values to calculate the acceptance rate for the mortgage application. The F1 score, precision score, and recall score for this testing were very high, which suggested that the non-demographic factor which we accounted for didn’t have many roles in the application being accepted or rejected. Similarly, we carried out a random forest model, Decision Tree, and Support Vector machine algorithm and each of those evaluations had really high precision, recall, and F1 score supporting the evidence from data visualization.

Create a machine learning model which will predict if the mortgage will be approved or not based on 5 variables

Related tags

Overview

Mortgage-Application-Analysis

Owner

Almost State-of-the-art Text Generation library

A sentence aligner for comparable corpora

Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding

Code for Findings of ACL 2022 Paper "Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors"

Under the hood working of transformers, fine-tuning GPT-3 models, DeBERTa, vision models, and the start of Metaverse, using a variety of NLP platforms: Hugging Face, OpenAI API, Trax, and AllenNLP

Write Alphabet, Words and Sentences with your eyes.

An open source framework for seq2seq models in PyTorch.

Blazing fast language detection using fastText model

Awesome-NLP-Research (ANLP)

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

This repository contains examples of Task-Informed Meta-Learning

Use AutoModelForSeq2SeqLM in Huggingface Transformers to train COMET

Conversational text Analysis using various NLP techniques

Sploitus - Command line search tool for sploitus.com. Think searchsploit, but with more POCs

Athena is an open-source implementation of end-to-end speech processing engine.

This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.

The Easy-to-use Dialogue Response Selection Toolkit for Researchers

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP. Democratize AI for everyone.

Modified GPT using average pooling to reduce the softmax attention memory constraints.