DeepSpamReview: Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures. Summer Internship project at CoreView Systems.

Last update: Dec 17, 2022

Overview

Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures

Dataset: https://s3.amazonaws.com/fast-ai-nlp/yelp_review_polarity_csv.tgz
https://www.kaggle.com/rtatman/deceptive-opinion-spam-corpus
The data includes 1,569,264 samples from the Yelp Dataset Challenge 2015. This subset has 280,000 training samples and 19,000 test samples in each polarity.
**Also, if you happen to refer my work, a citation would do wonders for me. Thanks! **
The following implementations are done:

Bidirectional LSTM with GLoVE 50D word embeddings
LSTM with GLoVE 100D word embeddings
LSTM with GLoVE 300D word embeddings
CNN-LSTM with Doc2Vec and TF-IDF
Attention mechanism with GLoVe 100D word embeddings
Logistic Regression
Multinomial Naive Bayes
Support Vector Machine - Stochastic Gradient Descent (SGD)

The results obtained were as follows:

Sr. No.	Model Accuracy (%)	Precision Score	Recall Score	F1 Score
1	MultinomialNB	90.25	0.9325	0.8601
2	Stochastic Gradient Descent (SGD)	87.75	0.8913	0.8497
3	Logistic Regression	87.00	0.8691	0.8601
4	Support Vector Machine	56.25	0.525	0.9792
5	Gaussian Naive Bayes	63.5	0.6424	0.6169
6	K-Nearest Neighbour	57.5	0.8604	0.1840
7	Decision tree	68.5	0.6681	0.7412

Model	Training accuracy(%)	Testing accuracy(%)
Bidirectional LSTM + GLoVe(50D)	92.17	88.13
LSTM + GLoVe(100D)	99.18	85.75
CNN + LSTM + Doc2Vec +TF-IDF	96.23	92.19
CNN + Attention + GLoVe(100D)	99.00	90.25
BiLSTM + Attention + GLoVe(100D)	99.18	89.27
CNN + BiLSTM + Attention + GLoVe(100D)	99.75	81.25
LogisticRegression + TF-IDF	99.11	87.21

Future scope includes improvement in the attention layer to increase testing accuracy. BERT and XLNet can be implemented to improve the performance further.

DeepSpamReview: Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures. Summer Internship project at CoreView Systems.

Related tags

Overview

Detection of Fake Reviews on Online Review Platforms using Deep Learning Architectures

Owner

Ashish Salunkhe

MaskTrackRCNN for video instance segmentation based on mmdetection

Repository for the electrical and ICT benchmark model developed in the ERIGrid 2.0 project.

Code Repository for Liquid Time-Constant Networks (LTCs)

🎓Automatically Update CV Papers Daily using Github Actions (Update at 12:00 UTC Every Day)

POCO: Point Convolution for Surface Reconstruction

Few-Shot Graph Learning for Molecular Property Prediction

Minimal PyTorch implementation of Generative Latent Optimization from the paper "Optimizing the Latent Space of Generative Networks"

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!

This is the implementation of the paper "Self-supervised Outdoor Scene Relighting"

The repository includes the code for training cell counting applications. (Keras + Tensorflow)

Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"

Code accompanying the paper Shared Independent Component Analysis for Multi-subject Neuroimaging

The fastai deep learning library

The Official PyTorch Implementation of "VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models" (ICLR 2021 spotlight paper)

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search, accepted by IJCAI 2021.

MILK: Machine Learning Toolkit

Simulation environments for the CrazyFlie quadrotor: Used for Reinforcement Learning and Sim-to-Real Transfer

Benchmarking Pipeline for Prediction of Protein-Protein Interactions

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018