Netflix-recommendation-system

NLP, Machine learning

About

Recommendation algorithms are at the core of the Netflix product. They provide members with personalized suggestions that reduce the time and frustration of finding great content to watch. Because recommendations matter so much, Netflix continually seeks to improve them by advancing the state of the art in the field. It does this by using data about what members watch and enjoy, along with how they interact with the service, to get better at predicting what the next great movie or TV show for each member will be.

Types

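The categories under "Trending Now" and "New Releases" come from a non-personalized recommendation system: every member sees the same suggestions.
The categories under "Because you watched" come from a personalized recommendation system: the suggestions depend on what the individual member has watched.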

NLP

Natural language processing is a subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to process and analyze large amounts of natural language data.

#1 Tokenization

Tokenization is the process of breaking sentences or paragraphs down into smaller units, typically words, called tokens.
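As a rough illustration, word-level tokenization can be sketched with a regular expression. The function name `tokenize` and the sample sentence are assumptions made for this example and are not taken from the project; real tokenizers handle contractions, hyphens, and other edge cases more carefully.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into word tokens (hypothetical helper)."""
    # \w+ matches runs of letters, digits, and underscores, so punctuation is dropped.
    return re.findall(r"\w+", text.lower())


tokens = tokenize("A group of friends face their fears.")
print(tokens)
# ['a', 'group', 'of', 'friends', 'face', 'their', 'fears']
```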

#2 Stop Words Removal

Some words, such as "and" and "am", can be removed without changing the meaning of a sentence. These are called stop words, and they should be removed before the text is fed to any algorithm. In some datasets, certain words that are not stop words also repeat very frequently. Those words should be removed as well, so that they do not bias the algorithm's results.
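Both kinds of removal can be sketched in a few lines. Everything named here is an assumption for illustration: the tiny `STOP_WORDS` set, the helper names, and the `max_doc_ratio` threshold of 0.8 are not from the project, and a real pipeline would normally use a fuller stop-word list from an NLP library.

```python
from collections import Counter

# Tiny illustrative stop-word list (assumption); real lists are much longer.
STOP_WORDS = {"a", "an", "and", "am", "the", "of", "is", "in", "to"}


def remove_stop_words(tokens, stop_words=STOP_WORDS):
    """Keep only the tokens that are not in the stop-word set."""
    return [t for t in tokens if t not in stop_words]


def remove_frequent_words(docs, max_doc_ratio=0.8):
    """Drop tokens that appear in more than max_doc_ratio of the documents."""
    # Count each token once per document, not once per occurrence.
    doc_freq = Counter(t for doc in docs for t in set(doc))
    limit = max_doc_ratio * len(docs)
    return [[t for t in doc if doc_freq[t] <= limit] for doc in docs]


print(remove_stop_words(["i", "am", "a", "fan", "of", "drama"]))
# ['i', 'fan', 'drama']

docs = [["movie", "crime"], ["movie", "comedy"], ["movie", "drama"]]
print(remove_frequent_words(docs))
# [['crime'], ['comedy'], ['drama']]
```

Here "movie" appears in every description, so it carries no information for telling titles apart and is dropped even though it is not a conventional stop word.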

#3 Vectorization

After tokenization and stop-word removal, our "content" is still in string format. We need to convert those strings to numbers based on their importance (features). We use TF-IDF vectorization to convert the text into a vector of importance weights. With TF-IDF we can extract the important words in our data: it assigns a high weight to words that occur in few documents and a very low weight to words that occur in most of them.
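To make the weighting concrete, here is a minimal pure-Python sketch of the textbook formula, tf(t, d) × log(N / df(t)). The function name `tf_idf` and the sample documents are assumptions for illustration. Library implementations such as scikit-learn's `TfidfVectorizer` apply smoothing and normalization by default, so their numbers will differ from the ones this sketch produces.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {token: weight} dict per tokenized document (hypothetical helper)."""
    n_docs = len(docs)
    # Document frequency: in how many documents each token appears.
    doc_freq = Counter(t for doc in docs for t in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            # Term frequency times inverse document frequency.
            t: (c / len(doc)) * math.log(n_docs / doc_freq[t])
            for t, c in counts.items()
        })
    return vectors


docs = [
    ["crime", "drama", "detective"],
    ["romantic", "comedy", "drama"],
    ["space", "adventure", "drama"],
]
vectors = tf_idf(docs)
print(vectors[0])
```

In this sample, "drama" appears in all three documents, so its weight is 0, while "crime" appears in only one document and gets a positive weight.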

Owner

Harshith VH
