Image Captioning using CNN, LSTM and Attention

Last update: Dec 16, 2021

Related tags

Deep Learning imagecaptioningproject

Overview

Image Captioning using CNN, LSTM and Attention

This is a deep learning model that generates a short text caption summarizing an image.

Installation

Install this project's dependencies with pip3. Use Python 3.7.

  pip3 install -r requirements.txt
  python3 app.py

These commands apply if you want to try the website on localhost.

You can also install Docker, build an image from the Dockerfile, and run it.

  docker build -f Dockerfile -t imagecaptioning:api .
  docker run -p 8080:8080 -ti imagecaptioning:api

Deployment

To deploy this project on Google Cloud App Engine, first create a project in App Engine. Install the Google Cloud SDK on your local machine so that you can push projects from it, then run the following commands.

  gcloud init
  gcloud app deploy

Choose the right project, then push the application to the cloud. This is a monolithic application, so a single Docker image is built on App Engine.

Demo

Link to demo: https://lucky-dahlia-333406.el.r.appspot.com/index

FAQ

Why is this project implemented in TensorFlow?

TensorFlow is actively maintained by Google and is convenient to deploy on a server. It automatically uses a GPU for training if one is available.

What is the BLEU score?

BLEU, or the Bilingual Evaluation Understudy, is a score for comparing a candidate translation of text to one or more reference translations. Although developed for translation, it can be used to evaluate text generated for a range of natural language processing tasks.

In this project, the BLEU score for evaluating and scoring candidate captions is computed using the NLTK library in Python.
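To make the metric concrete, here is a minimal pure-Python sketch of sentence-level BLEU: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. It is an illustration under stated assumptions, not this project's evaluation code. The function names `ngrams`, `modified_precision` and `bleu` are hypothetical, the weights are uniform, and no smoothing is applied. In practice NLTK's `sentence_bleu` from `nltk.translate.bleu_score` would normally be used instead.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(references, candidate, n):
    """Clipped n-gram precision: a candidate n-gram is credited at most
    as many times as it occurs in any single reference."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def bleu(references, candidate, max_n=4):
    """Sentence-level BLEU with uniform weights and no smoothing
    (an illustrative sketch, assumed rather than taken from the project)."""
    precisions = [modified_precision(references, candidate, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0:
        return 0.0  # without smoothing, any zero precision zeroes the score
    c = len(candidate)
    # Use the reference length closest to the candidate length.
    r = min((len(ref) for ref in references),
            key=lambda ref_len: (abs(ref_len - c), ref_len))
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)


# Hypothetical captions, tokenized by whitespace.
reference = "a dog runs across the green grass".split()
candidate = "a dog runs across the grass".split()
print(bleu([reference], candidate))
```

A candidate identical to its reference scores 1.0, and a candidate that shares no 4-gram with any reference scores 0.0, which is why smoothing is often added when scoring short captions.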

Authors

License

MIT

Owner

ASUTOSH GHANTO
