GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Last update: Nov 24, 2021

Related tags

Overview

Guidedog

Authors: Kyuhee Jo, Steven Gunarso, Jacky Wang, Raghav Sharma

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled. You may as well think of it as "speaking guide dog," as the name suggests. It has three key features based on the scene captured by your mobile phone:

Reads text upon command
Describes the scene around you upon command
Warns you if there is an obstacle in front of you

Check out this demo video to learn more about our app!

Android App

UI/UX
- Simple and Responsive
- Voice Assistant architecture for targeted audience
Libraries / APIs
- GC Speech-to-text and Text-to-Speech
- Android SDK , androidX
- ML Kit object detection and tracking api
- TensorFlow Lite MobileNet Image Classification Model

Backend

Flask API
- Image Captioning
- Optical Character Recognition
Deployment
- Google App Engine
- fast central API with different endpoints

Image Captioning

We used tensorflow to build and train model for image captioning on MS-COCO 2014 based on the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. The model uses standard convolutional network as an encoder to extract features from images (we use Inception V3) and feed the generated features into an attention-based decoder generate sentences. While the paper used LSTM model as a decoder, we use a simpler RNN instead.

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Related tags

Overview

Guidedog

Android App

Backend

Image Captioning

Get more insights : Devpost

Owner

Kyuhee Jo

clustering moroccan stocks time series data using k-means with dtw (dynamic time warping)

95.47% on CIFAR10 with PyTorch

Random Walk Graph Neural Networks

Guided Internet-delivered Cognitive Behavioral Therapy Adherence Forecasting

PyTorch implementation of CVPR'18 - Perturbative Neural Networks

An easy way to build PyTorch datasets. Modularly build datasets and automatically cache processed results

Photographic Image Synthesis with Cascaded Refinement Networks - Pytorch Implementation

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

Leibniz is a python package which provide facilities to express learnable partial differential equations with PyTorch

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

The Noise Contrastive Estimation for softmax output written in Pytorch

Framework web SnakeServer.

PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

Implementing yolov4 target detection and tracking based on nao robot

This repository comes with the paper "On the Robustness of Counterfactual Explanations to Adverse Perturbations"

Hooks for VCOCO

A general python framework for single object tracking in LiDAR point clouds, based on PyTorch Lightning.