This repository contains datasets of Uber pickups in NYC from April to September 2014 and from January to June 2015. Data analysis, visualization, and some insights are gathered here.

Last update: Nov 02, 2021

Overview

uber-pickups-analysis

Data Source: https://www.kaggle.com/fivethirtyeight/uber-pickups-in-new-york-city

Information about data set

The dataset contains, roughly, two groups of files:

● Uber trip data from 2014 (April - September), separated by month, with detailed location information.
● Uber trip data from 2015 (January - June), with less fine-grained location information.

Uber trip data from 2014

There are six files of raw data on Uber pickups in New York City from April to September 2014. The files are separated by month and each has the following columns:

● Date/Time : The date and time of the Uber pickup
● Lat : The latitude of the Uber pickup
● Lon : The longitude of the Uber pickup
● Base : The TLC base company code affiliated with the Uber pickup

These files are named:

● uber-raw-data-apr14.csv
● uber-raw-data-aug14.csv
● uber-raw-data-jul14.csv
● uber-raw-data-jun14.csv
● uber-raw-data-may14.csv
● uber-raw-data-sep14.csv
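As a rough illustration of the 2014 column layout, the sketch below parses rows with the Date/Time, Lat, Lon and Base columns and counts pickups per hour of the day. It uses only the Python standard library. The function names, the timestamp format string and the sample rows are assumptions made for this example, not taken from the project code.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Assumed timestamp layout for the 2014 files (month/day/year, 24-hour clock).
DATE_FORMAT_2014 = "%m/%d/%Y %H:%M:%S"


def read_pickups_2014(fileobj):
    """Parse rows with Date/Time, Lat, Lon, Base columns into dicts."""
    pickups = []
    for row in csv.DictReader(fileobj):
        pickups.append({
            "when": datetime.strptime(row["Date/Time"], DATE_FORMAT_2014),
            "lat": float(row["Lat"]),
            "lon": float(row["Lon"]),
            "base": row["Base"],
        })
    return pickups


def pickups_per_hour(pickups):
    """Count pickups for each hour of the day (0-23)."""
    return Counter(p["when"].hour for p in pickups)


# Made-up sample rows standing in for a real monthly file.
SAMPLE_2014 = io.StringIO(
    '"Date/Time","Lat","Lon","Base"\n'
    '4/1/2014 0:11:00,40.769,-73.9549,"B02512"\n'
    '4/1/2014 0:17:00,40.7267,-74.0345,"B02512"\n'
    '4/1/2014 17:21:00,40.7316,-73.9873,"B02512"\n'
)

pickups = read_pickups_2014(SAMPLE_2014)
hourly = pickups_per_hour(pickups)
print(sorted(hourly.items()))  # [(0, 2), (17, 1)]
```

To try this on a downloaded file, pass an open file handle instead of the sample, for example `open("uber-raw-data-apr14.csv", newline="")`.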

Uber trip data from 2015

Also included is the file uber-raw-data-janjune-15.csv. This file has the following columns:

● Dispatching_base_num : The TLC base company code of the base that dispatched the Uber
● Pickup_date : The date and time of the Uber pickup
● Affiliated_base_num : The TLC base company code affiliated with the Uber pickup
● locationID : The pickup location ID affiliated with the Uber pickup

This file is named:

● uber-raw-data-janjune-15.csv
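Because the 2015 file records a locationID rather than coordinates, a natural first step is to count pickups per location. The sketch below does this with the standard library only. The function name and the sample rows (including the timestamp layout) are invented for illustration and are not part of the project code.

```python
import csv
import io
from collections import Counter


def count_by_location(fileobj):
    """Count pickups per locationID from rows in the 2015 column layout."""
    counts = Counter()
    for row in csv.DictReader(fileobj):
        counts[int(row["locationID"])] += 1
    return counts


# Made-up sample rows standing in for uber-raw-data-janjune-15.csv.
SAMPLE_2015 = io.StringIO(
    "Dispatching_base_num,Pickup_date,Affiliated_base_num,locationID\n"
    "B02617,2015-05-17 09:47:00,B02617,141\n"
    "B02617,2015-05-17 09:47:00,B02617,65\n"
    "B02598,2015-05-17 09:47:00,B02598,141\n"
)

location_counts = count_by_location(SAMPLE_2015)
print(location_counts.most_common(1))  # [(141, 2)]
```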

Motive of the Project

The goal is to analyze customer ride data and visualize it to find insights that can help improve the business. Data analysis and visualization are important parts of data science: analysis gathers insights from the data, and visualization makes them quick to read.

How to Run the Project

To run the project, download the data from the source mentioned above, then run any of the project files.

Prerequisites

You need the following software and libraries installed on your machine before running this project.

● Python 3
● Anaconda: installs IPython Notebook and most of the required libraries, such as sklearn, pandas, seaborn, matplotlib, numpy and scipy.
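One quick way to confirm that the libraries listed above are available is to ask Python whether each one can be found. This is only a sketch: the helper name `missing_libraries` is made up for this example, and the check only tests whether each module can be located, not which version is installed.

```python
from importlib.util import find_spec

# Import names of the libraries listed in the prerequisites.
REQUIRED = ["sklearn", "pandas", "seaborn", "matplotlib", "numpy", "scipy"]


def missing_libraries(names=REQUIRED):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_libraries()
print("Missing:", ", ".join(missing) if missing else "none")
```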

Installing

● Python 3: https://www.python.org/downloads/
● Anaconda: https://www.anaconda.com/download/

Authors

KILARI JASWANTH and DEVA DEEKSHITH (https://github.com/deva025) - combined work
