it's about time

Code repository for "It's About Time: Analog clock Reading in the Wild"

Packages required: pytorch (used 1.9, any reasonable version should work), kornia (for homography), einops, scikit-learn (for RANSAC), tensorboardX (for logging)

Using pretrained model:

prediction python predict.py will predict on your data (or by default, whatever is in data/demo). This does assume the images being already cropped, we use CBNetv2. (you could instead add something like a yolov5 to the code if you prefer not installing anything extra).
evaluation python eval.py (requires dataset) should return the numbers reported in the paper

Training:

sh full_cycle.sh should do the job
if you want to do it individually, then do use
- train.py train on SynClock
- generate_pseudo_labels.py use the model to generate pseudo labels for timelapse
- train_refine.py train on SynClock+timelapse.
- The latter two can be repeated iteratively.

Dataset (Train):

SynClock is generated on the fly (via SynClock.py)
Timelapse will be uploaded later.

Dataset (Eval):

COCO and OpenImages: The .csv files in data/ contains the image ids, predicted bbox's (by CBNetV2), gt bbox's, and the manual time label. We will upload this subset later for convenience, but if you already have the respective datasets it should already work.
Clock Movies do not contain bbox's. We may not be able to release the data directly due to copyright, but the csv files do contain the image file names, and they are scraped from https://theclock.fandom.com/wiki/Special:NewFiles

Note: src/cyclic_ransac.py is adapted from the source code of scikit-learn (authored by Johannes Schönberger under BSD 3 clause license), to fit a sawtooth wave for cyclic linear data.

Coming soon (early December):

video
dataset
detection

Code repository for "It's About Time: Analog clock Reading in the Wild"

Related tags

Overview

it's about time

Owner

Scikit-learn style model finetuning for NLP

SASE : Self-Adaptive noise distribution network for Speech Enhancement with heterogeneous data of Cross-Silo Federated learning

Few-shot Natural Language Generation for Task-Oriented Dialog

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

A Practitioner's Guide to Natural Language Processing

Yodatranslator is a simple translator English to Yoda-language

Poetry PEP 517 Build Backend & Core Utilities

DeepSpeech - Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Code for Findings at EMNLP 2021 paper: "Learn Continually, Generalize Rapidly: Lifelong Knowledge Accumulation for Few-shot Learning"

The SVO-Probes Dataset for Verb Understanding

sangha, pronounced "suhng-guh", is a social networking, booking platform where students and teachers can share their practice.

Python implementation of TextRank for phrase extraction and summarization of text documents

Deploying a Text Summarization NLP use case on Docker Container Utilizing Nvidia GPU

Course project of [email protected]

ETM - R package for Topic Modelling in Embedding Spaces

VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social media, and works well on texts from other domains.

MPNet: Masked and Permuted Pre-training for Language Understanding

LSTM model - IMDB review sentiment analysis

AI and Machine Learning workflows on Anthos Bare Metal.

📝An easy-to-use package to restore punctuation of the text.