The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Last update: Dec 27, 2022

Related tags

Deep Learning bmvc2021

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

^{Most of us are not experts in specific fields, such as ornithology. Nonetheless, we do have general image and language understanding capabilities that we use to match what we see to expert resources. This allows us to expand our knowledge and perform novel tasks without ad-hoc external supervision. On the contrary, machines have a much harder time consulting expert-curated knowledge bases unless trained specifically with that knowledge in mind. Thus, in this paper we consider a new problem: fine-grained image recognition without expert annotations, which we address by leveraging the vast knowledge available in web encyclopedias. First, we learn a model to describe the visual appearance of objects using non-expert image descriptions. We then train a fine- grained textual similarity model that matches image descriptions with documents on a sentence-level basis. We evaluate the method on two datasets and compare with several strong baselines and the state of the art in cross-modal retrieval.}

Citation

@inproceedings{choudhury2021curious,
author = {Choudhury, Subhabrata and Laina, Iro and Rupprecht, Christian and Vedaldi, Andrea},
booktitle = {British Machine Vision Conference}
title = {The Curious Layperson: Fine-Grained Image Recognition without Expert Labels}
volume = {32},
year = {2021}
}

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Related tags

Overview

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Code will be relased soon.

Abstract:

Citation

Owner

Subhabrata Choudhury

Benchmark datasets, data loaders, and evaluators for graph machine learning

Parsing, analyzing, and comparing source code across many languages

Deeplearning project at The Technological University of Denmark (DTU) about Neural ODEs for finding dynamics in ordinary differential equations and real world time series data

Unofficial JAX implementations of Deep Learning models

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Code for Understanding Pooling in Graph Neural Networks

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

🚗 INGI Dakar 2K21 - Be the first one on the finish line ! 🚗

Official PyTorch implementation of "RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on" (IJCAI-ECAI 2022)

A high-level Python library for Quantum Natural Language Processing

Face detection using deep learning.

Deep universal probabilistic programming with Python and PyTorch

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Open & Efficient for Framework for Aspect-based Sentiment Analysis

PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"

Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

Improving Convolutional Networks via Attention Transfer (ICLR 2017)

Hierarchical User Intent Graph Network for Multimedia Recommendation

Code and models for "Rethinking Deep Image Prior for Denoising" (ICCV 2021)

Deployment of PyTorch chatbot with Flask