PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Last update: Jul 08, 2022

Related tags

Overview

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

[Code] [Data] [Project Page]

Official PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation, published at ICCV 2021.

Have you ever looked at a painting and wondered what is the story behind it? This work presents a framework to bring art closer to people by generating comprehensive descriptions of ﬁne-art paintings. Generating informative descriptions for artworks, however, is extremely challenging, as it requires to 1) describe multiple aspects of the image such as its style, content, or composition, and 2) provide background and contextual knowledge about the artist, their inﬂuences, or the historical period. To address these challenges, we introduce a multi-topic and knowledgeable art description framework, which modules the generated sentences according to three artistic topics and, additionally, enhances each description with external knowledge. The framework is validated through an exhaustive analysis, both quantitative and qualitative, as well as a comparative human evaluation, demonstrating outstanding results in terms of both topic diversity and information veracity.

Setup

Requirements

The code are tested under Python3.6 with the following packages:

torch==1.1.0
torchvision==0.2.2
numpy==1.16.2
visdom==0.1.8.9
transformers==2.1.1
nltk==3.2.3
stanfordcorenlp==3.9.1.1
scipy==1.3.1
pandas==0.25.1

Prepare Data

1.Download the dataset from this repository

2.Put the annotation folder into the MaskedSentenceGeneration

Masked Sentence Generation

cd MaskedSentenceGeneration
python prepare_dataset.py
bash train.sh
bash test_one.sh / bash test_all.sh

Knowledge Retrieval

Please look into here

Knowledge Filling

cd KnowledgeFilling
python create_dataset_drqa_src.py
bash train.sh
bash test.sh

Citation

If you find the data in this repository useful, please cite our paper:

@InProceedings{bai2021explain,
   author    = {Zechen Bai and Yuta Nakashima and Noa Garcia},
   title     = {Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation},
   booktitle = {International Conference in Computer Vision},
   year      = {2021},
}

PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Related tags

Overview

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

[Code] [Data] [Project Page]

Setup

Requirements

Prepare Data

Masked Sentence Generation

Knowledge Retrieval

Knowledge Filling

Citation

Owner

Zechen Bai

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

Global Filter Networks for Image Classification

CarND-LaneLines-P1 - Lane Finding Project for Self-Driving Car ND

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

A code repository associated with the paper A Benchmark for Rough Sketch Cleanup by Chuan Yan, David Vanderhaeghe, and Yotam Gingold from SIGGRAPH Asia 2020.

The hippynn python package - a modular library for atomistic machine learning with pytorch.

Romanian Automatic Speech Recognition from the ROBIN project

Heart Arrhythmia Classification

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队

joint detection and semantic segmentation, based on ultralytics/yolov5,

Churn prediction

[AAAI 2022] Separate Contrastive Learning for Organs-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

A PyTorch implementation of PointRend: Image Segmentation as Rendering

A hue shift helper for OBS

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

S2s2net - Sentinel-2 Super-Resolution Segmentation Network

Learning Energy-Based Models by Diffusion Recovery Likelihood

A disassembler for the RP2040 Programmable I/O State-machine!

PyTorch Implementation of our paper Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

Related tags

Overview

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation

[Code] [Data] [Project Page]

Setup

Requirements

Prepare Data

Masked Sentence Generation

Knowledge Retrieval

Knowledge Filling

Citation

Owner

Zechen Bai

Pytorch implementation of CVPR2021 paper "MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation"

Global Filter Networks for Image Classification

CarND-LaneLines-P1 - Lane Finding Project for Self-Driving Car ND

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

A code repository associated with the paper A Benchmark for Rough Sketch Cleanup by Chuan Yan, David Vanderhaeghe, and Yotam Gingold from SIGGRAPH Asia 2020.

The hippynn python package - a modular library for atomistic machine learning with pytorch.

Romanian Automatic Speech Recognition from the ROBIN project

Heart Arrhythmia Classification

2021搜狐校园文本匹配算法大赛 分比我们低的都是帅哥队

joint detection and semantic segmentation, based on ultralytics/yolov5,

Churn prediction

[AAAI 2022] Separate Contrastive Learning for Organs-at-Risk and Gross-Tumor-Volume Segmentation with Limited Annotation

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

A PyTorch implementation of PointRend: Image Segmentation as Rendering

A hue shift helper for OBS

This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

S2s2net - Sentinel-2 Super-Resolution Segmentation Network

Learning Energy-Based Models by Diffusion Recovery Likelihood

A disassembler for the RP2040 Programmable I/O State-machine!

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队